Forum Discussion
Altera_Forum
Honored Contributor
10 years agoHello,
A lot happens under the hood when you call clEnqueueNDRangeKernel(). We did not do detailed study of various API latencies but 2ms seems a bit high. Make sure you're using a high-resolution timer (such as getCurrentTimestamp() provided in AOCLUtils that come with all example designs) instead of just clock(). However, if you believe that you have done correct measurements then there is nothing you can do. The only suggestion I have is to allocate a large amount of work to single invocation of clEnqueueNDRangeKernel() so this overhead is amortized.