Forum Discussion
Altera_Forum
Honored Contributor
8 years ago
Loop iterations are NOT pipelined in NDRange kernels; instead, different threads (work-items) are scheduled into the same loop pipeline at runtime. The more threads you have, the more successful the runtime scheduler will be in keeping the pipeline full, which results in higher performance. Furthermore, I believe a major contributing factor to the performance difference you are seeing could be a difference in the operating frequency of the kernels.
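To make the thread-count argument concrete, here is a rough toy model in Python. It is only a sketch and not how the actual runtime scheduler is implemented. It assumes a loop pipeline with a fixed latency (`pipeline_depth`, a made-up parameter), at most one iteration issued per cycle, and that a thread cannot issue its next iteration until its previous one has left the pipeline. The function name `simulate` and the lowest-index-first scheduling policy are my own assumptions for illustration.

```python
def simulate(num_threads, iters_per_thread, pipeline_depth):
    """Toy model: return (total_cycles, utilization) of a shared loop pipeline.

    Assumptions (hypothetical, for illustration only):
      - at most one loop iteration is issued into the pipeline per cycle
      - an iteration takes `pipeline_depth` cycles to leave the pipeline
      - a thread's own iterations are NOT overlapped with each other
    """
    remaining = [iters_per_thread] * num_threads  # iterations left per thread
    ready_at = [0] * num_threads                  # earliest cycle each thread may issue
    total_iters = num_threads * iters_per_thread
    issued = 0
    cycle = 0
    last_finish = 0

    while issued < total_iters:
        # Issue from the first thread that has work left and is ready this cycle.
        for t in range(num_threads):
            if remaining[t] > 0 and ready_at[t] <= cycle:
                remaining[t] -= 1
                ready_at[t] = cycle + pipeline_depth
                last_finish = cycle + pipeline_depth
                issued += 1
                break
        cycle += 1

    # Utilization = fraction of cycles in which an iteration was issued.
    return last_finish, total_iters / last_finish


if __name__ == "__main__":
    for threads in (1, 2, 4, 8, 16):
        cycles, util = simulate(threads, iters_per_thread=100, pipeline_depth=8)
        print(f"threads={threads:2d} cycles={cycles:5d} utilization={util:.3f}")
```

In this model, utilization grows with the number of threads until there are roughly as many threads in flight as the pipeline is deep; past that point the pipeline is already close to full and extra threads stop helping. Real kernels will differ (memory stalls, barriers, operating frequency), so treat the numbers as qualitative only.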