--- Quote Start ---
Hello everyone,
I am wondering whether the AOC compiler automatically infers loop pipelining for non-task (multi-threaded) kernels as well. In addition, does the "shift register inference" optimization for task kernels also work for multi-threaded kernels?
I also have a question about inline functions in OpenCL kernels: is it better to pass variables by value or by reference? Will passing a variable by value cause the compiler to generate extra registers to hold the value, and will passing it by reference help save registers?
Thanks!
--- Quote End ---
Loop pipelining and shift register inference apply to task kernels only; the compiler does not infer either one for multi-threaded kernels.
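For reference, here is a rough sketch of the coding pattern that shift register inference usually targets in a task kernel. It is written as plain C so it compiles on its own; in a real kernel the function would carry the `__kernel` qualifier and the pointers would be `__global`. The names `window_sum` and `TAPS` are made up for illustration and do not come from any Intel example.

```c
#include <stddef.h>

#define TAPS 4  /* hypothetical window length, fixed at compile time */

/* Sliding-window sum written in shift-register style: a fixed-size
 * array that is shifted by one element per loop iteration and is only
 * accessed with indices known at compile time. */
void window_sum(const int *in, int *out, size_t n)
{
    int sr[TAPS] = {0};  /* the candidate shift register */

    for (size_t i = 0; i < n; ++i) {
        /* Shift every element by one position; the trip count is
         * constant, so this inner loop can be fully unrolled. */
        for (int k = TAPS - 1; k > 0; --k)
            sr[k] = sr[k - 1];
        sr[0] = in[i];

        /* Sum the current window contents. */
        int sum = 0;
        for (int k = 0; k < TAPS; ++k)
            sum += sr[k];
        out[i] = sum;
    }
}
```

The outer loop is the one a task kernel would have pipelined. Whether the array really ends up as a shift register depends on the compiler version and on the access pattern, so it is worth checking the optimization report.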
Passing arguments by value or by reference is expected to result in the same hardware, so passing by reference should not save any registers.
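To make the comparison concrete, here is a small sketch of the two styles, assuming "by reference" means passing a pointer (OpenCL C has no C++-style references). The helper names `scale_by_value` and `scale_by_ref` are hypothetical. It is plain C so it can be compiled and checked on a host.

```c
/* Style 1: arguments passed by value. */
static inline int scale_by_value(int x, int gain)
{
    return x * gain;
}

/* Style 2: the same computation with arguments passed by pointer. */
static inline int scale_by_ref(const int *x, const int *gain)
{
    return (*x) * (*gain);
}

/* Both calls compute the same product; once the helpers are inlined
 * there is no call boundary left, which is why the two styles are
 * expected to produce the same hardware. */
int scale_both_ways(int x, int gain)
{
    int a = scale_by_value(x, gain);
    int b = scale_by_ref(&x, &gain);
    return a == b ? a : -1;  /* -1 would signal a mismatch */
}
```

Since the two are expected to be equivalent, picking whichever style reads better is probably the sensible choice.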