How does local memory behave when used in autorun kernels in an FPGA-based OpenCL design?
I am using local memory in autorun kernels as buffers to hold data within the kernels. The autorun kernels are replicated multiple times using the attribute "__attribute__((num_compute_units(LANE_NUM)))...
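
For illustration, here is a minimal sketch of the kind of kernel described above. It assumes the Intel FPGA SDK for OpenCL and its channels extension (`cl_intel_channels`); older releases spell the channel calls `read_channel_altera` / `write_channel_altera`. The kernel name `lane_buffer`, the channels `in_ch` and `out_ch`, the constant `BUF_SIZE`, and the value chosen for `LANE_NUM` are hypothetical placeholders, not the actual design.

```c
#pragma OPENCL EXTENSION cl_intel_channels : enable

#define LANE_NUM 4    // hypothetical number of replicated compute units
#define BUF_SIZE 64   // hypothetical buffer depth

// Hypothetical channels: one input and one output per compute unit.
channel int in_ch[LANE_NUM];
channel int out_ch[LANE_NUM];

// Autorun kernels take no arguments and start without host involvement.
__attribute__((max_global_work_dim(0)))
__attribute__((autorun))
__attribute__((num_compute_units(LANE_NUM)))
__kernel void lane_buffer(void)
{
    // Index of this replica, used to select its own channels.
    const int lane = get_compute_id(0);

    // Local memory used as a buffer inside the kernel.
    __local int buf[BUF_SIZE];

    while (1) {
        // Fill the buffer from this lane's input channel.
        for (int i = 0; i < BUF_SIZE; i++)
            buf[i] = read_channel_intel(in_ch[lane]);

        // Drain the buffer to this lane's output channel.
        for (int i = 0; i < BUF_SIZE; i++)
            write_channel_intel(out_ch[lane], buf[i]);
    }
}
```

In this sketch, `get_compute_id(0)` is what lets each replica pick its own pair of channels, so the same kernel source can serve all `LANE_NUM` lanes.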