[FPGA SDK for OpenCL] Problem with setting multiple compute units

wwood10
New Contributor
7 years ago
Thanks for the help.
I tried using get_local_id()/get_group_id() in a new design (which I have attached an image of the report for), however it still performs the same.
One strange thing I have noticed is that CL_DEVICE_MAX_WORK_ITEM_SIZES returns me (0,17,52) and CL_DEVICE_MAX_WORK_GROUP_SIZE returns me 2147483647. These number seem a bit strange to me.
For context I run the kernel with clEnqueueNDRangeKernel(queue_, kernel_, 1, NULL, gSize_, wgSize_, 0, NULL, NULL); where wgSize_[3] = {WORK_ITEM_SIZE, 1, 1} and gSize_[3] = {BUFFER_SIZE, 1, 1}. I assume I do not need to enqueue a command for each work group right?
- HRZ
  Frequent Contributor
  7 years ago
  No, you don't need a separate queue for each work-group; everything is handled automatically. How many work-groups are you using? The guides recommends at least 3x more work-groups than compute units to see a reasonable performance benefit. Furthermore, if your application is memory unfriendly (e.g random memory accesses) or one compute unit already saturates the memory bandwidth, you are not going to see any performance benefit from using multiple compute units.
shubham10
New Contributor
4 years ago
Hi,

Is there any way by which we can get/request more than one physical compute unit on the underlying FPGA chip (say S10PAC)?

Thanks
- HRZ
  Frequent Contributor
  4 years ago
  What exactly are you trying to achieve by that? An FPGA design is not fixed and the underlying FPGA architecture does not have any notion of a "compute unit"; "compute unit" is simply an OpenCL terminology which doesn't necessarily map to anything meaningful on an FPGA.
  
  You can always compile and synthesize multiple kernels into one bitstream and run them in parallel in different queues, if that is what you are trying to achieve. There are also ways to automatically create/duplicate compute units in both Single Work-item and NDRange kernels.

Forum Discussion

[FPGA SDK for OpenCL] Problem with setting multiple compute units

Recent Discussions

Agilex 7 I-Series "aocl diagnose acl0" error following OFS

AI Suite - Custom model in the FPGA building process

Any date for the release of the Docker image alterafpga/fpgaaisuite-quartus-v2026.1.1?

Downloading AI Suite deb file returns text file

Is Spatial IP ready for LLM / transformer inference?