Forum Discussion

Honored Contributor

8 years ago

Wrong results when running design on hardware

Hello, My design is made of a chain of single work-item kernels transfering data using channels. It runs fine on emulation, and the FPGA binary is built correclty (95% of estimated usage). ...

Altera_Forum

Honored Contributor

8 years ago

Are you talking about num_compute_units for NDRange kernels or single work-item kernels? num_compute_units for NDRange kernels works in a fully automatic manner and does not require any user intervention other than adding the attribute to the kernel header. The compiler will automatically replicate the pipeline in this case, allowing multiple work-groups to be scheduled in parallel. This obviously comes at the cost of higher area usage and higher memory bandwidth utilization. If memory bandwidth is saturated, using num_compute_units will actually reduce performance due to extra memory contention.

Forum Discussion

Wrong results when running design on hardware

Recent Discussions

Xcelium Simulation using XRUN with Verilog Configurations defining Library Search Order

setup and hold time with set_input_delay

Quartus 26.1 type inheritance bug.

Typedef inheritance in nested modules

Possible Quartus 24.1 bug?