Forum Discussion

sraha's avatar
sraha
Icon for New Contributor rankNew Contributor
7 years ago

What might be wrong when we increase the number of computing units of the kernel (OpenCL implementation) and the execution time increases instead of decreases?!

1 Reply

  • HRZ's avatar
    HRZ
    Icon for Frequent Contributor rankFrequent Contributor

    When using Compute Unit Replication, each compute unit will have its own set of memory ports. This results in a high amount of contention on the memory bus and if there are too many ports going to the memory interface, then it is very much possible that performance will start degrading.