kkvasan
New Contributor
4 years ago
Hi Yohann,
Thanks for the reply.
I ran a few experiments, and it seems that the loop above is mapped to a stall-enable cluster.
There is a latency between the read data going through the processing kernel and returning to the pipe for the memory write.
During this latency, the entire cluster stalls on every iteration.
When I modify the processing kernel so that it just pops the data and pushes arbitrary data to the write pipe,
I get the expected performance.
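To put rough numbers on the effect, here is a toy cycle-count model in plain C++. It is only a sketch under my own assumptions, not what the compiler actually generates: `stalled_cycles`, `pipelined_cycles`, and the `latency` parameter (round-trip cycles through the processing kernel) are hypothetical names for illustration.

```cpp
#include <cstdint>

// Toy model only: it assumes one iteration can issue per cycle when nothing
// stalls, and that `latency` is the (hypothetical) round-trip time in cycles
// from the read, through the processing kernel, back to the write pipe.

// Case 1: the whole cluster waits for the round trip on every iteration,
// so each iteration costs one issue cycle plus the full latency.
std::uint64_t stalled_cycles(std::uint64_t n, std::uint64_t latency) {
    return n * (latency + 1);
}

// Case 2: iterations overlap, so the round-trip latency is paid only once
// as pipeline fill, and after that one iteration completes per cycle.
std::uint64_t pipelined_cycles(std::uint64_t n, std::uint64_t latency) {
    return n == 0 ? 0 : n + latency;
}
```

With, say, 1024 iterations and a 50-cycle round trip, this model gives 52224 cycles when every iteration stalls versus 1074 cycles when the iterations overlap, which matches the kind of gap I see between the two experiments.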
Is there a way to make the memory read cluster and the memory write cluster stall-free, as described in the following?
https://www.intel.com/content/www/us/en/develop/documentation/oneapi-fpga-optimization-guide/top/introduction-to-fpga-design-concepts/scheduling/clustering-the-datapath.html
Kind regards,
Vasan