Forum Discussion
elias94
New Contributor
3 years agoThank you for your answer.
I have a simple vectorized code where i do 4 consecutive reads and then i write them back (4 writes) in the DDR with a clock frequency of 360 MHZ and i only achieve 10.6% occupancy for the read with a bandwidth of 612MB/s and the same for the writes (i used the profiler). I have also 50% stalls .
When i lower the frequency (for example 200 MHZ, i lower the frequency by inserting shift registers) and increase vectorization for example 8 or 16 i get at most an occupancy of 25% and a bandwdith of 1600 MB/s for both reads and writes respectively. I have no stalls at all.
The trip count for my loops is the same so i cant see how the link you sent to me can help me
Best Wishes
elias94