Forum Discussion
Altera_Forum
Honored Contributor
8 years agoHi HRZ,
Thanks for your advice, I removed the initialization part and declared a variable in the accumulation loop called last_sum, which is 0 when t is 0 and the current accumulator value when t is greater than 0, and it resolves the problem. I have 2 more questions: 1. With TILE_WIDTH = 4 and TILE_HEIGHT = 4, The "output" buffer is supposed be be duplicated 16 times (I intentionally made it BRAM), but in the report it takes 32 BRAMS. Where does this 2 times more replication come from? 2. In the report, there's 16 simultaneous thread launched for the loop controlled by t, does this mean pipelining? If yes, this loop is not pipelinable, due to the data dependency on "output" buffer. Any advice would be greatly appreciated!