Forum Discussion
Altera_Forum
Honored Contributor
16 years agoSince 56 in parallel is too many resources then split the large matrix by groups of rows - enough to meet the latency requirement. For example, use 7 matrix mults each processing 8 rows, then recollect the 7 8x1 results.