Altera_Forum
Honored Contributor
12 years agoKernel Vectorization query
Hi,
I am trying to incorporate the kernel vectorization optimization I get the following compiler warning Compiler Warning: Kernel is vectorized but there exist loads/stores that cannot be vectorized. This may reduce performance. The following are the details: Global thread dimension: 240 x 540 Local Work grp dimension 240 x 1 Input dimension 1920 x 1080 I used following attributes _attribute__((num_simd_work_items(4))) __attribute__((reqd_work_group_size(240,1,1))) input loading code snippet : for(UInt32 i = 0 ; i < 8; i++) { tempin[lidx + i * 240] = input[lidx + i * 240]; } where lidx: local_work_id in x direction with max val as 239 (since Local wrg grp dim 240 x 1) tempin is a local memory buffer which is used for per workgrp computation Can anyone suggest way to avoid this warning.............? Let me know if I have to furnish any more details ...... Thanks Neelakandan