HHo2
New Contributor
7 years agoOpenCL burst Non-aligned access.
I have a OpenCL code like bellow,
the report.html says that I have burst non-aligned access to Z,
but I have read 4x16xchar = 512 bit every clock,
why this is not aligned?
typedef struct{
char ww[16];
}packed_16;
typedef struct{
packed_16 w[4];
}packed_4_16;
__kernel
void Read(
__global const packed_16* restrict Z,
int end,
){
for(int s=0 ; s<end; s++){
// some statement about a_address, b_address
//
packed_4_16 data;
#pragma unroll
for(int i=0; i<4; i++){
packed_16 Z_cache = Z[ a_address + b_address + i ];
data.w[i] = Z_cache;
}
}
}