Björne
New Contributor
2 years agoWhy does aoc set ii to 6 when I use high clock frequencies?
I have a simple toy that I want to run at 1000 MHz kernel that doesn't do much:
__attribute__((uses_global_work_offset(0)))
__attribute__((max_global_work_dim(0)))
__kernel void
netsim(
__global const volatile float * restrict gl_vm
) {
float vm[50000];
#pragma ii 1
#pragma ivdep
#pragma speculated_iterations 64
for (int i = 0; i < 50000; i++) {
vm[i] = gl_vm[i];
}
}
According to the report (see screenshot), II=6 and latency=927. Why can't the compiler lower the latency and set II to 1 here?