ContributionsMost RecentMost LikesSolutionsWhy my bitstream bring errors I'm trying to use intel sdk for OpenCL 20.3 to program de10_pro. But when I program a bitstream, the diagnose failed when reading or writing fpga. The error message is as follow: FAILED to read auto-discovery string at byte 61. Full auto-discovery string value is 20 20 81d369d2b25191e24199d2e694189ed48a41e80d s11_sh2e1_8Gx2 0 1 13 DDR 0 2 24 1 2 0 858x934592 8589934582 17179969184 0 - 0 0 0 1 97 mi(imap2_o0encl0 0`256 1 0 0 1 0 1 0 10 6 1 0 8 1 0 0 6 0 0 4 1 0 0 6 0 0 0 1 0 0 p 0 0 4 0 0 0 6 0 0 4 1 1 0 6 0 0 4 1 0 0 6 2 1 8 1024 0 3 6 2 1`8 1024 0 2 6 2 0 8 1024!0 2 6 2 1 8 1020 0 3 0 0 1 2 64 0 1 1 1`3 1 1 1 3 1 0 acl_hal_mmd.cpp:1431:assert failure: Failed to initialize kernel interfacediagnose: acl_hal_mmd.cpp:1431: int l_try_device(unsigned int, const char*, acl_system_def_t*, acl_mmd_dispatch_t*): Assertion `0' failed. The command I used to generate bitstream is: aoc -fmax 100 -v -report device/$FILE.cl -o bin/$FILE.aocx -hyper-optimized-handshaking=off Any suggestion why this bitstream get wrong? The compilation time is too long for Intel FPGA OpenCL I am trying to compile a HLS project with Intel SDK for OpenCL 20.3 on DE10 PRO. This project used to take 5~6 hours to compile on Intel SDK for OpenCL 19.4 on Arria10, but it has taken more than 17 hours of compilation now. The resource consumption in Linux is shown below. It seems that the resource used is not that much. The latest file generated in compilation is shown below. It seems that routing has been finished successfully. But after top.fit.route.rpt generated, 5 hours has been consumed with no refresh in any file. I want to know if it is usual to take such a long time in compilation and how can I reduce the compilation time in this flow. SolvedRe: I want to know how to control hyper-optimized handshaking setting I got no reason appear in the Details pane below when I click that colunm, anyway thanks for your help. Re: I want to know how to control hyper-optimized handshaking setting I'm building a platform in OpenCL with RTL embedded. The code worked well with the Arria 10 platform so I tried to used it on stratix 10. I used the code below: ulong2 a_local = __prefetching_load(&a[i]); a_x_local0[0] = a_local.x; a_y_local0[0] = (int)a_local.y; subparts_to_process = __prefetching_load(&num_subparts[i]); And I got an error of conflict between hyper-optimized handshaking and prefetch LSU when using this code, so I change __prefetching_load into normal memory access ulong2 a_local = a[i]; a_x_local0[0] = a_local.x; a_y_local0[0] = (int)a_local.y; subparts_to_process =num_subparts[i]; The error disappeared. But in the html report, it shows that hyper-optimized handshaking is still in the status "off". The kernel worked independently with no data exchange with other kernels. Why does hyper-optimized handshaking still in status "off" and do that affect the performance of my design? I want to know how to control hyper-optimized handshaking setting I just start using quartus pro 20.3 to compile on DE10-PRO. I used prefetch_load in my code and get a error as follow. Compiler Error: Prefetching LSU is not available when hyper-optimized handshaking is enabled Then I deleted prefetch_load and turn back into normal global memory access, this error disappeared. But I found than in my compile report, the hyper-optimized handshaking in Kernel Summary is off. Why I can't apply this feature after cancel prefetch_load lsu? And how can it affect my design. Solved