Forum Discussion

Altera_Forum's avatar
Altera_Forum
Icon for Honored Contributor rankHonored Contributor
10 years ago

aoc device/fft1d.cl -o bin/fft1d.aocx --board s5phq_d8 (FAILED)

Hello, everyone

I used Board bittware s5-pcie-hq d5/d8 (http://www.bittware.com/products-services-fpga-cots-hardware/s5-pcie-hq), and operated it in OS : CentOS 6.6 , Software circumstance is Quartus 15.0

and I want to execute the example source code download from https://www.altera.com/support/support-resources/design-examples/design-software/opencl/fft-1d.html

But it failed every time.

on above site I found the Boards information as followed,

fft performance

[TH="class: header-left"]Boards[/TH]

[TH]GFLOPS[/TH]

bittware s5-pcie-hq d5/d8 (http://www.bittware.com/products-services-fpga-cots-hardware/s5-pcie-hq),

nallatech pcie385-a7/d5 (http://www.nallatech.com/opencl-fpga-accelerator-cards.html)

81

then what's it exact meaning? Does it mean that If I have one of the bittware s5-pcie-hq d5/d8 (http://www.bittware.com/products-services-fpga-cots-hardware/s5-pcie-hq),

nallatech pcie385-a7/d5 (http://www.nallatech.com/opencl-fpga-accelerator-cards.html) boards , the example code could be compiled and executed successfully?

Or Does it mean that I should have all of the boards connected with my PC, then the fft1d.cl file can be compiled into fft1d.aocx successfully?

The hello_world example has been executed successfully, and the result could be expressed as following :

# ######################################################################################################

[root@hello_world]# aoc --list-boards

Board list:

s5phq_a7

s5phq_ab

s5phq_d5

s5phq_d8

[root@hello_world]# make run

./hello_world

Querying platform for info:

==========================

CL_PLATFORM_NAME = Altera SDK for OpenCL

CL_PLATFORM_VENDOR = Altera Corporation

CL_PLATFORM_VERSION = OpenCL 1.0 Altera SDK for OpenCL, Version 15.0

Querying device for info:

========================

CL_DEVICE_NAME = s5phq_d8 : S5PHQ

CL_DEVICE_VENDOR = BittWare Inc

CL_DEVICE_VENDOR_ID = 4466

CL_DEVICE_VERSION = OpenCL 1.0 Altera SDK for OpenCL, Version 15.0

CL_DRIVER_VERSION = 15.0

CL_DEVICE_ADDRESS_BITS = 64

CL_DEVICE_AVAILABLE = true

CL_DEVICE_ENDIAN_LITTLE = true

CL_DEVICE_GLOBAL_MEM_CACHE_SIZE = 32768

CL_DEVICE_GLOBAL_MEM_CACHELINE_SIZE = 0

CL_DEVICE_GLOBAL_MEM_SIZE = 8589934592

CL_DEVICE_IMAGE_SUPPORT = false

CL_DEVICE_LOCAL_MEM_SIZE = 16384

CL_DEVICE_MAX_CLOCK_FREQUENCY = 1000

CL_DEVICE_MAX_COMPUTE_UNITS = 1

CL_DEVICE_MAX_CONSTANT_ARGS = 8

CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE = 2147483648

CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS = 3

CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS = 8192

CL_DEVICE_MIN_DATA_TYPE_ALIGN_SIZE = 1024

CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR = 4

CL_DEVICE_PREFERRED_VECTOR_WIDTH_SHORT = 2

CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT = 1

CL_DEVICE_PREFERRED_VECTOR_WIDTH_LONG = 1

CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT = 1

CL_DEVICE_PREFERRED_VECTOR_WIDTH_DOUBLE = 0

Command queue out of order? = false

Command queue profiling enabled? = true

Using AOCX: hello_world.aocx

Reprogramming device with handle 1

Kernel initialization is complete.

Launching the kernel...

Thread# 2: Hello from Altera's OpenCL Compiler!

Kernel execution is complete.

# ######################################################################################################

In my case , only hello_world example can be executed successfully, but when I tried to compile the others, its all failed.

vector addition (https://www.altera.com/support/support-resources/design-examples/design-software/opencl/vector-addition.html)

*Updated in 14.1

  • OpenCL API

  • Partition a large problem across multiple devices

  • OpenCL events and event profiling

  • Getting started

This simple design example demonstrates a basic vector addition OpenCL kernel and its corresponding host program.

fft (1d) (https://www.altera.com/support/support-resources/design-examples/design-software/opencl/fft-1d.html)

“Updated in 14.1”

  • Single-precision floating-point optimizations

  • Single work-item kernel

  • Performance

This design example demonstrates a high-performance 1D radix-4 complex fast Fourier transform (FFT) or inverse fast Fourier transform (IFFT) engine using OpenCL. This example takes advantage of the efficient sliding window data reuse pattern.

executed in kernel : aoc device/vector_add.cl -o bin/vector_add.aocx --board s5phq_d8

aoc device/fft1d.cl -o bin/fft1d.aocx --fpc --no-interleaving default --board s5phq_d8

aoc device/fft1d.cl -o bin/fft1d.aocx --board s5phq_d8

Now I do need help! What can I do next?

Any advice will be welcome and appreciated!

Thanks in advance.
No RepliesBe the first to reply