Math functions (sine & cosine) latency

Question

Can anyone provide the data about the latency for sine and cosine single precision functions from math.h lib on Cyclone III, possibly Cyclone II, with floating point hardware acceleration (hardware division included)? I need it for the purpose of my engineers thesis, I would be thankful for any help.

altera_forum · Answer

The library is compiled with software floating point. So hardware FPU won't help any functions defined in math.h.  I am also seeking ways to recompile the math functions with hardware FPU support.

altera_forum · Answer

--- Quote Start ---

So hardware FPU won't help any functions defined in math.h.

--- Quote End ---

That's not true. In the Tutorial "Using Nios II Floating-Point Custom Instructions", you can find :

--- Quote Start ---

Table 1–2 indicates which math library functions use floating-point, and of those, which use floating-point division. If a function uses floating-point, it runs faster with floating-point hardware. If a function uses floating-point division, it runs even faster with floating-point division hardware.

--- Quote End ---

So the use of the floating point custom instruction affects the computation speed of the sine and cosine.

altera_forum · Answer

Thinks ....

Having written a soft-float package (in ARM assembler) it strikes me that some simple combinatorial custom instructions (possibly just 1 that uses the rB field to decide what to do) would speed up soft float somewhat!

Likely candidates:

- Extract exponent, detecting NaN and Infinity

- Extract and normalise mantissa (with and without sign)

- Count leading zeros and count leading ones

There also needs to be an easy way of doing 'add with carry' and 64bit shifts (for normalising values).

Something built that way would be significantly faster than full soft-float, but using much less fpga real estate than the current custom code.

It would also let you write 'double' (and maybe long double - 64 bit mantissa) support.

altera_forum · Answer

--- Quote Start ---

That's not true. In the Tutorial "Using Nios II Floating-Point Custom Instructions", you can find :

So the use of the floating point custom instruction affects the computation speed of the sine and cosine.

--- Quote End ---

Then how can get the math library work with hardware FPU.

I have tested sin/cos with/without hardware FPU. The performance is the same.

Is there any configeration I need to set? Thanks.

altera_forum · Answer

If you have integrated the custom FP instruction with the Nios II Processor (in the SOPC Builder), then you have no particular things to do to enable it, it is automatic.

Personnaly I have used it for a project (for the computation of the arctan function) and I have seen a difference.

Are you sure that you are using float variables and not double ?

Check the "Floating-Point Instructions" part of the Nios II Processor Reference Handbook for the details, maybe you have missed something.

Forum Discussion

Math functions (sine & cosine) latency

10 Replies

Recent Discussions

LPDDR4 not available in NIOSV/g linker script - Agilex-5, Quartus 26.1 Pro

Need a way to make firmware upgrade

Nios IDE CPU Detection

FPGA Community Enqueries

Multiple NIOS V Implementation