rppolicy, do you have any Nios II/f measurements, in cycles per operation, from your comparison tests that you could post here?
I am facing a big bottleneck in an application that uses double-precision (DP) multiplication. Analyzing it with the performance counter peripheral, I see that the Nios II/f needs about 1100 cycles (!) per DP multiplication. Is this anywhere close to what you are measuring? I tried the same options you did, and I also tried a third-party software FP library, but the cycle count stays pretty much the same. Is this number real, or am I missing something?
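To make the number easier to compare, here is a minimal sketch of the kind of measurement I mean: time a loop of dependent DP multiplications, subtract a separately measured loop overhead, and divide by the iteration count. The names `read_ticks`, `time_dp_mul` and `ticks_per_op` are made up for illustration, and `clock()` is only a portable stand-in. On the actual Nios II system you would replace `read_ticks` with a read of whatever counter you use (performance counter peripheral or timestamp timer), so treat this as an outline under those assumptions, not as the exact code I ran.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <time.h>

/* Hypothetical tick source. clock() is a portable stand-in (assumption);
   on Nios II, swap in a read of your cycle counter. */
static uint64_t read_ticks(void)
{
    return (uint64_t)clock();
}

/* Run n dependent double-precision multiplications and return the elapsed
   ticks. The volatile qualifiers keep the compiler from folding the loop
   away; the final product is written to *result_out. */
static uint64_t time_dp_mul(unsigned n, double *result_out)
{
    assert(result_out != NULL);

    volatile double factor = 1.0000001;
    volatile double acc = 1.0;

    uint64_t start = read_ticks();
    for (unsigned i = 0; i < n; i++) {
        acc = acc * factor;
    }
    uint64_t end = read_ticks();

    *result_out = acc;
    return end - start;
}

/* Average ticks per operation after removing the loop overhead.
   Returns 0.0 when n is zero or the overhead exceeds the total. */
static double ticks_per_op(uint64_t total, uint64_t overhead, unsigned n)
{
    if (n == 0 || overhead > total) {
        return 0.0;
    }
    return (double)(total - overhead) / (double)n;
}
```

Typical use would be to call `time_dp_mul` with a large `n`, measure the overhead with the same loop minus the multiplication, and pass both to `ticks_per_op`. If the overhead is not subtracted, the loads and stores forced by `volatile` are counted as part of the multiplication.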
BTW, BadOmen, which of the quoted cycle counts are you referring to?
Jesse: I do not think that adding hardware FP to the Nios II and then comparing it with an ARM that uses software FP and a hardware barrel shifter (which, by the way, the Nios II/f also has) would make things fair. I think that is exactly what would make things unfair. It would be like comparing apples to oranges, to use BadOmen's expression.