When I changed from using MegaFunctions to Verilog and used wires for the combinatorial logic, the design ran faster and used fewer resources (I think). Specifically, it was for an ALU, so it seems the ALUT implementation was more efficient. Are there any guidelines for efficiency of implementation? This is for Stratix III, which has a new LUT design. I used a continuous assignment for each ALU function, and a case statement selected which wire drives the output. Thank you

Hi SimKnutt, since you used wires and combinatorial logic, maybe you have unknowingly opted for an asynchronous design. That is much faster but hard to make safe against hazards and logic delays. If there are no registers, there are no register timing violations and no fmax limit from the register point of view.

Thanks, kaz. The rest of the design is synchronous; the ALU is just the data path between regs/RAMs. It seems like using a mux to select between add/sub, compare, and and/or/xor used more resources, but using wires put the functions in a single LUT per bit.
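
For readers following along, here is a minimal sketch of the coding style described above: one continuous assignment per ALU function, with a case statement selecting which wire drives the output. This is not the poster's actual source (that is in the attached archive). The module name, port names, width parameter, and opcode encoding are all assumptions made up for illustration.

```verilog
// Hypothetical sketch: module name, ports, WIDTH, and the opcode
// encoding are illustrative assumptions, not the poster's code.
module alu_sketch #(
    parameter WIDTH = 32
) (
    input  wire [WIDTH-1:0] a,
    input  wire [WIDTH-1:0] b,
    input  wire [2:0]       op,   // assumed encoding, see case below
    output reg  [WIDTH-1:0] y
);
    // One wire per ALU function.
    wire [WIDTH-1:0] add_w;
    wire [WIDTH-1:0] sub_w;
    wire [WIDTH-1:0] and_w;
    wire [WIDTH-1:0] or_w;
    wire [WIDTH-1:0] xor_w;
    wire [WIDTH-1:0] lt_w;

    // One continuous assignment per ALU function.
    assign add_w = a + b;
    assign sub_w = a - b;
    assign and_w = a & b;
    assign or_w  = a | b;
    assign xor_w = a ^ b;
    assign lt_w  = (a < b);   // unsigned compare, zero-extended to WIDTH

    // A case statement selects which wire drives the output.
    // Every branch assigns y, so this stays purely combinational
    // and no latch is inferred.
    always @* begin
        case (op)
            3'd0:    y = add_w;
            3'd1:    y = sub_w;
            3'd2:    y = and_w;
            3'd3:    y = or_w;
            3'd4:    y = xor_w;
            3'd5:    y = lt_w;
            default: y = {WIDTH{1'b0}};
        endcase
    end
endmodule
```

Whether this really packs into fewer LUTs than a MegaFunction-plus-mux version depends on the synthesis tool and settings, so the only reliable check is to compile both variants and compare the resource and timing reports.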

It would help if you could show your Verilog code for one such function, e.g. how you did the wiring instead of a mux for the Add case.

The project archive is attached. I hope it includes a copy of the Verilog source, because that is in a different directory. You can see the whole project as it stands. It compiles but is not ready for simulation. Thanks again

But this is a schematic-based project. What about the Verilog project?

Efficient Verilog coding style | Altera Community

11 Replies

Altera_Forum
Honored Contributor
14 years ago
Altera spent money developing Stratix III with a new "fracturable LUT". It can do any function of 6 inputs. If they do not push that, then they wasted their money.

Long comb paths are generally due to long strings of if/else in the HDL, whereas the LUT is a simple 2-port memory that has the same access time no matter how complex the function. Edge-triggered regs are used because they are all that synthesis can handle.
So the less logic there is between regs, the larger the share of each clock period that is wasted on clock skew and setup/hold times. Those who think pipelining performance is simply a matter of clock speed are sadly mistaken. If the number of clocks needed to do a function times the clock period is not less than it was before a stage was added, then there is no gain, and more power is used.
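
The clocks-times-period argument above can be written out as a rough model. This is only a sketch under simplifying assumptions: the symbols are introduced here for illustration, the combinational delay $t_{\text{logic}}$ is assumed to split evenly across $N$ stages, and $t_{\text{ov}}$ lumps together clock-to-output, setup, and skew for one register stage.

$$
T_{\text{clk}}(N) = \frac{t_{\text{logic}}}{N} + t_{\text{ov}},
\qquad
\text{latency}(N) = N \cdot T_{\text{clk}}(N) = t_{\text{logic}} + N\,t_{\text{ov}}
$$

With made-up numbers $t_{\text{logic}} = 8\,\text{ns}$ and $t_{\text{ov}} = 1\,\text{ns}$: one stage gives a 9 ns period and 9 ns latency, two stages give 5 ns and 10 ns, and four stages give 3 ns and 12 ns. In this model the clock gets faster with every added stage, but the time to finish a single operation only goes up. Any gain comes from throughput, and only if the pipeline can be kept full.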

You are so hung up on the async notion that I cannot believe it. Long before TTL and edge-triggered flip-flops, there were multiple clock pulses per machine cycle and regs were simply latches. The embedded memory blocks are somehow not truly edge triggered and yet can be synthesized, so I am using the memories with a multi-clock cycle to do what is essentially a latched data flow. That is why you don't see dedicated regs. Use whatever the technology makes available.
