This is a really late reply, sorry for that. I missed the e-mail from altera.
The reference I used was:
http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.97.9309&rep=rep1&type=pdf It takes the LFSR output and uses a Box-Muller transform on that involving sine,cos,log,sqrt etc to get a Gaussian Distribution. It looks like a very good candidate for FPGA implementation as you can get all these function implemented in a single cycle and throw in multiple parallel implementations to get a large number of samples per second.
I did complete the coding and did some signal-taps using the Quartus software to sort of check what I got out of the design. The data looks roughly gaussian (didn't do any histogram checks - just judged that the samples had a large number of small values, and a small number of large values) - but I haven't had a chance to quantify it entirely, and a couple of things I saw tells me it's not quite right. The project is shelved for now, hoping to come back later to it.
Yeah I stopped looking at terasic software. I just couldn't get anywhere with it. Maybe if I had gotten to your response earlier, I would have decided even earlier to not go on the wild goose cha
Thanks Dave,
Anand