Hi,
I’m currently working on a CUDA project that uses the MTGP random number generator.
[url=http://www.math.sci.hiroshima-u.ac.jp/~m-m...MTGP/index.html]Mersenne Twister for Graphic Processors (MTGP)[/url]
Apparently, the code is not optimized for the Fermi architecture. Has anyone tried optimizing it for Fermi? If so, any pointers on what would need to change to speed it up? Thanks.