This is just a wrapper around some code Vasily Volkov posted to this thread: http://forums.nvidia.com/index.php?showtopic=47689
The only size (used for everything) is defined by the ‘N’ at the top of the main function. sgemm.cu is included inline into main.cu.
You’ll need to chop all the '.txt’s off the filenames. The forum didn’t allow me to upload .tgz, .cu, or extension-free files.
Also, do these debugger slowdowns sound right?
I’m seeing the plain launches (./test.opt and ./test.dbg) of the SGEMM slowing down 30-80x. Launching the debug code in cuda-gdb (cuda-gdb ./test.dbg causes a 2000x slowdown from a base -O3 run.
For the SDK MonteCarlo program, I see a 28x drop by building with -g -G. Launching with the debugger gives a 2400x slowdown.
For the SDK binomialOptions program, I see a 15x drop by building with -g G. Didn’t finish running after a few minutes when launched with cuda-gdb.
sgemm.cu.txt (5.54 KB)
Makefile.txt (437 Bytes)
main.cu.txt (2.28 KB)