Need complete code example from NVIDIA CUDA lecture

I’m sure most people on this forum are familiar with the lecture videos by Prof. Wen-Mei Hwu posted on the CUDA Zone website. Does anyone have a complete, working copy of the code (the host code as well as the kernel) from Lecture #4 (Simple Matrix Multiplication) that they could post? Being something of a novice, I’ve been unable to assemble the bits and pieces shown in the video into a program that compiles.


Is that not similar to (or the same as) the example in Chapter 6 of the Programming Guide?