Help with matrix multiplication for NxN matrices in CUDA C

I don't know how to do matrix multiplication for NxN matrices in CUDA C.
I need to initialize the NxN matrices and then do the calculations on them.
Can anyone help me?

http://developer.nvidia.com/cuda-cc-sdk-code-samples#matrixMul

I have already seen it, but I didn't understand that solution.
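The SDK sample is harder to read mainly because it tiles the matrices through shared memory for speed. It may help to start from a simpler version first. Below is a minimal sketch, not the SDK's method: one thread computes one element of C, with no shared memory. It assumes square, row-major float matrices; the names matMulKernel and matMulGpu, the 16x16 block size, N = 100, and the test data (B is 2 times the identity, so C should equal 2*A) are all made up for illustration.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Naive kernel: each thread computes one element C[row][col].
// Matrices are stored row-major in flat arrays of n*n floats.
__global__ void matMulKernel(const float *A, const float *B, float *C, int n)
{
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    if (row < n && col < n) {            // guard: n need not be a multiple of the block size
        float sum = 0.0f;
        for (int k = 0; k < n; ++k)
            sum += A[row * n + k] * B[k * n + col];
        C[row * n + col] = sum;
    }
}

// Hypothetical host helper: copies A and B to the GPU, launches the kernel,
// and copies C back. Returns 0 on success, 1 on any CUDA error.
int matMulGpu(const float *hA, const float *hB, float *hC, int n)
{
    size_t bytes = (size_t)n * n * sizeof(float);
    float *dA = NULL, *dB = NULL, *dC = NULL;
    int status = 1;

    if (cudaMalloc((void **)&dA, bytes) == cudaSuccess &&
        cudaMalloc((void **)&dB, bytes) == cudaSuccess &&
        cudaMalloc((void **)&dC, bytes) == cudaSuccess &&
        cudaMemcpy(dA, hA, bytes, cudaMemcpyHostToDevice) == cudaSuccess &&
        cudaMemcpy(dB, hB, bytes, cudaMemcpyHostToDevice) == cudaSuccess) {

        dim3 block(16, 16);                              // assumed block size
        dim3 grid((n + block.x - 1) / block.x,           // round up so the grid covers all n columns
                  (n + block.y - 1) / block.y);          // and all n rows
        matMulKernel<<<grid, block>>>(dA, dB, dC, n);

        // cudaMemcpy waits for the kernel to finish before copying.
        if (cudaGetLastError() == cudaSuccess &&
            cudaMemcpy(hC, dC, bytes, cudaMemcpyDeviceToHost) == cudaSuccess)
            status = 0;
    }

    cudaFree(dA);   // cudaFree(NULL) is a harmless no-op
    cudaFree(dB);
    cudaFree(dC);
    return status;
}

int main(void)
{
    const int n = 100;                                   // assumed size, deliberately not a multiple of 16
    size_t count = (size_t)n * n;
    float *A = (float *)malloc(count * sizeof(float));
    float *B = (float *)malloc(count * sizeof(float));
    float *C = (float *)malloc(count * sizeof(float));
    if (!A || !B || !C) { fprintf(stderr, "host allocation failed\n"); return 1; }

    // Initialize on the host: A[i][j] = i + j, B = 2 * identity.
    for (int i = 0; i < n; ++i)
        for (int j = 0; j < n; ++j) {
            A[i * n + j] = (float)(i + j);
            B[i * n + j] = (i == j) ? 2.0f : 0.0f;
        }

    if (matMulGpu(A, B, C, n) != 0) { fprintf(stderr, "CUDA error\n"); return 1; }

    // Check against the known answer C = 2 * A.
    int mismatches = 0;
    for (size_t idx = 0; idx < count; ++idx)
        if (C[idx] != 2.0f * A[idx]) ++mismatches;
    printf("mismatches: %d\n", mismatches);

    free(A); free(B); free(C);
    return mismatches != 0;
}
```

Something like `nvcc matmul.cu -o matmul` should build it (the file name is arbitrary). Once this version makes sense, the SDK sample is the same loop over k, except that each block first loads a tile of A and a tile of B into shared memory so that threads reuse the values instead of reading global memory every time.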