matrix multiplication example do not work with double

I have tried to test the matrix multiplication example in SDK with data type “double” instead “float”, On emulation mode it works, but on debug not (my card is with compute capability 1.3)
can any body tell me why? thanks in advance!

Did you specifically set the architecture to compile for to 1.3? Eg -arch sm_1.3

Thank you very much!!!:D