Trying to debug matrixMul example and hit an CUDBG_ERROR_INTERNAL(0xa) error

Hi @kanweipeng, I asked the question, and fixed it for myself, in a more relevant forum here. I hope this fixes it for you!