I programmed a kernel similar to the matrix multiplication example. The resulting application sometimes works as expected and sometimes failes with the error 4 (“unspecified launch failure”). Is there an explanation for this unpredictable behaviour?
I’m using Ubuntu 7.04 and the latest nvidia drivers (100.14.19).