I coded a small piece of the application I want to write using CUDA. I based it on the matrixMul example. When I ran it, it failed at the first CUDA call, cudaMalloc(), with error 10999, “Unspecified driver error”. I am calling CUT_DEVICE_INIT() prior to the cudaMalloc call. I couldn’t figure out what was causing this, so I tried some of the SDK examples I first tried after I installed the Tesla card. They failed today, but had worked weeks ago when I first tried them. For example, the matrixMul program showed that the GPU values were all 0.0. The mersenneTwister program failed and I noticed that the Samples/sec speed was about 4x less than my recollection from when I ran it before. I then tried the bandwidthTest program. It showed that all transfer speeds are much slower than the first time I ran it. E.g. the host->device speed was 2.0 GB/s while my recollection is that it ran at 16 GB/s earlier. The device->device time is 4 GB/s while I think it used to be much higher (65 GB/s).
Any ideas about what’s going on? Or a suggestion about what to try. I’m using a Tesla C870 on a quadcore Pentium with Windows XP. The driver version is 169.21. I did power cycle the PC; which didn’t fix the problem. It looks like a hardware issue to me. Is there anything I can do that would help confirm that it’s a hardware problem?