I have a situation where I am running a lot of different CUDA based programs repeatedly over many days. This works for a while until something eventually crashes with some vague CUDA error. Once this has happened, the GPU is in a compromised state where subsequent CUDA based code does not run properly. For instance, I can run the scalarprod executable that is part of the SDK, and I will get errors ranging from 3e-3 up to INF, or even QNAN.
I’d like to think that whatever my problem is that caused the initial crash wouldn’t just linger until a reboot… Any thoughts on how to deal with this or what the problem may be? For the moment, I am less concerned with the fact that I have a single crash, and more concerned with the fact that nothing works thereafter.
My system is a 64 bit Windows 7 laptop with an NVIDIA 485M processor, running version 265.77 of the driver. I am using version 3.2 of the CUDA toolkit.
On a related note - if someone from NVIDIA is checking this out - when will a new driver for the 485M become available? At this point, I only have the original driver from Sager, the manufacturer of the laptop - the NVIDIA driver download section doesn’t even make mention of the 485M.