CUDA Vista "Display driver has stopped responding" CUDA execution time on Vista

Samer_Barakat · September 15, 2008, 8:26pm

I’m using NVIDIA Quadro FX 1700 on a Vista 32 Dell machine. The driver I’m using is 177.84 and the CUDA version is 2.0. The machine has a Xeon processor and 3GB RAM.

When I run my application in the EmuDebug mode it runs fine and it completes successfully. Trying to run the application in the release mode it crashes. Sometimes the blue screen is shown, sometimes the screen start to pixelate and sometimes only the error message appears “Display driver has stopped responding and has recovered”.

The kernel is basically doing numerical integration and Eigen value computations for every voxel in a volume with dimensions 256x160x107. Every block has 160 threads and hence there is 256x107 blocks in the grid. When reducing the grid size to a small size 5x5 the application completes successfully although the block size is still the same 160. Also if I change a condition inside the code so that each thread takes less time to execute the application completes successfully. In both tests I do not change the allocated memory so I don’t suspect the memory requirements are a reason for the crash.

I noticed that the computations crash whenever the time of execution exceed an amount of around 5 sec. I used a timer in the code to check out the computation time. I suspect that the crash is happening because the device takes long to execute the kernels which makes Vista for some reason think it is not responding. When the system detects that the device is not responding it automatically restart the device and hence the device memory is cleared.

My question: Is there any limitations on the amount of time the device needs to complete the execution of the kernels on Windows Vista?

I have attached the code.
template_kernel.txt (25 KB)
template.txt (9.48 KB)

VrahoK · September 15, 2008, 9:22pm

Ever heard of the watchdog timer? This is an auto check by the OS to test if your card is still responding and automatically resets the card after not returning from a kernel within 5 seconds, thus leaving it sometimes in undefined behaviour. Try searching the forum for it. It is often mentioned and AFAIK under Windows you can do nothing about it except split your kernel into smaller parts or use another device as primary graphics device and the quadro only for computation purposes.

Vrah

Topic		Replies	Views
Bluescreen while running CUDA kernel CUDA Programming and Performance	5	7776	July 8, 2009
CUDA and a non-responsive display driver Getting around it... CUDA Programming and Performance	1	2108	August 22, 2008
Cuda timeout and crash CUDA Programming and Performance	1	955	July 17, 2009
CUDA Display driver stopped working on Windows 7 32/64 Display driver stopped working CUDA Programming and Performance	13	192757	February 19, 2010
Display Driver Stopped responding and has recovered? CUDA Programming and Performance	7	8605	August 11, 2009
What is the solution for overcoming NVIDIA Device Driver Crash problem? CUDA Programming and Performance	2	17073	March 1, 2015
Multiple nVIDIA Display Cards (one for display and one for CUDA) CUDA Programming and Performance	1	11292	March 16, 2009
xp calculations problem 20 seconds of life CUDA Programming and Performance	0	708	January 18, 2010
Information about kernel execution time limit? large kernels blank the screen CUDA Programming and Performance	7	3591	September 18, 2008
"Display driver stopped responding and has recovered" WDDM Timeout Detection and Recovery CUDA Programming and Performance	19	160724	February 4, 2012

CUDA Vista "Display driver has stopped responding" CUDA execution time on Vista

Related topics