Kernel Interruption in Command Line Application

Scot_Halverson · July 13, 2011, 2:39pm

Hi All,

I’m having a problem with my CUDA based command line application. The kernel is by far the most time-consuming portion of the application, accounting for ~97% of the applications run time. This is also a fairly long running application. Depending on the data on which the application is executing, it makes several thousand calls to the kernel. This whole process takes on the order of 1 to 2 minutes. On occasion, particularly during testing, its desirable to stop the application before it completes its computations. Normally, this would be done by clicking the ‘Close’ button on the command prompt window. However, it seems that if that is done while a kernel call is being executed, the whole computer locks up, then crashes, resulting in a restart. I’ve implemented a second thread which monitors user input, waiting for a ‘q’ character, which then safely stops the computations. However, I still catch myself going for the close button on occasion, and I can’t imagine users of the application will be any different.

First off, am I doing something wrong? Is this normally how CUDA works? Secondly, is there some way I can interrupt the kernel gracefully? Or perhaps some means of setting the ‘close’ button behavior?

Some relevant information:

OS: WIN 7 64 bit
CUDA version: 4.0
binary: 32bit windows command line
GPU: GTX285

Thanks,

-Scot

Scot_Halverson · July 15, 2011, 5:27pm

Hi All,

I’m having a problem with my CUDA based command line application. The kernel is by far the most time-consuming portion of the application, accounting for ~97% of the applications run time. This is also a fairly long running application. Depending on the data on which the application is executing, it makes several thousand calls to the kernel. This whole process takes on the order of 1 to 2 minutes. On occasion, particularly during testing, its desirable to stop the application before it completes its computations. Normally, this would be done by clicking the ‘Close’ button on the command prompt window. However, it seems that if that is done while a kernel call is being executed, the whole computer locks up, then crashes, resulting in a restart. I’ve implemented a second thread which monitors user input, waiting for a ‘q’ character, which then safely stops the computations. However, I still catch myself going for the close button on occasion, and I can’t imagine users of the application will be any different.

First off, am I doing something wrong? Is this normally how CUDA works? Secondly, is there some way I can interrupt the kernel gracefully? Or perhaps some means of setting the ‘close’ button behavior?

Some relevant information:

OS: WIN 7 64 bit

CUDA version: 4.0

binary: 32bit windows command line

GPU: GTX285

Thanks,

-Scot

I’ve actually solved this on my own. For anyone who might be interested, the solution was essentially to reduce the number of threads launched for a given kernel instantiation. To make up for this, I launched the same kernel a number of times, with an offset parameter to account for the problems with unique thread identifiers.

The problem was that Windows 7 doesn’t give the program the ability to handle a close event like XP and prior versions. Instead, it gives the program a short period of time, and if it hasn’t reached a ‘clean’ stop state, it just kills the process in the middle of whatever it was doing. In a normal program, this doesn’t have much in the way of consequences outside of any data the program is operating on. With CUDA however, if a kernel is being executed and goes beyond this short period of time, serious problems surface. I’m guessing at this point, but I would imagine that any host-side processes are killed by Windows, but the kernel is still executing. Once it finishes, it no longer has any process to return to, as Windows has killed it. This probably results in a device error. In my case, and I imagine most people’s cases, the CUDA device is also the display adapter. Apparently this issue is significant enough to bring down the entire device, ultimately resulting in the whole computer becoming unresponsive.

In any case, the issue is fixed.

Topic		Replies	Views
CUDA kernel timeout CUDA Programming and Performance	12	58755	December 22, 2022
CUDA Timeout? CUDA Programming and Performance	7	27688	December 19, 2011
CUDA thread in background? CUDA Programming and Performance	10	15994	February 19, 2010
Effect of CUDA on primary display device Slow does of desktop with some code CUDA Programming and Performance	3	4794	June 9, 2009
CUDA limit for loops..? too large number of iterations? CUDA Programming and Performance	28	27376	March 20, 2008
Inexpiable CUDA hang (NOT WDM timeout!) CUDA Programming and Performance	2	1477	June 5, 2014
The Cuda 5 Second execution-time limit Finding a the way to work around the GDI timeout CUDA Programming and Performance	24	12717	July 26, 2010
Kernel problem, execution stop after ~15min CUDA Programming and Performance	7	1781	November 4, 2016
How to use second card with Cuda? CUDA Programming and Performance	15	3520	October 21, 2010
Performance leakage due excessive API times CUDA Programming and Performance	5	654	May 24, 2019

Kernel Interruption in Command Line Application

Related topics