Is the 5-second limitation permanent?

Is the 5-second limit on Windows a permanent restriction?
Or do you plan to fix this problem in a future update?

Because of this restriction, we would need to replace our motherboard with one that has multiple PCIe slots.

Thank you.

Unfortunately the watchdog timer is a feature of Windows and isn’t going away.

The restriction doesn’t exist on Linux systems.

Is that true? Or will it be fixed in the next release? I’ve been experiencing this problem on RHEL4 since I began working with CUDA in February.

My mistake; it turns out there is actually a separate watchdog timer in the Linux graphics driver that limits the maximum execution time.

We are working on removing this restriction for a future release.

Excellent. Thanks for clearing that up. I’m very pleased at the progress that CUDA is making. It’s amazing the amount of power in these cards, and it’s almost a miracle that we can employ even half of it!

Are there any hard numbers on what the maximum execution time is on Linux? I’m running Fedora 6, and ~10 seconds seems to be the limit I’m running into.

On RHEL4E3 I see around 7.5 seconds, FWIW. Creating a workaround should be easy for many problems, making the time limit merely an inconvenience.

Perhaps I’m missing what you’re thinking of as a workaround, but I don’t understand how a workaround could be easy, or even possible. The execution time of a given kernel is probably a function of the size of the input data (for my kernel I’m summing N terms at each point on a 2D grid) in addition to the specs of the graphics card, the driver version, and the current state of the system (is the card being used by some other program?). How could one write code that reliably stops the kernel before the crash occurs and restarts it, and that still works on more than one particular card? Even setting that aside and keeping the card fixed, in my case (which I think is rather straightforward) it’s difficult to determine the kernel’s execution time over the 2D parameter space of the number of terms and the area of the grid.

Personally, I’m looking forward to the Linux fix, and I’ll just let the Windows version of my code crash :D

Some problems have a trivial workaround. Could you, instead of summing the terms over every point on the grid, just have the kernel sum the terms over a smaller chunk of the grid? Or sum only the first M terms? Then put the kernel call in a loop and run it as many times as are necessary to complete the task, keeping each kernel call under 5-7 seconds.

It appears to me that highly data-parallel problems (such as those that run well in CUDA) should be able to be broken apart in this fashion. I only have limited experience with GPU programming, though, and I don’t fully understand what task you are trying to do.
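Something like the sketch below is what I have in mind. The kernel name, the placeholder term being summed, and the chunk sizes are all made up for illustration; termsPerLaunch would have to be tuned so that each launch stays comfortably under the 5-7 second limit on your particular card.

#include <cuda_runtime.h>
#include <cstdio>

// Hypothetical kernel: adds terms [firstTerm, firstTerm + numTerms) into each
// grid point's running sum. Each launch covers only a slice of the N terms,
// so it finishes well inside the watchdog limit.
__global__ void sumTermsChunk(float *partialSums, int gridPoints,
                              int firstTerm, int numTerms)
{
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    if (idx >= gridPoints)
        return;

    float acc = partialSums[idx];
    for (int t = firstTerm; t < firstTerm + numTerms; ++t) {
        // Placeholder term; the real series for your problem would go here.
        acc += 1.0f / ((float)(t + 1) * (float)(idx + 1));
    }
    partialSums[idx] = acc;
}

int main()
{
    const int gridPoints     = 1 << 20;  // 2D grid flattened to 1D
    const int totalTerms     = 1 << 16;  // N terms to sum at each point
    const int termsPerLaunch = 1 << 12;  // tune so one launch stays well under 5 s

    float *dSums = 0;
    cudaMalloc((void **)&dSums, gridPoints * sizeof(float));
    cudaMemset(dSums, 0, gridPoints * sizeof(float));

    dim3 block(256);
    dim3 grid((gridPoints + block.x - 1) / block.x);

    // Loop over chunks of terms; each kernel call is short, so the watchdog
    // never fires, and the running sums accumulate across launches.
    for (int first = 0; first < totalTerms; first += termsPerLaunch) {
        int n = (totalTerms - first < termsPerLaunch) ? (totalTerms - first)
                                                      : termsPerLaunch;
        sumTermsChunk<<<grid, block>>>(dSums, gridPoints, first, n);
        cudaDeviceSynchronize();  // wait for this chunk before launching the next
    }

    // ... copy dSums back to the host and use the results ...

    cudaFree(dSums);
    return 0;
}

The chunk size is the only knob to get right, and erring on the small side just costs a little extra launch overhead.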

Simon,

Under Windows XP, there are two manifestations of the 5-second limit: the documented one where, if the card is the primary display adapter, the machine hangs after 5 seconds; and the “undocumented” one where, if the card is not the primary display adapter, a kernel will happily run for more than 5 seconds, but then returns with an “unspecified launch failure.”
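For what it’s worth, the second case shows up in my code roughly like the sketch below; the kernel is a made-up stand-in that simply spins past the limit, and the exact message may vary with the driver version.

#include <cuda_runtime.h>
#include <cstdio>

// Stand-in kernel that deliberately runs past the watchdog limit by spinning
// on a flag the host never sets. Purely illustrative; it does no useful work.
__global__ void longRunningKernel(volatile int *flag)
{
    while (*flag == 0) { }
}

int main()
{
    int *dFlag = 0;
    cudaMalloc((void **)&dFlag, sizeof(int));
    cudaMemset(dFlag, 0, sizeof(int));

    longRunningKernel<<<1, 1>>>(dFlag);

    // The launch itself appears to succeed; the failure is only reported at
    // the next synchronization point, after the watchdog has killed the kernel.
    cudaError_t err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        // Here I see "unspecified launch failure"; other driver versions may
        // report a launch-timeout error instead.
        fprintf(stderr, "Kernel failed: %s\n", cudaGetErrorString(err));
    }

    cudaFree(dFlag);
    return 0;
}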

Are these both due to the same Windows watchdog timer? And, if so, does this mean under Windows a CUDA kernel will never be able to run longer than 5 seconds?

Workarounds aren’t available in all cases, and we’re restricted to Windows machines, so this could be a severe limitation in many applications…

Thanks

Jerry