Where to find info on timeouts, etc?

jamesqf · October 8, 2011, 4:29am

I’ve looked through a lot of the CUDA docs (I think all that might be relevant) and don’t see anything that addresses the problem of dealing with timeouts, and related issues. Could someone point me in the right direction?

Generally, I’m working with code that will do intensive computations on a 3D grid. There’s enough work that it will probably run much longer than the watchdog timer limit (5 sec on Linux, I believe) even on fast hardware, so needs to be broken into sub-problems. But it will be run (by clients) on all sorts of GPU hardware, so I need to figure out chunk sizes to use to minimize overhead, but not get killed by watchdog. (And if it is killed, detect this, back up, and split the failed part into smaller chunks.)

It’d also be good to know things like e.g. how to make the GPU card into a pure compute device, letting the on-board graphics handle the display. But again, I don’t see much in the way of documentation.

Thanks,
James

Topic		Replies	Views
Need to remove timeouts and the "launch timed out and was terminated" message CUDA Programming and Performance	20	11600	May 24, 2010
CUDA timeout CUDA Programming and Performance	5	4314	May 23, 2008
Timeout under Linux, is it possible to remove it? CUDA Programming and Performance cuda	6	1206	January 20, 2023
Launch timed out CUDA Programming and Performance	4	6256	February 19, 2010
Configuring timeout CUDA Programming and Performance	3	3986	October 12, 2007
Cuda timeout and crash CUDA Programming and Performance	1	964	July 17, 2009
CUDA Timeout? CUDA Programming and Performance	7	27875	December 19, 2011
The Cuda 5 Second execution-time limit Finding a the way to work around the GDI timeout CUDA Programming and Performance	24	12976	July 26, 2010
User Request kernel timout CUDA Programming and Performance	4	1053	January 5, 2015
Windows Watchdog Timeout on gpu with no display (reg hack, programm still freezes) Cuda Kernel Timeo CUDA Programming and Performance	0	7125	October 14, 2011

Where to find info on timeouts, etc?

Related topics