Timeout under Linux, is it possible to remove it?

jmricher70 · January 20, 2023, 2:37am

Hello, I had a question related to the “timeout” and the fact that a kernel will stop after a few seconds.

I am using Ubuntu 20.04 and a NVIDIA GeForce RTX 3050 with driver 525.60.11.
When I run a quite long CUDA kernel (more than 100s) I don’t get any timeout while on an old computer with a GTX 770 a have a timeout and the kernel stops after 5s.

My question is why I don’t get the timeout under the recent computer ? Is it related to

Ubuntu / Gnome ?
the GPU ?
or the GPU driver ?

and is it possible to remove the timeout or to set it for example using nvidia-smi ? I am working with students and they have laptops with Linux/Debian and a Quadro M1200 and they have the timeout active.

Best regards,
JMR

AKravets · January 20, 2023, 8:17am

Hi @jmricher70
Are you running your application under the compute debugger (cuda-gdb). Please note, that this forum thread is dedicated to the compute debugger support. If you are running without the debugger, you might consider different forum section, e.g.: CUDA Programming and Performance - NVIDIA Developer Forums

jmricher70 · January 20, 2023, 10:35am

No I am not running the program with cuda-gdb. I didn’t know this forum was for the debnugger.
Thanks for you answer.
JM

AKravets · January 20, 2023, 10:37am

Thank you for the clarification!

I have moved the topic to general CUDA section.

jmricher70 · January 20, 2023, 10:38am

Thank you very much.
Best regards.
JM

njuffa · January 20, 2023, 11:36am

This is related to a GUI using the GPU. A long-running compute kernel typically blocks graphical tasks like updating the GUI. A frozen GUI makes for a bad user experience. So operating system running a GUI limit the runtime of compute kernels to about two seconds. This limit is guarded by a watchdog timer that triggers destruction of the compute context when it expires, which triggers a CUDA error detectable with proper CUDA error checking.

There are operating-system dependent ways of turning the GUI watchdog timer on or of and/or set the time limit. This is controlled outside of CUDA, so look in the documentation of your operating system(s) of how to use these controls. Alternatively, do not extend the GUI to the GPU in question.

Robert_Crovella · January 20, 2023, 2:36pm

https://nvidia.custhelp.com/app/answers/detail/a_id/3029/~/using-cuda-and-x

Topic		Replies	Views
CUDA Timeout? CUDA Programming and Performance	7	27707	December 19, 2011
"time out" in cuda program mechanism of "time out" CUDA Programming and Performance	14	12749	December 9, 2008
User Request kernel timout CUDA Programming and Performance	4	974	January 5, 2015
Has 2-second timeout problem been fixed yet? CUDA Programming and Performance	7	1939	November 9, 2012
Disabling the "run time limit" How do I disable the so-called, "KERNEL_EXEC_TIMEOUT" CUDA Programming and Performance	1	1673	December 19, 2011
Configuring timeout CUDA Programming and Performance	3	3888	October 12, 2007
Need to remove timeouts and the "launch timed out and was terminated" message CUDA Programming and Performance	20	11391	May 24, 2010
Cuda timeout and crash CUDA Programming and Performance	1	913	July 17, 2009
CUDA Kernel Crash CUDA Programming and Performance	13	4689	January 8, 2018
CUDA kernel timeout CUDA Programming and Performance	12	58866	December 22, 2022

Timeout under Linux, is it possible to remove it?

Related topics