Long running kernels / GPU watchdog timeout

dave-lowe · May 9, 2019, 6:58pm

New to CUDA under Windows, I soon came across the run-time limits on kernels, & the reg edit workarounds to increase the timeout there.

Now new to CUDA under Linux, when porting that code to L4T on the nano, I quickly realised I might be hitting a similar restriction. I’m guessing it’s this, as reducing the kernel workload / iterations per kernel makes things run as expected. But rather than exit with an error as it does under windows, my code seems to hang at the point where the kernel workload has exceeded the imagined limit.

Two questions:

Is the hung code (L4T) v early exit (windows) the correct symptom re too long a kernel run time?
Googling seems to indicate no way under linux/L4T to increase the timeout - correct?

Thanks, Dave.

Honey_Patouceul · May 9, 2019, 7:21pm

This is a quite old post (this was for TK1) and I haven’t been dealing with such case since then, but there was a time where this might have some relevance for your case. However I can’t tell what would be the sysfs path if any for Nano.

Topic		Replies	Views
Kernel launch timeout Jetson Nano cuda , nvbugs , nano2gb	16	2663	January 19, 2022
Disabling Runtime Execution Limit Jetson Nano kernel , nano2gb	12	1418	October 15, 2021
Timeout under Linux, is it possible to remove it? CUDA Programming and Performance cuda	6	1148	January 20, 2023
CUDA Timeout? CUDA Programming and Performance	7	27838	December 19, 2011
CUDA Kernel error: "The launch timed out and was terminated" Jetson Nano cuda , nano2gb	5	3151	October 15, 2021
"time out" in cuda program mechanism of "time out" CUDA Programming and Performance	14	12896	December 9, 2008
How to disable TDR in Jetson Nano Jetson Nano cuda	2	914	March 9, 2022
User Request kernel timout CUDA Programming and Performance	4	1044	January 5, 2015
CUDA kernel timeout CUDA Programming and Performance	12	59139	December 22, 2022
5 seconds limitation is permanent ? CUDA Programming and Performance	9	14012	June 4, 2007

Long running kernels / GPU watchdog timeout

Related topics