Exception while debugging. How to upgrade cuda-gdb?

nikitablack · September 14, 2021, 9:20am

Greetings,

I started to work on a new project that is based on NVidia Jetson Xavier Agx, I followed the instruction on how to set everything up. In the end, I have a running device with software installed from sdkmanager.

I am able to compile cuda programs and run them. But the problem is with debugging. Provided with cuda-10.2 debugger crashes when stepped over ((cuda-gdb) next) a cuda function with the message _dl_catch_exception(). I found a couple of relevant topics, for example this. Though I have the same problem even when debugging locally on the device itself. I tried to follow the instructions, tried to run the debugger with sudo but the problem stays. The problem is reproducible with cuda samples.

The linked topic is from 2019 and the NVidia employer mentions that this is a known bug with the gdb-7.12 and that should be fixed in gdb-8.2. But today is 2021 and the sdkmanager still installs the same broken version.

What is the fix? Should I upgrade the debugger manually? How can I do it?

AastaLLL · September 15, 2021, 2:52am

Hi,

Could you share the detailed steps to reproduce this with a CUDA sample?
And the complete output error log with us first?

Thanks.

nikitablack · September 15, 2021, 9:38am

Hi @AastaLLL,

Compile cuda-10.2/samples/0_Simple/vectorAdd with make dbg=1.
Strart cuda debugger cuda-gdb vectorAdd.
Break on the line with the cuda function break vectorAdd.cu:82 (this adds a breakpoint on the line err = cudaMalloc((void **)&d_A, size)).
Run the program with run
Step over the function with next.

After stepping over I have the following output:

Breakpoint 1, main () at vectorAdd.cu:82
82	    err = cudaMalloc((void **)&d_A, size);
(cuda-gdb) next
0x0000007fb7d4e684 in _dl_catch_exception ()
   from /lib/aarch64-linux-gnu/libc.so.6
(cuda-gdb) next
Single stepping until exit from function _dl_catch_exception,
which has no line number information.
0x0000007fb7fe2418 in _dl_find_dso_for_object ()
   from /lib/ld-linux-aarch64.so.1
(cuda-gdb) next
Single stepping until exit from function _dl_find_dso_for_object,
which has no line number information.
cuda-gdb/7.12/gdb/infrun.c:2795: internal-error: resume: Assertion `pc_in_thread_step_range (pc, tp)' failed.
A problem internal to GDB has been detected,
further debugging may prove unreliable.
Quit this debugging session? (y or n)

If I run the program without the debugger, it works fine. If I continue instead of next, it works fine.
I tried to do the same steps on my laptop where I installed the same 10.2 Cuda and everything works fine.

AastaLLL · September 28, 2021, 6:26am

Hi,

Thanks for sharing the detailed steps.
For JetPack 4.6, the cuda-gdb is still integrated with gdb v7.12.

We will upgrade CUDA to 11.0 in our next major JetPack.
You can find a release plan below:

Thanks.

Topic		Replies	Views
Cuda developmnent on Jetson Nano Jetson Nano cuda , debugger	6	1366	June 22, 2022
CUDA API calls throw exceptions in remote debugger Jetson TX2	14	2084	September 22, 2021
CUDA-GDB not work on debian Nsight Visual Studio Code Edition cuda , cuda-gdb	6	967	January 11, 2024
Kernel code execution causes segmentation fault in cuda-dbg but not when executed standalone CUDA-GDB cuda-gdb	4	835	July 3, 2024
Cuda-gdb run failed, but gdb run success CUDA-GDB cuda	7	1065	October 24, 2023
Can't use Cuda-gdb CUDA-GDB	7	5141	November 28, 2022
Debug error Jetson AGX Xavier cuda	9	4681	October 18, 2021
Cannot Debug Jetson AGX Xavier with JetPack SDK 5.0 Developer Preview Jetson AGX Xavier cuda	4	934	May 13, 2022
[Jetson Orin AGX \| CUDA 12.6] cuda-gdb causes SIGSEGV in libcudadebugger.so.1 when entering kernel CUDA-GDB cuda	3	93	November 28, 2025
Cuda-gdb doesn't break and/or step into Kernels CUDA Programming and Performance	26	54254	August 1, 2011

Exception while debugging. How to upgrade cuda-gdb?

Related topics