cuda-gdb cannot break in device code

HPrancer · April 6, 2011, 9:06pm

I am having some trouble getting cuda-gdb to work properly.

I have a project consisting of several files which I compile using the -g -G flag combination in a single step. When I run this program in cuda-gdb, I am unable to set breakpoints on my device functions. They are invisible to the tab-completion feature, and the program will not break if I set the breakpoints on the relevant line numbers of the .cu file. I am able to set a breakpoint on the global kernel function, but the debugger does not actually break there.

I have used cuda-gdb on other projects without this problem, but they were less complicated. So, I wonder if I have inadvertently built some problem into this current project that is causing this behaviour. What should I look for?

A particularly unsettling phenomenon is that when I compile the code with -g -G, the program runs in a fraction of the time and then gives the wrong answer! So, it appears the debugging flags are causing the program to do an entirely different calculation. What might be the cause of this?

I have seen this thread, but none of the advice in it has helped me:

Thanks in advance!

benetion · April 6, 2011, 11:31pm

I don’t know if this will help or not:

I have seen this once. When I used a texture fetch, I could not get into the device code with cuda-gdb. It may be that the feature is not supported (correct me if I am wrong).

The other symptom seems to indicate a bug.
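On the second symptom, one possibility worth ruling out: a kernel that fails to launch returns almost immediately and leaves the output buffers untouched, which would look like a much faster run that gives the wrong answer. Debug builds (-G) typically use more registers per thread, so a launch configuration that fits in the optimized build may no longer fit in the debug build. This is only a guess at the cause. A minimal sketch for checking it is below; the macro name CUDA_CHECK and the kernel scale are placeholders for illustration and are not taken from the original project.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper macro (not from the thread): abort with a message
// if a CUDA runtime call returns anything other than cudaSuccess.
#define CUDA_CHECK(call)                                              \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,        \
                    cudaGetErrorString(err_));                        \
            exit(EXIT_FAILURE);                                       \
        }                                                             \
    } while (0)

// Placeholder kernel standing in for the real one.
__global__ void scale(float *data, int n, float factor)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

int main()
{
    const int n = 1 << 20;
    float *d_data = NULL;
    CUDA_CHECK(cudaMalloc((void **)&d_data, n * sizeof(float)));
    CUDA_CHECK(cudaMemset(d_data, 0, n * sizeof(float)));

    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;
    scale<<<blocks, threads>>>(d_data, n, 2.0f);

    // Reports launch-time failures, e.g. an invalid configuration or
    // too many resources requested for the launch.
    CUDA_CHECK(cudaGetLastError());

    // Reports failures raised while the kernel is running.
    // Assumption: toolkit is CUDA 4.0 or newer; older toolkits used
    // cudaThreadSynchronize() for the same purpose.
    CUDA_CHECK(cudaDeviceSynchronize());

    CUDA_CHECK(cudaFree(d_data));
    printf("kernel completed without errors\n");
    return 0;
}
```

If the -g -G build prints an error here while the optimized build does not, the kernel is probably not running at all in the debug build, which would also explain why a breakpoint inside it is never hit.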

fcs · April 12, 2011, 9:24am

Hi guys,

I also have a strange bug with cuda-gdb on my GPU cluster, which is based on Tesla S1070 nodes.

We have CUDA 3.2 installed (I think we have had this problem since CUDA 3.0) with the Linux 64-bit driver 260.19.21.

The OS is "Red Hat Enterprise Linux Server release 5.3", slightly modified by the cluster vendor.

We can’t break in CUDA kernels.

After reading this post, I tried a simple reproducer following all the advice given here, but it still failed:

I compile my code in one step:

nvcc -G -g matmul.cu -o matmul_debug

At execution, I try to set the focus without success, and cuda-gdb finally crashes and dumps core when I try to step into the kernel:

Program exited normally.

(cuda-gdb) run

Starting program: ./matmul_debug 

[Thread debugging using libthread_db enabled]

[New process 23054]

Matrice réelle NxN: 1.05 Mo

[New Thread 47728650801024 (LWP 23054)]

[Switching to Thread 47728650801024 (LWP 23054)]

Breakpoint 1, kernel_mulmat (__cuda_0=0x100000, __cuda_1=0x200000, __cuda_2=0x300000, __cuda_3=512) at matmul.cu:6

6       __global__ void kernel_mulmat(real *A, real *B,real *C, int n){

(cuda-gdb) info cuda device

Focus not set on any running CUDA kernel.

(cuda-gdb) cuda device 0

No CUDA kernel is currently running.

(cuda-gdb) cuda device 1

No CUDA kernel is currently running.

(cuda-gdb) info cuda kernels

No active kernel on CUDA devices.

(cuda-gdb) step

Breakpoint 1, kernel_mulmat (__cuda_0=0x100000, __cuda_1=0x200000, __cuda_2=0x300000, __cuda_3=512) at matmul.cu:6

6       __global__ void kernel_mulmat(real *A, real *B,real *C, int n){

(cuda-gdb) info cuda threads

Focus not set on any running CUDA kernel.

(cuda-gdb) step

Breakpoint 1, kernel_mulmat (__cuda_0=0x100000, __cuda_1=0x200000, __cuda_2=0x300000, __cuda_3=512) at matmul.cu:6

6       __global__ void kernel_mulmat(real *A, real *B,real *C, int n){

(cuda-gdb) step

__device_stub__Z13kernel_mulmatPfS_S_i (__par0=0x100000, __par1=0x200000, __par2=0x300000, __par3=512) at /tmp/tmpxft_00005985_00000000-1_matmul.cudafe1.stub.c:6

6       /tmp/tmpxft_00005985_00000000-1_matmul.cudafe1.stub.c: No such file or directory.

        in /tmp/tmpxft_00005985_00000000-1_matmul.cudafe1.stub.c

(cuda-gdb) step

7       in /tmp/tmpxft_00005985_00000000-1_matmul.cudafe1.stub.c

(cuda-gdb) step

cudaLaunch<char> (

    entry=0x401062 "UH\211ï¿½\203ï¿½H\211}ï¿½\211uï¿½\211Uï¿½211Mï¿½213Mï¿½\213Uï¿½\213uï¿½\213}ï¿½\031ï¿½ï¿½ï¿½ï¿½\220UH\211ï¿½\203ï¿½020ï¿½017\021Eï¿½\213Eï¿½\211Eï¿½\017\020Eï¿½\tï¿½ï¿½f\017(ï¿½\017\020\005ï¿½s") at /applications/cuda-3.2/bin/../include/cuda_runtime.h:935

935       return cudaLaunch((const char*)entry);

(cuda-gdb) step

BACKTRACE (9 frames):

cuda-gdb[0x459b2e]

/lib64/libc.so.6[0x3d36e30280]

/usr/lib64/libcuda.so[0x2b4092e3c06f]

/usr/lib64/libcuda.so[0x2b4092e379aa]

/usr/lib64/libcuda.so[0x2b4092e389dd]

/usr/lib64/libcuda.so[0x2b4092e39db5]

/usr/lib64/libcuda.so[0x2b4092fb90c9]

/lib64/libpthread.so.0[0x3d37a06367]

/lib64/libc.so.6(clone+0x6d)[0x3d36ed2f7d]

If I put the breakpoint on a line inside the kernel, it also crashes.

The strange thing is that the same code, with the same CUDA version and the same driver, works on a machine with CentOS 5.2 and a Quadro FX5800.

We are waiting for the CUDA 4 final release to see whether this has changed…
