Error message when stepping out of __global__ function in cuda-gdb

gy_xiao · August 28, 2019, 5:42pm

When I try to step out of a __global__ function in cuda-gdb, I get the following error message:

(cuda-gdb) s
0x00002aaaac219110 in cuVDPAUCtxCreate () from /lib64/libcuda.so.1
(cuda-gdb) s
Single stepping until exit from function cuVDPAUCtxCreate,
which has no line number information.
cuda-gdb/7.12/gdb/infrun.c:2794: internal-error: resume: Assertion `pc_in_thread_step_range (pc, tp)' failed.
A problem internal to GDB has been detected,
further debugging may prove unreliable.
Quit this debugging session? (y or n)

In my code, the first line of host code after the __global__ function call is cudaDeviceSynchronize(). This is the backtrace I get at that point:

(cuda-gdb) bt
#0  0x00002aaaac219110 in cuVDPAUCtxCreate () from /lib64/libcuda.so.1
#1  0x00002aaaac219504 in cuVDPAUCtxCreate () from /lib64/libcuda.so.1
#2  0x00002aaaac11e65c in cudbgApiDetach () from /lib64/libcuda.so.1
#3  0x00002aaaac11e810 in cudbgApiDetach () from /lib64/libcuda.so.1
#4  0x00002aaaac052b5a in ?? () from /lib64/libcuda.so.1
#5  0x00002aaaac1a4a9d in cuCtxSynchronize () from /lib64/libcuda.so.1
#6  0x00000000005163ad in cudart::cudaApiDeviceSynchronize() ()
#7  0x000000000053b04d in cudaDeviceSynchronize ()

Does anyone know whether this is a cuda-gdb bug or a problem in my own code? Thank you.

T.D.Qiu · August 29, 2019, 1:37pm

Can you show some code? Do you use any libraries?
Do you check for errors on every call to a CUDA function, like cudaMalloc()?
Also mention versions and platform.
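
For reference, here is a minimal sketch of what checking every CUDA call might look like. The helper `cuda_ok`, the kernel `demo_kernel`, and the wrapper `run_kernel_checked` are hypothetical names made up for illustration and are not taken from the poster's code. Only `cudaGetLastError`, `cudaDeviceSynchronize`, and `cudaGetErrorString` are actual CUDA runtime calls.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper (not from the thread): reports a failed CUDA runtime
// call on stderr and returns false so the caller can decide how to bail out.
static bool cuda_ok(cudaError_t err, const char *what) {
    if (err != cudaSuccess) {
        fprintf(stderr, "%s failed: %s\n", what, cudaGetErrorString(err));
        return false;
    }
    return true;
}

// Hypothetical stand-in kernel used only for this sketch.
__global__ void demo_kernel() {
    printf("In demo kernel\n");
}

// Launches the kernel and checks both the launch and its completion.
static bool run_kernel_checked() {
    demo_kernel<<<1, 1>>>();
    // Launch-time errors (for example a bad launch configuration) show up here.
    if (!cuda_ok(cudaGetLastError(), "kernel launch")) return false;
    // Errors raised while the kernel runs show up at the next synchronization.
    return cuda_ok(cudaDeviceSynchronize(), "cudaDeviceSynchronize");
}
```

Calling `run_kernel_checked()` from `main` and returning a nonzero exit code when it yields false would at least show whether the runtime itself reports a failure, independent of what the debugger does.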

gy_xiao · September 9, 2019, 6:21am

Thank you for your offer to help. Sorry for my late reply.

As my original code is too large to share here, I reproduced the problem with a simple code snippet:

#include <stdio.h>

using namespace std;

__global__
void kernel_func() {
	printf("In kernel func\n");
	return;
}

int main() {
	kernel_func <<<1, 1>>> ();
	cudaDeviceSynchronize();
	return 0;
}

The same error occurs with this snippet:

Thread 1 "simp4gdb" hit Breakpoint 1, kernel_func<<<(1,1,1),(1,1,1)>>> ()
    at simp4gdb.cpp:7
7               printf("In kernel func\n");
(cuda-gdb) n
8               return;
(cuda-gdb) n
0x00002aaaac0763d0 in cuMemGetAttribute_v2 () from /lib64/libcuda.so.1
(cuda-gdb) n
Single stepping until exit from function cuMemGetAttribute_v2,
which has no line number information.
cuda-gdb/7.12/gdb/infrun.c:2794: internal-error: resume: Assertion `pc_in_thread_step_range (pc, tp)' failed.
A problem internal to GDB has been detected,
further debugging may prove unreliable.
Quit this debugging session? (y or n)

Here is how I compiled this code snippet:

nvcc -x cu -g -G -Xcompiler -rdynamic simp4gdb.cpp -o simp4gdb

The CUDA version installed on my server is 10.0.130, and the device has compute capability 7.0. The OS is CentOS Linux release 7.6.1810, and the kernel version is 3.10.0-957.el7.x86_64.

I run CUDA on a remote server. Could something be wrong with my cuda-gdb setup?
