Streaming CUDA core dump into the pipe

gritukan · September 23, 2020, 5:19pm

Hello,

I’m building a service that runs on a server with many GPU jobs and collects core dumps that are produced by them. Since the HDDs of this server are heavily loaded by the aforementioned jobs, I’d rather not materialize these coredumps on disks. Instead I’d prefer to write coredumps into the pipe and stream them to a remote storage.

Unfortunately, it seems that CUDA library is unable to write coredumps into the pipe. When I set CUDA_COREDUMP_FILE environment variable to the path of my pipe, only first 64 bytes of the coredump are sent.

After a small research with strace I found out that CUDA library calls ftell function of a file descriptor of coredump file. This function returns -1 for pipes and after that program terminates. I’ve implemented a custom version of ftell that counts the number of bytes written into the pipe using LD_PRELOAD mechanism and this allowed me to obtain an almost valid coredump (the only difference is that first 64 bytes of the coredump are located at the end of the file, with one possible explanation being that CUDA library does fseek till the beginning of a coredump file when writing ELF header).

However, this custom approach seems totally unreliable. Is it possible to fix it in CUDA library?

Best regards,
Grigory Reznikov.

AKravets · March 12, 2021, 9:47am

Hi! Thank you very much for the report and the detailed investigation. We are actively working on addressing this issue, so it will be fixed in one of the upcoming releases.

I will provide another update in this post when the fixed CUDA GDB version is released.

AKravets · July 5, 2021, 12:30pm

Hi!

Coredump streaming to pipe has been fixed in CUDA toolkit 11.4 CUDA GDB release (current available at https://developer.nvidia.com/cuda-toolkit).

gritukan · July 5, 2021, 12:44pm

Great news, thank you very much!

Topic		Replies	Views
Manually take Memory Dump? CUDA-GDB	7	4555	November 17, 2021
CUDA coredump file corrupted CUDA-GDB cuda	4	46	December 22, 2025
CUDA coredumps not being generated CUDA-GDB	15	313	October 2, 2025
how to use cuda-gdb core dump CUDA Programming and Performance	6	4579	December 28, 2023
Can we specify a CUDA core dump location? CUDA Programming and Performance	4	1004	October 12, 2021
Cuda gdb cannot load coredump file, error: Assertion `m_instance.m_num_devices > 0' failed CUDA-GDB cuda-gdb	2	319	June 27, 2025
Is there any sample code for generating core dump? CUDA Programming and Performance	2	789	October 12, 2021
cuda-gdb cudacore CUDA Programming and Performance	1	638	November 22, 2017
Why my gdb shows ?() when i use CUDA COREDUMP? CUDA-GDB	3	63	December 9, 2025
CUDA coredump support in MPS environment CUDA Programming and Performance	0	486	July 27, 2017

Streaming CUDA core dump into the pipe

Related topics