CUDA coredump support in MPS environment

suhach · July 27, 2017, 1:45pm

The GPU core dump could be generated by setting the environment variable “CUDA_ENABLE_COREDUMP_ON_EXCEPTION” to “1”. This works fine when kernel is launched on the device by a single client process without MPS.
But when MPS is used and the work launched by any client has caused an exception, the generated core dump file is not complete. It looks like the MPS server has exited before the GPU core dump could be written fully. Is there any way to get the complete core dump when MPS is used.

Cuda toolkit version : 8.0 Driver Version : 375.26 GPU architecture : Tesla P100 (Pascal)

Topic		Replies	Views
CUDA coredumps not being generated CUDA-GDB	15	331	October 2, 2025
how to use cuda-gdb core dump CUDA Programming and Performance	6	4607	December 28, 2023
Manually take Memory Dump? CUDA-GDB	7	4576	November 17, 2021
Full memory dump of a running neural network training process CUDA-GDB	1	1349	June 19, 2018
Is there any sample code for generating core dump? CUDA Programming and Performance	2	791	October 12, 2021
Cuda core dump does not work properly when many device assert happens CUDA Programming and Performance cuda-gdb	2	219	December 4, 2025
CUDA coredump file corrupted CUDA-GDB cuda	4	58	December 22, 2025
Why my gdb shows ?() when i use CUDA COREDUMP? CUDA-GDB	3	67	December 9, 2025
Cannot generate cuda coredump when cuda kernel access illegal address CUDA-GDB	8	1754	December 9, 2021
Can we specify a CUDA core dump location? CUDA Programming and Performance	4	1005	October 12, 2021

CUDA coredump support in MPS environment

Related topics