GPU memory is sufficient when using MPS, but CUDA reports out of memory

I run inference on a P100 machine with 16,000 MB of GPU memory. With MPS enabled and 8 processes, everything works fine. But with 9 inference processes, even though only about 6,000 MB of GPU memory is in use, CUDA reports an out-of-memory error.
Many thanks!

I am facing the same problem. On a T4, with only about 10% of GPU memory in use, I start getting out-of-memory errors once I have 4 MPS client processes. Everything works with 3 or fewer.

The documentation says this is a known issue. Following the suggestions there, I call cudaSetDevice(0) on the first line of my main function and compile my executable with -fPIE and -fPIC. However, I am still getting the same problem.

Any suggestions on how I can fix this problem? Thank you.
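For reference, here is a minimal sketch of the workaround described above. This is only an illustration of the suggested sequence (select the device first, then force context creation), not a guaranteed fix; whether it helps depends on the driver and MPS version, and the rest of the inference setup is assumed.

```cpp
// main.cu — establish the CUDA context explicitly before any other
// CUDA work. Build with position-independent code, e.g.:
//   nvcc -Xcompiler -fPIE -Xcompiler -fPIC -o infer main.cu
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    // Bind this process to device 0 as the very first CUDA call,
    // so the context is created up front under the MPS server.
    cudaError_t err = cudaSetDevice(0);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "cudaSetDevice failed: %s\n",
                     cudaGetErrorString(err));
        return 1;
    }

    // cudaFree(0) is a common idiom to force context creation
    // immediately, rather than lazily on the first kernel launch.
    err = cudaFree(0);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "context creation failed: %s\n",
                     cudaGetErrorString(err));
        return 1;
    }

    // ... rest of the inference setup ...
    return 0;
}
```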

Hi there @nkwkelvin,

since the original post is more than a year old, would you mind creating a new post in the CUDA forums? That is a better place to ask for advice on how to implement these workarounds in CUDA.

Thanks!