cudaMalloc: out of memory, although the GPU memory is enough

I have run into a strange problem where cudaMalloc reports an "out of memory" error even though there is enough free GPU memory.
The computer I used has two GTX 1070 GPUs and runs Windows 10.
The situation is this: I have two programs that solve the same problem.
The first program uses two CPU threads; each thread drives one of the two GPUs, and the two GPUs solve the problem cooperatively. This program works correctly.
The second program uses the same two GPUs to solve the same problem; the only difference is that each GPU is driven by a separate MPI process.
Strangely, this second program fails because cudaMalloc reports "out of memory", even though the program needs only about half of the total GPU memory.
I call cudaSetDevice before cudaMalloc to make sure the two MPI processes operate on different GPUs.
I am using CUDA 10.0, and the MPI package is Microsoft MPI (MS-MPI). SLI is disabled in the driver control panel.
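For reference, here is a minimal sketch (not my actual program) of the per-rank setup I described: each MPI rank selects one GPU, prints the free/total memory that rank actually sees via cudaMemGetInfo, and then allocates. The 2 GiB allocation size is just an example figure. Printing cudaMemGetInfo right before the failing cudaMalloc should show whether each rank really landed on a different device and how much memory was actually free at that point.

```cpp
#include <cstdio>
#include <mpi.h>
#include <cuda_runtime.h>

// Abort the whole MPI job with a readable message on any CUDA error.
#define CUDA_CHECK(call)                                              \
    do {                                                              \
        cudaError_t err = (call);                                     \
        if (err != cudaSuccess) {                                     \
            fprintf(stderr, "CUDA error '%s' at %s:%d\n",             \
                    cudaGetErrorString(err), __FILE__, __LINE__);     \
            MPI_Abort(MPI_COMM_WORLD, 1);                             \
        }                                                             \
    } while (0)

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);
    int rank = 0;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    int deviceCount = 0;
    CUDA_CHECK(cudaGetDeviceCount(&deviceCount));
    // Map rank -> device; with two ranks and two GPUs this is 0 and 1.
    int device = rank % deviceCount;
    CUDA_CHECK(cudaSetDevice(device));

    // Report what this rank actually sees on its device.
    size_t freeMem = 0, totalMem = 0;
    CUDA_CHECK(cudaMemGetInfo(&freeMem, &totalMem));
    printf("rank %d: device %d, free %zu MiB / total %zu MiB\n",
           rank, device,
           freeMem / (1024 * 1024), totalMem / (1024 * 1024));

    // Example allocation (2 GiB); this is where "out of memory" appears.
    void *dptr = nullptr;
    CUDA_CHECK(cudaMalloc(&dptr, (size_t)2 << 30));
    CUDA_CHECK(cudaFree(dptr));

    MPI_Finalize();
    return 0;
}
```

Built with nvcc plus the MS-MPI headers/libraries and launched with `mpiexec -n 2`.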

Any idea what the problem could be? Thanks very much!

Are you running out of CPU memory by any chance?
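Since this is Windows, one quick way to rule that out is to print host physical and virtual memory right before the failing cudaMalloc. A small sketch using the Win32 GlobalMemoryStatusEx call (the placement before cudaMalloc is the assumption here):

```cpp
#include <cstdio>
#include <windows.h>

int main()
{
    // Query host physical and virtual memory; on Windows (WDDM),
    // exhausted host memory/address space can surface as a CUDA
    // allocation failure.
    MEMORYSTATUSEX status;
    status.dwLength = sizeof(status);
    if (GlobalMemoryStatusEx(&status)) {
        printf("phys: %llu / %llu MiB free, virt: %llu / %llu MiB free\n",
               status.ullAvailPhys / (1024 * 1024),
               status.ullTotalPhys / (1024 * 1024),
               status.ullAvailVirtual / (1024 * 1024),
               status.ullTotalVirtual / (1024 * 1024));
    }
    return 0;
}
```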

I got the same problem calling dlib with GPU support through the Python API in face_recognition.

Not sure if it is a dlib problem?

The CUDA test passed, though.

cuda_data_ptr.cpp, line 58: out of memory.