cudaMalloc fails after

jdinkla · May 31, 2011, 10:34am

The first call to cudaMalloc in the following code fails with cudaErrorMemoryAllocation because the amount of memory is too large. But the second call for only 1024 bytes fails, too.

cudaError_t err;

cudaThreadSynchronize();				// Initialize

// Determine free memory

size_t free, total;

err = cudaMemGetInfo(&free, &total);

if (err != cudaSuccess) {

	cout << "Error cudaMemGetInfo" << endl;

} else {

	cout << "Free=" << free << ", total=" << total << endl;

}

void* ptr;

// The first malloc fails, because the requested block is too large

err = cudaMalloc(&ptr, free);

if (err != cudaSuccess) {

	cout << "Error cudaMalloc large block" << endl;

} 

cudaFree(ptr);

// The second call fails, too

err = cudaMalloc(&ptr, 1024);

if (err != cudaSuccess) {

	cout << "Error cudaMalloc small block" << endl;

} 

cudaFree(ptr);

So after the first call the device is in an usable state. I did not expect this behaviour. Is this a bug or a feature? If it is a feature than it should be included in the documentation. I ran this on Windows 7 64 Bit, CUDA 3.2, WDDM Driver 270.81 and Visual Studio 2008.

Is there a way around this? I can’t reset the device with cudaThreadExit(), because in my application there are other buffers on the device and already allocated. And i have to use CUDA 3.2 because of my customer.

I read in another thread, that the author uses a conservative estimates of 80% of free memory. Is this the only way?

The output:

Free=1505865728, total=1576468480

Error cudaMalloc large block

Error cudaMalloc small block

Best regards,

Joern Dinkla

tera · May 31, 2011, 12:03pm

I think you need to call cudaGetLastError() to reset the error.

jdinkla · May 31, 2011, 12:29pm

Thanks for the quick response. But this does not change the behaviour of the second call to cudaMalloc.

Best regards,

Joern

tera · May 31, 2011, 1:40pm

Sorry this didn’t help. As for the limit itself, there is a note in the release notes regarding Windows with WDDM drivers:

To circumvent this, you would need to run on a Tesla in TCC mode, or on Linux.

jdinkla · May 31, 2011, 2:34pm

According to The Official NVIDIA Forums | NVIDIA, “PAGING_BUFFER_SEGMENT_SIZE is approximately 2GB” and the system memory is 18 GB. So the formula yields 2GB, because MIN ( ( System Memory Size in MB - 512 MB ) / 2, PAGING_BUFFER_SEGMENT_SIZE ) = MIN ( ( 18000 - 512 MB ) / 2, 2 GB) = 2 GB.

So this is not the reason for the error described, because the card only has 1.5 GB.

Idefix1981 · June 15, 2012, 11:04am

For those who come here via google (like me). You need to call

cudaDeviceReset();

jdinkla · June 18, 2012, 7:40am

This destroys all the buffers and streams allocated on the device. In my original post i wrote “in my application there are other buffers on the device and already allocated”.

Topic		Replies	Views
cudaMalloc failed with unknown error after only 491656bytes CUDA Programming and Performance	9	4386	July 2, 2009
CudaMalloc on Vista : strange behaviour Works on XP, Fails on Vista CUDA Programming and Performance	6	12258	July 1, 2009
cuMemAlloc limited to 1/4 total GPU memory? CUDA Programming and Performance	10	12776	April 1, 2010
Slow cudaMalloc (~1.5s) and slow mem access there, allocating nearly whole memory, with WDDM CUDA Programming and Performance	0	1091	June 18, 2014
cudaMemGetInfo returns wrong amount free memory CUDA Programming and Performance	3	5271	December 11, 2012
cudaMalloc error in big loop CUDA Programming and Performance	12	15608	May 21, 2008
Maximum memory allocation size CUDA Programming and Performance	7	16668	January 24, 2012
How much GPU memory can cudaMalloc get? CUDA Programming and Performance	17	15168	April 2, 2022
cudaFree is returning an unrecognised error code CUDA Programming and Performance	10	7946	March 13, 2009
Cuda Out of Memory with tons of memory left? CUDA Programming and Performance	5	38995	December 23, 2009

cudaMalloc fails after

Related topics