cudaFree does not free memory on Kepler

Hi All,

We have the following issue with CUDA 4.2 and a GTX 680: if a large block of memory is allocated by cudaMalloc(), not all of it is deallocated by a subsequent cudaFree() call. "Large" here means on the order of 1 GB; everything is fine if only 1 MB or so is allocated.

It wouldn't be a big deal except that it presumably leads to memory fragmentation, which is lethal for our code. Here is a simple snippet reproducing the problem. Any ideas what's going on? Thanks!

#include <stdio.h>
#include <cuda.h>   // driver API (cuMemGetInfo)

__global__ void null()
{
}

int main(int argc, char** argv)
{
    cudaSetDevice(0);
    null <<< 1, 1 >>> ();   // establishes the runtime context so the driver API calls below work

    CUresult CUStat;
    size_t free, total;
    size_t free1, total1;

    CUStat = cuMemGetInfo(&free, &total);
    printf("err code = %d\n", CUStat);

    int* d_data;
    cudaMalloc((void**)&d_data, 1024*1024*1024);
    cudaFree(d_data);

    CUStat = cuMemGetInfo(&free1, &total1);
    printf("err code = %d\n", CUStat);
    printf("lost %zu bytes\n", free - free1);   // size_t needs %zu, not %d

    return 0;
}

In my case the output is:

err code = 0
err code = 0
lost 0 bytes (1 MB chunk)

err code = 0
err code = 0
lost 3145728 bytes (1 GB chunk)

Sounds very much like cudaFree() is asynchronous. Try calling cudaDeviceSynchronize() after the free but before polling for free memory or allocating anything else.
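A minimal sketch of that suggestion, here using the runtime-API cudaMemGetInfo() rather than the driver call (the allocation size and variable names are just for illustration, and this assumes a successful cudaMalloc):

```cuda
#include <stdio.h>
#include <cuda_runtime.h>

int main(void)
{
    size_t before, after, total;
    void* p = NULL;

    cudaMemGetInfo(&before, &total);

    cudaMalloc(&p, (size_t)1024 * 1024 * 1024);
    cudaFree(p);
    cudaDeviceSynchronize();   // make sure the free has actually completed
                               // before querying free memory again
    cudaMemGetInfo(&after, &total);
    printf("delta after sync: %zu bytes\n", before - after);
    return 0;
}
```

If the delta goes to zero with the synchronize in place, an asynchronous free was the explanation; if it stays nonzero, the memory is being held for some other reason.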

It has to do with when the CUDA driver allocates page-table entries (PTEs). That's done dynamically when needed, and they are freed on context destruction. It doesn't cause fragmentation or anything like that. We actually reduced the PTE overhead significantly in the upcoming 5.0 release (I think it goes down to 64 KB for your test).
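If the retained memory really is per-allocation page-table overhead, it should grow roughly in proportion to the allocation size. A sketch that checks this by sweeping chunk sizes (not verified against any particular driver version, and the "retained" interpretation is an assumption):

```cuda
#include <stdio.h>
#include <cuda_runtime.h>

int main(void)
{
    // Allocate and free chunks of increasing size, then report how much
    // free memory the driver still holds afterwards.
    for (size_t mb = 1; mb <= 1024; mb *= 4) {
        size_t before, after, total;
        void* p = NULL;

        cudaMemGetInfo(&before, &total);
        if (cudaMalloc(&p, mb * 1024 * 1024) != cudaSuccess)
            break;                       // stop once allocation fails
        cudaFree(p);
        cudaDeviceSynchronize();         // rule out an asynchronous free
        cudaMemGetInfo(&after, &total);

        printf("%4zu MB chunk: %zu bytes retained\n", mb, before - after);
    }
    return 0;
}
```

A roughly linear relationship between chunk size and retained bytes would be consistent with the PTE explanation above.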