Memory leak in nvcuda.dll

uk2017 · March 30, 2017, 11:21am

In our CUDA application I am seeing a memory leak in nvcuda.dll, which I spotted with the tool ‘Memory Validator’ from Software Verify Ltd. This is the stack trace Memory Validator reported:

id:1,595,284 <<33,286 objects>> void * : 8,521,216 bytes, largest allocation 256 bytes at 0x000000002e298cd0 : [NoFileName Line 0]
Allocation location 1 of 33,286 allocations, Largest: 256 bytes, Total: 8,521,216 bytes
Heap ID: 0x0000000000980000
nvcuda.dll Ordinal39 : [NoFileName Line 0]
nvcuda.dll Ordinal39 : [NoFileName Line 0]
nvcuda.dll Ordinal39 : [NoFileName Line 0]
nvcuda.dll Ordinal39 : [NoFileName Line 0]
nvcuda.dll Ordinal39 : [NoFileName Line 0]
nvcuda.dll Ordinal39 : [NoFileName Line 0]
nvcuda.dll Ordinal39 : [NoFileName Line 0]
nvcuda.dll Ordinal39 : [NoFileName Line 0]
nvcuda.dll Ordinal39 : [NoFileName Line 0]

Unfortunately I am not able to reproduce the problem in a simple program. We have a complex application that, under certain circumstances, enters a ‘leak mode’ in which each call to cudaDeviceSynchronize() leaks a 256-byte memory block.

Once this ‘leak mode’ has been entered, I can deactivate most parts of the running application, so that only a loop with two CUDA calls is left:

loop
{
    cudaMemset(…);
    cudaDeviceSynchronize();
}

The ‘leak mode’ persists in this case.
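
For reference, the reduced loop can be written out as a standalone program. This is a minimal sketch, not the original application's code: the buffer size, fill value, iteration count and the CUDA_CHECK macro are assumptions made for illustration, since the cudaMemset arguments are elided above. A small program like this was not observed to leak on its own; it only makes the two calls concrete so that host memory can be watched with an external tool while it runs.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper: abort with a readable message if a runtime call fails.
#define CUDA_CHECK(call)                                           \
    do {                                                           \
        cudaError_t err_ = (call);                                 \
        if (err_ != cudaSuccess) {                                 \
            std::fprintf(stderr, "%s failed: %s\n", #call,         \
                         cudaGetErrorString(err_));                \
            std::exit(EXIT_FAILURE);                               \
        }                                                          \
    } while (0)

int main() {
    const size_t kBytes = 1 << 20;    // assumed buffer size (1 MiB)
    const int kIterations = 100000;   // assumed iteration count

    void* d_buf = nullptr;
    CUDA_CHECK(cudaMalloc(&d_buf, kBytes));

    // The two calls that remain active in the reduced application.
    for (int i = 0; i < kIterations; ++i) {
        CUDA_CHECK(cudaMemset(d_buf, 0, kBytes));
        CUDA_CHECK(cudaDeviceSynchronize());
    }

    CUDA_CHECK(cudaFree(d_buf));
    CUDA_CHECK(cudaDeviceReset());
    return 0;
}
```

Checking every return code rules out the possibility that the growth comes from a silently failing call rather than from the driver itself.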

The circumstances under which this ‘leak mode’ is entered are unclear and not reliably reproducible; it appears to happen at random. But once it is entered, it persists, and memory consumption grows indefinitely.

Windows 7, 64bit
NVIDIA driver version 376.33
CUDA runtime version 6.5
GeForce GTX 960
Visual Studio 2010

Robert_Crovella · March 30, 2017, 2:21pm

You might want to try newer versions of CUDA (e.g. 8.0, instead of 6.5). Bugs get fixed all the time.

uk2017 · March 30, 2017, 3:26pm

I just upgraded to CUDA 8.0 but it did not fix the problem.

Robert_Crovella · March 30, 2017, 3:38pm

I wouldn’t be optimistic about getting help unless you can provide code that reproduces the issue, along with the steps needed to reproduce it.

uk2017 · March 31, 2017, 8:50am

I switched back to an older NVIDIA driver, version 353.06.
With this driver the problem does not occur.
But this is only a temporary solution, because the graphics cards supported by this driver will eventually no longer be available.

I will try to create a small program to reproduce the problem.

uk2017 · April 5, 2017, 11:10am

I was not able to create a small program that reproduces the problem. All I can say is:

The leak occurs in driver versions >= 369.04
and it does not occur in driver versions <= 368.81.

To trigger the problem you have to access a CUDA device from different threads at the same time.
When we remove calls to cudaDeviceSynchronize() from all threads but one, it occurs less frequently, but does not disappear completely.
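
The access pattern described here, several host threads using the same device at once with each thread calling cudaDeviceSynchronize(), can be sketched as follows. The thread count, buffer size, iteration count and the worker function are hypothetical, and std::thread needs a newer compiler than the Visual Studio 2010 listed in the first post. Since no small program reproduced the leak, this is only a starting point for a reproducer, not a confirmed one.

```cuda
#include <cstdio>
#include <thread>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical worker: each host thread owns one device buffer and
// repeatedly clears it and synchronizes the whole device.
static void worker(int id, int iterations, size_t bytes) {
    if (cudaSetDevice(0) != cudaSuccess) {
        std::fprintf(stderr, "thread %d: cudaSetDevice failed\n", id);
        return;
    }

    void* d_buf = nullptr;
    if (cudaMalloc(&d_buf, bytes) != cudaSuccess) {
        std::fprintf(stderr, "thread %d: cudaMalloc failed\n", id);
        return;
    }

    for (int i = 0; i < iterations; ++i) {
        cudaError_t err = cudaMemset(d_buf, 0, bytes);
        if (err == cudaSuccess) {
            err = cudaDeviceSynchronize();
        }
        if (err != cudaSuccess) {
            std::fprintf(stderr, "thread %d: %s\n", id,
                         cudaGetErrorString(err));
            break;
        }
    }

    cudaFree(d_buf);
}

int main() {
    const int kThreads = 4;           // assumed number of host threads
    const int kIterations = 100000;   // assumed iterations per thread
    const size_t kBytes = 1 << 20;    // assumed buffer size (1 MiB)

    // Start all workers so they hit the same device concurrently.
    std::vector<std::thread> threads;
    for (int t = 0; t < kThreads; ++t) {
        threads.emplace_back(worker, t, kIterations, kBytes);
    }
    for (auto& th : threads) {
        th.join();
    }

    cudaDeviceReset();
    return 0;
}
```

Running this against a driver on either side of the 368.81 / 369.04 boundary while watching the process's memory use would show whether the pattern alone is enough to trigger the growth.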
