GPU soft-reset / freeing all resources possible?

ehan6 · May 5, 2010, 5:34pm

Hi everybody,

my application repeatedly allocates and de-allocates large chunks of memory (up to 200MB, no easy way to avoid that, unfortunately). Part of the application also uses OpenGL. I run into the problem that cudaMallocs randomly (?) fail even though cuMemGetInfo reports enough free memory (e.g., a 200MB malloc fails even though 300MB are reported free). The failure rate depends on the delay between application launches (waiting between launches reduces failure rate) and on how often I repeate allocation/deallocation operations (later operations during an application launch fail more often).

I suspect GPU memory fragmentation is the culprit. Therefore my question: is there any programmatic way (CUDA command, OpenGL, other interface) to force a GPU soft-reset or to otherwise free all allocated GPU memory? The application will be the only program running, so I could afford to lose all allocated GPU resources (if the OS is not be affected). I am running Windows XP.

Any ideas? Let me know if you need further details.

Cheers, and thanks,
ehan6

Gregory_Diamos · May 6, 2010, 4:44am

cudaThreadExit() ?

tmurray · May 6, 2010, 5:47am

What driver are you using?

ehan6 · May 17, 2010, 8:28pm

Sorry for the late reply, somehow my email alerts for new posts didn’t quite work …

cudaThreadExit() does the job of freeing all resources that were allocated in CUDA. But, apparently, this does not free resources allocated with OpenGL routines before. I can clearly see that running GPU-memory intensive tasks before my CUDA algorithms makes the CUDA part fail randomly with ‘out of memory’ errors. When not running the OpenGL part, the CUDA algorithms run fine. But, as far as I see, the OpenGL code deallocates all its resources on exit. So that’s a bit puzzling. Also, when I run the OpenGL part, wait for a few minutes, and then run the CUDA part, it usually also works fine. A vague guess would be that the driver does some clean-up tasks in regular intervals, so some resources are not free’d on the spot.

I run on Microsoft Windows XP SP 2, driver version 197.59 .

Thanks,
ehan6

ehan6 · June 2, 2010, 6:11pm

bump …

SombraLee · October 27, 2010, 9:34am

ehan6, did you get to solve this? Im having the same problem…

SombraLee · October 27, 2010, 9:34am

ehan6, did you get to solve this? Im having the same problem…

ehan6 · October 27, 2010, 2:49pm

No, I haven’t solved this yet. Sorry.

ehan6 · October 27, 2010, 2:49pm

No, I haven’t solved this yet. Sorry.

mkaushik · October 27, 2010, 6:08pm

On Linux, rmmod + insmod of nvidia.ko will do the trick.
On Vista/Win7, you can reset the driver using devcon (Windows Device Console (Devcon.exe) - Windows drivers | Microsoft Docs)
On WinXP…i don’t know. Ideally when you quit a graphics app, the driver’s video memory heap should return to its previous state and there should be no fragmentation. Maybe you should open task manager and kill all clients that might be using the heap.

mkaushik · October 27, 2010, 6:08pm

On Linux, rmmod + insmod of nvidia.ko will do the trick.
On Vista/Win7, you can reset the driver using devcon (Windows Device Console (Devcon.exe) - Windows drivers | Microsoft Docs)
On WinXP…i don’t know. Ideally when you quit a graphics app, the driver’s video memory heap should return to its previous state and there should be no fragmentation. Maybe you should open task manager and kill all clients that might be using the heap.

ehan6 · October 27, 2010, 6:17pm

Thanks! That’s the first real answer I get to that question. However, I would like to do the ‘reset’ without killing the respective application. Maybe that’s asking a bit too much.

ehan6 · October 27, 2010, 6:17pm

Thanks! That’s the first real answer I get to that question. However, I would like to do the ‘reset’ without killing the respective application. Maybe that’s asking a bit too much.

Lev · December 9, 2010, 12:23am

With Fermi you may use custom memory manager.

minhtuan · February 21, 2011, 2:37pm

Hi Lev,

could you please address in more detail?

Thanks,

Tuan

Topic		Replies	Views
CudaMalloc on Vista : strange behaviour Works on XP, Fails on Vista CUDA Programming and Performance	6	12258	July 1, 2009
Cuda Out of Memory with tons of memory left? CUDA Programming and Performance	5	39025	December 23, 2009
Device Memory Mangement CUDA Programming and Performance	14	3460	December 5, 2008
using cudaMalloc and cudaFree within a loop unspecified launch failure! CUDA Programming and Performance	21	37702	April 23, 2009
how to effectively free large memory allocation CUDA Programming and Performance	8	7661	November 5, 2015
Runtime memory leaks? How do you clear them? CUDA Programming and Performance	3	5067	October 22, 2008
mem allocation without free / remain data on mem? CUDA Programming and Performance	2	5261	May 15, 2007
cudaMalloc KILLED on tx2, and the memory can not be cudaFree real Jetson TX2	10	1125	December 2, 2019
Keep previously allocated memory on GPU CUDA Programming and Performance	5	1564	July 2, 2010
Is there any way to diagnose GPU memory fragmentation & defrag nvidia GPU RAM? OpenGL	6	1478	January 3, 2024

GPU soft-reset / freeing all resources possible?

Related topics