High virtual memory consumption on Linux for CUDA programs: is it possible to avoid it?

Hi,

I see that when CUDA is used on Linux, a process's virtual memory consumption is large, approximately the size of the physical GPU memory plus the size of system memory.

Quoting this comment:
https://devtalk.nvidia.com/default/topic/493902/cuda-programming-and-performance/consumption-of-host-memory-increases-abnormally/

This is related to UVA. We have to carve out a chunk of virtual memory equal to the total physical GPU memory, plus the total system memory, plus some small fudge factor for alignment purposes. We actually throttle back on the UVA region if you run out of virtual memory; this will restrict the amount of memory you can allocate, though.

While this appears to have no performance consequences, is there some way to avoid it, for example by passing flags to cuCtxCreate() or by globally tweaking the driver settings?

Also, can someone link to documentation or a specification which states how virtual memory is used by CUDA on Linux? So far I have only found references to this behavior scattered across this forum and Stack Overflow.

Many thanks in advance.

One way to reduce the size of the allocation is to reduce the GPU “footprint” in a multi-GPU system, if not all GPUs are needed.

For example, if you have a CUDA code that only uses 1 GPU, but you run it on a system that has 4 GPUs, all 4 GPUs will contribute to the size of the virtual space reservation requested. You could reduce this, in this particular scenario, by setting the CUDA_VISIBLE_DEVICES environment variable to restrict the CUDA runtime to only a single GPU.

AFAIK there are no direct controls over virtual memory allocation performed by CUDA, and it is not formally documented anywhere.

Thanks Robert. The CUDA_VISIBLE_DEVICES trick could be useful, although only in limited scenarios.

Do you know whether it would be possible in principle to reduce the virtual memory usage, or whether this behavior is deeply ingrained in the Linux implementation?

Although it shouldn’t have any performance impact, it’s still a bit annoying, because it misleads users into thinking that the CUDA process is consuming a lot of memory.

In order to provide a unified address space, all physical memory (host system and GPUs) must be mapped into a single virtual space. As a result, CUDA’s virtual memory usage will look huge. I am not aware that this has negative consequences of any kind. What specific problems are you encountering?

I don’t see what is misleading here. The virtual memory usage is stated correctly by the operating system. Users may be thinking incorrectly about what that number means. If you care, you could educate them on virtual memory. Or you could pay no attention to what users think as long as things are working.

Agreed, as long as there are no performance penalties (and so far I have no evidence of any) I guess there is no issue with high virtual memory usage. Thanks.