I have 3 GB of VRAM and I can only allocate an array of 520^3 vector4 elements; above that, the application just ignores the malloc.
I’ve tried cudaMallocManaged() and cudaMallocHost(), but they are both limited to the (shared) VRAM size as well, and they slow performance by about 70%.
Assuming this is a Windows 10 system using the default WDDM driver, the maximum allocation via cudaMalloc will be about 81% of the GPU memory. With 3 GB of GPU memory, this is about 2,600,000,000 bytes.
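For reference, assuming your vector4 is a 16-byte float4, 520^3 elements works out to 140,608,000 × 16 bytes = 2,249,728,000 bytes, already within sight of that cap, so the limit you are hitting is consistent with WDDM behavior.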
For GPUs with more memory, this percentage can be a tad higher. For example, on my Quadro RTX 4000 with 8 GB of GPU memory, the maximum allocation size via cudaMalloc is 7,060,320,000 bytes, or 82.2% of total physical memory.
If the programmer requests more memory than can be allocated, cudaMalloc() informs the programmer of that fact via the returned status code (cudaErrorMemoryAllocation). That’s different from silently ignoring the request and is the best it can do.
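A minimal sketch of what checking that status code looks like; the element count and float4 element type are taken from your post, the rest is standard CUDA runtime API:

```cpp
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    const size_t n = 520ULL * 520ULL * 520ULL;  // 140,608,000 elements
    const size_t bytes = n * sizeof(float4);    // ~2.25 GB

    float4 *d_data = nullptr;
    cudaError_t err = cudaMalloc(&d_data, bytes);
    if (err != cudaSuccess) {
        // An oversized request fails with cudaErrorMemoryAllocation;
        // it is reported here, not silently ignored.
        fprintf(stderr, "cudaMalloc of %zu bytes failed: %s\n",
                bytes, cudaGetErrorString(err));
        return 1;
    }
    printf("allocated %zu bytes\n", bytes);
    cudaFree(d_data);
    return 0;
}
```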
NVIDIA offers GPUs with all kinds of memory sizes, up to 80 GB, so you might want to use a different one. I read that cloud services offer large GPU instances at reasonable prices, or you could look into buying more capable hardware, possibly previously owned if your budget is tight.
You might also want to ponder whether your data could be stored more efficiently, e.g. by using half precision instead of single precision.
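As a hedged sketch of what that might look like: the half4 struct and the scale kernel below are made up for illustration (CUDA itself only provides __half and __half2 in cuda_fp16.h), with storage in fp16 and arithmetic still done in fp32:

```cpp
#include <cuda_fp16.h>
#include <cuda_runtime.h>

// Hypothetical 8-byte element: four fp16 components instead of a
// 16-byte float4, halving the memory footprint of the array.
struct half4 { __half x, y, z, w; };

__device__ half4 pack(float4 v) {
    return { __float2half(v.x), __float2half(v.y),
             __float2half(v.z), __float2half(v.w) };
}

__device__ float4 unpack(half4 v) {
    return make_float4(__half2float(v.x), __half2float(v.y),
                       __half2float(v.z), __half2float(v.w));
}

// Illustrative kernel: load fp16 storage, compute in fp32, store fp16.
// Launch e.g. as: scale<<<(n + 255) / 256, 256>>>(d_data, 2.0f, n);
__global__ void scale(half4 *data, float s, size_t n) {
    size_t i = blockIdx.x * (size_t)blockDim.x + threadIdx.x;
    if (i < n) {
        float4 v = unpack(data[i]);
        v.x *= s; v.y *= s; v.z *= s; v.w *= s;
        data[i] = pack(v);
    }
}
```

At 8 bytes per element, a 520^3 array shrinks to about 1.12 GB, at the cost of fp16’s reduced precision and range.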
I cannot make forward-looking statements, but I’m pretty familiar with the GPUs produced by NVIDIA since about 1998. We have never produced a discrete GPU with upgradeable memory. It’s a possible data point to consider when speculating about the future.