I read that newer drivers implement offloading VRAM to CPU RAM, or treating CPU RAM as an extension of VRAM, for applications like Stable Diffusion or running large language models, so that larger models can be loaded.
Which driver version for Linux supports this? Do I need to set anything for this to work? My system has an RTX 3060 and an RTX 4070.
I am also looking into how to achieve this on Linux (CUDA – Sysmem Fallback Policy).
By default, offloading to RAM does not seem to be active.
Torch reports the following error when trying to use more than the available GPU memory:
torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 13.04 GiB. GPU 0 has a total capacity of 11.76 GiB of which 11.64 GiB is free.
My configuration is as follows:
Ubuntu 24.04
NVIDIA GeForce RTX 3060 12GB
Driver Version: 550.90.07
CUDA Version: 12.4
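To clarify what the missing policy would do: "sysmem fallback" means an allocation that no longer fits in VRAM spills into host RAM instead of raising an out-of-memory error. The sketch below is a purely illustrative Python model of that behavior (it is not NVIDIA's implementation and does not touch the driver); the pool sizes mirror the numbers from the error message above, and the 32 GiB host figure is a made-up example.

```python
class PoolAllocator:
    """Toy memory pool tracking usage in GiB (illustration only)."""

    def __init__(self, capacity_gib: float):
        self.capacity = capacity_gib
        self.used = 0.0

    def try_alloc(self, size_gib: float) -> bool:
        # Refuse the allocation if it would exceed the pool's capacity.
        if self.used + size_gib > self.capacity:
            return False
        self.used += size_gib
        return True


def alloc_with_fallback(vram: PoolAllocator, sysmem: PoolAllocator,
                        size_gib: float) -> str:
    """Try VRAM first; with a fallback policy, spill to system RAM."""
    if vram.try_alloc(size_gib):
        return "vram"
    if sysmem.try_alloc(size_gib):
        return "sysmem"
    raise MemoryError(f"cannot allocate {size_gib} GiB anywhere")


vram = PoolAllocator(11.76)   # usable capacity reported for the RTX 3060
sysmem = PoolAllocator(32.0)  # hypothetical host RAM headroom

print(alloc_with_fallback(vram, sysmem, 8.0))    # fits in VRAM
print(alloc_with_fallback(vram, sysmem, 13.04))  # the failing 13.04 GiB request
```

Without the fallback (the current Linux behavior), the second request would simply raise `torch.OutOfMemoryError`, which is what the traceback above shows.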
How has this still not received any comment? This is a critical feature, and its absence cripples all NVIDIA cards on Linux.
Because there is another thread about this: Non-existent shared VRAM on NVIDIA Linux drivers - #73 by lucasggamerm