I have code that runs fast on the GPU but requires much more RAM than it does when it runs on the CPU. For example, the code runs in 8 GB of CPU RAM but requires 36 GB of GPU RAM. I'm now trying to run it on an A10G instance (which has 4x GPUs with 24 GB of RAM each), so my program won't run there, because 24 GB is less than the 36 GB of RAM I was using on the T4.
So the next question is: can I modify my code to use less GPU RAM?
There is no magic expansion of memory requirements when porting host-based code to the GPU. What you are observing would appear to be specific to this particular application of yours.
In the absence of further information, it appears that you chose to use more RAM in the GPU version of your code, presumably as a trade-off between memory usage and performance. Since we have not been told anything about this code, offering advice on how to shrink its memory footprint is not really possible.
If this were my code, I would revisit the original design decisions that caused RAM usage to bloat by a factor of more than 4 when porting the code to the GPU. This could involve an examination of the data structures and data types involved, for example.
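To make the data-type angle concrete: the footprint of a dense array scales directly with element width, so a double-to-float (or float-to-half) conversion halves it, provided the algorithm tolerates the reduced precision. A minimal sketch of that arithmetic (the element count N is made up purely for illustration):

```python
import numpy as np

N = 1_000_000_000  # hypothetical element count, for illustration only

def footprint_gb(dtype, n=N):
    """Bytes a dense array of n elements of the given dtype would occupy, in GB."""
    return np.dtype(dtype).itemsize * n / 1e9

print(footprint_gb(np.float64))  # 8.0
print(footprint_gb(np.float32))  # 4.0 -- half the footprint for the same data
print(footprint_gb(np.float16))  # 2.0 -- if the algorithm tolerates fp16
```

This is only back-of-the-envelope accounting, of course; whether the narrower type is usable depends on the numerical requirements of the algorithm.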
Here are some general references that I acquired while working on embedded and mobile products in the early 2000s. I do not have any particular recollection of the contents of any of them, and it is entirely possible that some or even much of the advice they provide is now outdated and/or no longer applicable.
David Loshin, Efficient Memory Programming, McGraw-Hill, 1999
Rene Alexander & Graham Bensley, C++ Footprint and Performance Optimization, Sams Publishing, 2000
James Noble & Charles Weir, Small Memory Software: Patterns for Systems with Limited Memory, Pearson Education, 2001
Kris Kaspersky, Code Optimization: Effective Memory Usage, A-List LLC, 2003
Frantisek Franek, Memory as a Programming Concept in C and C++, Cambridge University Press, 2004
The question probably still remains unanswered:
Why does the same program require less RAM on the CPU than on the GPU?
It depends on how the CPU program was ported to the GPU. There are several reasons why GPU programs typically use more memory:
- They are often optimized for a certain GPU, or at least for a certain minimum amount of memory, and can often assume that no other program is using the GPU at the same time. That is typically not true on the CPU.
- GPUs use pipelining for memory copies and for the different stages of an algorithm. Whereas CPU code more often modifies data in place, GPU code often reads one block of memory and writes into a different one. That often makes it easier to avoid performance-costly synchronizations.
- GPUs often process more data at the same time to make better use of parallelization. E.g., in a video editing program, the GPU might process 8 frames at once while the CPU processes 1 frame (just an example).
- Nvidia GPUs benefit greatly from coalesced memory accesses, where a warp reads blocks of 32 or 128 bytes. Sometimes it is possible to improve memory coalescing at the cost of higher overall memory consumption.
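The bullets above can be combined into a back-of-the-envelope estimate of why a GPU port's working set grows. The function below is a sketch with entirely made-up numbers (frame_bytes, batch, staging_buffers are hypothetical parameters, not measurements from any real program); it just shows how batching, out-of-place processing, and staging buffers multiply together:

```python
def gpu_working_set_bytes(frame_bytes, batch=8, out_of_place=True, staging_buffers=2):
    """Rough, assumption-laden estimate of a working set.

    frame_bytes:     size of one unit of work (e.g., one video frame)
    batch:           units processed concurrently (GPU bullet 3 above)
    out_of_place:    separate input and output blocks (GPU bullet 2 above)
    staging_buffers: extra buffers for pipelined host<->device copies
    """
    copies = 2 if out_of_place else 1  # input block + separate output block
    return frame_bytes * batch * copies + frame_bytes * staging_buffers

# Hypothetical comparison: CPU path (in place, one frame, no staging)
# versus GPU path (out of place, 8 frames, double-buffered copies).
cpu = gpu_working_set_bytes(100_000_000, batch=1, out_of_place=False, staging_buffers=0)
gpu = gpu_working_set_bytes(100_000_000, batch=8, out_of_place=True, staging_buffers=2)
print(gpu / cpu)  # 18.0
```

The point is not the specific ratio, which depends entirely on the invented inputs, but that several independent, individually reasonable design choices multiply, which is how a 4x-plus blowup can arise without any single obviously wasteful allocation.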