Why is GPU memory size so small?

Hi, I'm wondering why the available GPU memory is so much smaller than main memory.

Since GPUs have been proposed for scientific computing in recent years, several GB is clearly not enough for many scientific applications. Most desktop GPUs only have around 1 GB of memory, and even compute-specific products such as the Tesla S2050 only have 3 GB per processor. I think this limits the use of GPUs in real-world applications.

Does anybody know the reasons? Is this due to a technical bottleneck, or does NVIDIA simply not want to make GPUs with large memory (maybe several GB is enough for image processing)?

Thanks,
Mian

It will be a combination of price and what most people do with the cards. You can get Tesla cards with 4 GB of memory in them; if that isn't enough, buy a second one and you have 8 GB of RAM at your disposal.

Several GB clearly IS enough at the moment per card, and CUDA continues to be used successfully in many scientific environments.

There are servers available with multiple cards in them totalling 16 GB of memory if you really need it, but they cost a very large amount of money.

It’s because of the ultra-high bandwidth the memory has to provide. You cannot just add more memory chips, because the increased electrical capacitance (and decreased resistance) would spoil the timing on the memory bus.

I can see there are still a large number of scientific applications requiring very large memory. Yes, that is also my point. I would guess that price limits the use of GPUs with large memory (but I do not know why GPU memory is more expensive, or where the technical bottleneck is).

BTW, can you give me more information about the 16 GB configuration, please? Thanks a lot!

GPUs also tend to use more expensive memory technologies than CPUs, because the GPU demands very high memory bandwidth. As an example, the GTX 480 has a 384-bit wide bus, yet achieves a peak transfer rate of 177.4 GB/sec. In comparison, an Intel Core i7 CPU with a 192-bit wide bus (in the triple-channel configuration) achieves a peak transfer rate of something like 25-40 GB/sec, depending on the speed of the DDR3 you install. Correcting for the differing bus widths, that means the GPU memory technology (GDDR5 in this case) needs to move data across each bus line at more than twice the rate of DDR3 memory.

This makes sense to me :) The bandwidth is much higher than that of main memory.

thank you very much.

As a side note, most of the large supercomputers listed on TeraGrid weigh in at about 2 GiB of RAM per (CPU) core. Put in these terms, the GPU memory ceilings aren’t so low.

Even if there is not enough memory for large data sets for an application, I would imagine that most of the time you can tile the input or output so that the current working set does fit in the 1-6GB that the card has. You might even be able to use double buffering, streams, and asynchronous I/O in order to overlap data transfer on one buffer with execution on the other buffer…
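A minimal sketch of that double-buffering idea, assuming a hypothetical `process_tile` kernel and equally sized tiles (the buffer count, tile size, and kernel are placeholders, not a real API). The CUDA stream and async-copy calls are the standard runtime API:

```cuda
#include <cuda_runtime.h>

// Placeholder kernel: processes one tile of the input in place.
__global__ void process_tile(float *d_buf, size_t n) { /* ... */ }

// Tile a large host array through two device buffers on two streams,
// so the copy of tile t can overlap with the kernel running on tile t-1.
void run_tiled(const float *h_input, size_t n_tiles, size_t tile_elems)
{
    float *d_buf[2];
    cudaStream_t stream[2];
    size_t tile_bytes = tile_elems * sizeof(float);

    for (int i = 0; i < 2; ++i) {
        cudaMalloc(&d_buf[i], tile_bytes);
        cudaStreamCreate(&stream[i]);
    }

    for (size_t t = 0; t < n_tiles; ++t) {
        int b = t % 2;  // alternate between the two buffers/streams
        // Note: h_input should be pinned (cudaHostAlloc) for the copy
        // to actually overlap with kernel execution.
        cudaMemcpyAsync(d_buf[b], h_input + t * tile_elems, tile_bytes,
                        cudaMemcpyHostToDevice, stream[b]);
        process_tile<<<(tile_elems + 255) / 256, 256, 0, stream[b]>>>(
            d_buf[b], tile_elems);
    }
    cudaDeviceSynchronize();

    for (int i = 0; i < 2; ++i) {
        cudaFree(d_buf[i]);
        cudaStreamDestroy(stream[i]);
    }
}
```

Because work issued to different streams can execute concurrently, the host-to-device copy on one stream hides behind the kernel on the other, so the effective working set is only limited by how finely the problem can be tiled, not by the card's total memory.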

Sure, that is true; you can always use such methods :)

You already have cards with 6 GB and 9 GB supposedly coming near the end of the year. The problems are probably price, availability, and the difficulty of putting a lot of RAM on the card.

More than 4 GB was only technically possible with Fermi anyway, as it requires 64-bit addressing.