PCI-E x16 and CUDA CUDA and concurrent 8GB/s bandwidth?

e.ping · December 14, 2007, 2:13am

PCI Express x 16 is said to have 4 GB/s of peak bandwidth per direction, and up to 8 GB/s concurrent bandwidth. I am curious to know if CUDA provides a mechanism to use this concurrent bandwidth of 8 GB/s for cards that support PCIe x 16.

VanDammage · December 14, 2007, 9:00am

PCI Express x16 v1.1. has a theoretical bandwidth of max. 4GB/s in each direction, but the bandwidth you’ll get depends on your mainboard chipset and the architecture.
The highest bandwidth I have experienced was around 3,5 GB/s with pinned memory.

PCI Express x16 v2.0. supports a max. bandwidth of max. 8GB/s, but theoretical of course.
The only consumer cards I know that support PCI Express x16 v2.0 are the GeForce 8800 GT(G92) and the GeForce 8800GTS 512MB (G92).
I have no experiences with these cards and their actual bandwidth in CUDA so I can’t comment on that.

Try running the BandwidthTest Sample from the CUDA SDK to see what you can expect with your card.

e.ping · December 14, 2007, 9:32am

What about in an ideal configuration: would

bandwidthTest --memory=pinned
run at 4 GB/s or 8 GB/s? And if it’s 4 GB/s is there (another) CUDA example where the bandwidth whould be above 4 GB/s?

VanDammage · December 14, 2007, 10:10am

you will never get the full 4GB/s in “real life”. There are always some constraints in the hardware of your system.

Only if you use PCI Express 2.0 which doubles the theoretical bandwidth of 1.1 assuming you’re also using a graphics card with PCI Express 2.0 support.
Maybe you can get up to 7GB/sec then.

The bandwidthTest from the SDK reflects only what you can expect from your system.

MisterAnderson42 · December 14, 2007, 2:36pm

See [url=“The Official NVIDIA Forums | NVIDIA”]The Official NVIDIA Forums | NVIDIA for a precise answer why ~3.4GiB/s is the peak. You are making full use of unoverclocked hardware if you get this performance.

P.S. to anyone with a PCIe-2.0 MB and a PCIe-2.0 capable card, please post the output of bandwidthTest “–memory=pinned -mode=shmoo”. I’m sure many of us would love to see how close you can get to the 8GiB/s peak… The actual bandwidth available to the RAM on the MB may become the limiting factor now.

Topic		Replies	Views
Maximum bandwidth with Intel Z68 Chip CUDA Programming and Performance	8	7703	August 16, 2011
how to achieve equal bandwith to 3 GPUs in CUDA? (searching for recent motherboards) CUDA Programming and Performance	13	6506	January 11, 2009
PCI Express x16 bandwidth - host<->device transfer Bandwidth is much lower than should be CUDA Programming and Performance	38	68259	April 18, 2008
Where has all the bandwidth gone? Bandwidth loss with concurrent sends on "independent" PCIe CUDA Programming and Performance	3	1141	February 29, 2012
Info please: x16 PCI Express slot outdated? Upgrade from Quadro FX 1500 to GTX470? (CUDA) CUDA Programming and Performance	5	7644	January 5, 2011
Bandwidth Test uses full cpu ? CUDA Programming and Performance	6	7719	August 15, 2007
What is the full potential of my GPU? CUDA Programming and Performance	9	6227	September 11, 2008
Bandwidht Usage CUDA Programming and Performance	16	9005	October 30, 2008
Memory bandwidth CUDA Programming and Performance	31	38614	October 5, 2007
Bandwidth problem ? Could anyone verify that this is normal? CUDA Programming and Performance	7	3634	April 25, 2008

PCI-E x16 and CUDA CUDA and concurrent 8GB/s bandwidth?

Related topics