GPU Cluster (2 cards) faulty bandwidth time estimate

invsoigne · November 14, 2008, 3:46pm

Hello everyone!

I’m trying to help port a lot of python code into CUDA so as to be able to make our analysis code run faster. We have access to a cluster of GPU cards, and I was just running through the packaged CUDA bandwidthTest, and recieved some interesting results:

Quick Mode
Device to Device Bandwidth
.
Transfer Size (Bytes) Bandwidth(MB/s)
16777216 2133333.2

This is a very nice result, but I can only assume it’s faulty. The cluster consists of a head node, and a “collection” of compute nodes. The nodes are all built around an Intel Core 2 Duo (E6850) CPU on an ASUS P5N32-E motherboard. The compute nodes each sport one NVIDIA GeForce 8800 GTX GPU. (Ultimately we want to have two GPU’s in each compute node).

As it stands, this is at least one order of magnitude greater than I think would be a believable result. Is there anyone else who has run into this problem, or anyone with any ideas for suggestions? My ultimate goal being to be able to run some sort of bandwidthTest (modified perhaps) that gives an accurate result.

Thank you!

netllama · November 14, 2008, 4:04pm

Which driver are you using?
What’s the full output?
Do the other SDK apps work ok?

invsoigne · November 14, 2008, 7:36pm

I don’t know the driver, but I will ask.

Other apps seem to work ok.

Here’s the output:

Quick Mode
Host to Device Bandwidth for Pageable memory
.
Transfer Size (Bytes) Bandwidth(MB/s)
16777216 1134751.9

Quick Mode
Device to Host Bandwidth for Pageable memory
.
Transfer Size (Bytes) Bandwidth(MB/s)
16777216 1176470.5

Quick Mode
Device to Device Bandwidth
.
Transfer Size (Bytes) Bandwidth(MB/s)
16777216 2064516.1

&&&& Test PASSED

invsoigne · November 14, 2008, 8:18pm

The driver most likely is the:
NVIDIA Driver for Linux with CUDA Support (169.09)

It is definatly a Linux driver, and almost assuredly an older version.

Thank you.

netllama · November 14, 2008, 8:20pm

Yes that driver is rather old. You need to upgrade to the CUDA_2.0 driver.

Topic		Replies	Views
Low Device to Device Bandwidth CUDA Programming and Performance	11	3553	May 4, 2009
Low memory bandwidth CUDA Programming and Performance	4	7248	March 10, 2008
Extremely low bandwidth CUDA Programming and Performance	10	2105	September 4, 2010
device memory bandwidth issues with 177.67 lower then expected CUDA Programming and Performance	7	5479	October 5, 2008
Very low device to device bandwidth with bandwidth test example from SDK CUDA Programming and Performance	2	6182	June 21, 2007
Bandwith problems with S870 and 177.67 CUDA Programming and Performance	18	12291	January 26, 2009
CPU <--> GPU is getting slow ? CUDA Programming and Performance	0	1159	November 6, 2008
Weird bandwidth issues CUDA Programming and Performance	8	1504	December 1, 2016
Bandwidht Usage CUDA Programming and Performance	16	9054	October 30, 2008
Host<-> device bandwidth problems slow and intermittent bandwidth on linux CUDA Programming and Performance	9	6826	January 8, 2008

GPU Cluster (2 cards) faulty bandwidth time estimate

Related topics