The most likely cause of this extremely low host/device throughput is that the GPU is plugged into the wrong PCIe slot. It belongs in a PCIe gen3 x16-capable slot, which should yield transfer rates of 12+ GB/s. For comparison, here is bandwidthTest output from a Quadro P2000 in a gen3 x16 slot:
[CUDA Bandwidth Test] - Starting...
Running on...

 Device 0: Quadro P2000
 Quick Mode

 Host to Device Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)        Bandwidth(MB/s)
   33554432                     12327.1

 Device to Host Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)        Bandwidth(MB/s)
   33554432                     12364.3

 Device to Device Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)        Bandwidth(MB/s)
   33554432                     119536.8

Result = PASS

NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.
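As a sanity check on the ~12 GB/s figure, the theoretical limit of a PCIe gen3 x16 link can be worked out from the gen3 line rate (8 GT/s per lane) and the 128b/130b encoding. This is a back-of-the-envelope sketch, not part of bandwidthTest itself:

```python
# Theoretical one-direction bandwidth of a PCIe gen3 x16 link.
transfers_per_s = 8e9        # gen3 line rate: 8 GT/s per lane
encoding = 128 / 130         # 128b/130b encoding overhead
lanes = 16
bytes_per_transfer = 1 / 8   # one bit per transfer, converted to bytes

theoretical_gb_s = transfers_per_s * encoding * bytes_per_transfer * lanes / 1e9
print(f"Theoretical gen3 x16 bandwidth: {theoretical_gb_s:.2f} GB/s")  # ~15.75 GB/s
```

Protocol overhead (TLP headers, flow control, etc.) brings achievable host/device throughput down from that ~15.75 GB/s ceiling to roughly 12-13 GB/s, which is what the pinned-memory numbers above show.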
Check your PCIe link configuration by looking at the output of nvidia-smi -q:
    GPU Link Info
        PCIe Generation
            Max                 : 3
            Current             : 3    <--------------
        Link Width
            Max                 : 16x
            Current             : 16x  <--------------
If you look at this output while bandwidthTest (or other CUDA software that performs frequent host/device transfers) is running, the "Current" entries should show generation 3 and link width x16; I took the above snapshot while Folding@Home was running. You can also use third-party software such as TechPowerUp's GPU-Z to monitor the link configuration.
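If you want to check this programmatically rather than by eye, here is a small sketch (not from the original post) that pulls the generation and width fields out of captured nvidia-smi -q text. The sample string below mirrors the output shown above; on a live system you would replace it with a real capture:

```python
import re

# Sample captured from `nvidia-smi -q`. On a real system, replace this with e.g.
# subprocess.run(["nvidia-smi", "-q"], capture_output=True, text=True).stdout
sample = """\
GPU Link Info
    PCIe Generation
        Max                 : 3
        Current             : 3
    Link Width
        Max                 : 16x
        Current             : 16x
"""

def pcie_link_status(report: str) -> dict:
    """Extract max/current PCIe generation and link width from nvidia-smi -q text."""
    gen = re.search(
        r"PCIe Generation\s*\n\s*Max\s*:\s*(\d+)\s*\n\s*Current\s*:\s*(\d+)", report)
    width = re.search(
        r"Link Width\s*\n\s*Max\s*:\s*(\d+)x\s*\n\s*Current\s*:\s*(\d+)x", report)
    return {
        "gen_max": int(gen.group(1)), "gen_current": int(gen.group(2)),
        "width_max": int(width.group(1)), "width_current": int(width.group(2)),
    }

status = pcie_link_status(sample)
print(status)
```

nvidia-smi can also report these fields directly in machine-readable form via its query interface, e.g. `nvidia-smi --query-gpu=pcie.link.gen.current,pcie.link.width.current --format=csv`, which avoids parsing the human-readable -q layout.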