P2P peer communication is slower than the bandwidth between GPU and CPU

rxu · June 5, 2011, 5:39am

Hi, everyone!
I used CUDA 4.0 to test the bandwidth between CPU->GPU, GPU->CPU, and GPU<->GPU in a system with Tesla C2050. I ran the SDK sample program bandwidthTest.cu and simpleP2P.cu. The result shows the bandwidth between CPU and GPU is around 2.9~3.0GB/s, while the bandwidth between GPU and GPU is only about 2.4GB/s. Why the data transfer rate between GPUs is even slower than CPU and GPU? Can anyone explain that? Have you tested your program and got the similar conclusion?

Topic		Replies	Views
Questions about p2pBandwidthLatencyTest CUDA Programming and Performance	2	827	July 16, 2019
the bandwidth is low between my gpus. tested with p2pBandwidthLatencyTest CUDA Programming and Performance	0	643	March 28, 2018
Bandwidth disparity between Host-Device-Device-Host CUDA Programming and Performance	2	880	August 24, 2011
low GPU-to-GPU bandwidth with Titan V compared to Titan X CUDA Setup and Installation	1	504	June 25, 2020
Question about P2P transfer bandwidth between two RTX2080s CUDA Programming and Performance	1	503	November 2, 2023
CPU <--> GPU is getting slow ? CUDA Programming and Performance	0	1130	November 6, 2008
cudaMemcpy2D slow with TESLA1060 ? CUDA Programming and Performance	3	2765	November 6, 2009
GPU Peer to Peer communication bandwidth Test result is confused (the furthest card is the best} CUDA Programming and Performance	0	375	March 25, 2020
Cuda -> OpenGL bandwidth CUDA Programming and Performance	6	3243	August 21, 2008
PCI-E bottleneck when transferring data between CPU and GPU CUDA Programming and Performance	2	11221	April 28, 2011

P2P peer communication is slower than the bandwidth between GPU and CPU

Related topics