Peer-to-Peer Communication

Hi,

Quoting from Professional CUDA C Programming: “Kernels executing in 64-bit applications on devices with compute capability 2.0 and higher can directly access the global memory of any GPU connected to the same PCIE root node.[…] requires CUDA 4.0 or higher […] a system with two or more Fermi or Kepler GPUs […]”.
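For reference (this sketch is not from the book or the sample, just the runtime API calls that simpleP2P exercises under the hood), checking and enabling P2P between two devices looks roughly like this, assuming device IDs 0 and 1:

```cuda
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int canAccess01 = 0, canAccess10 = 0;

    // Ask the runtime whether each device can map the other's memory.
    cudaDeviceCanAccessPeer(&canAccess01, 0, 1);
    cudaDeviceCanAccessPeer(&canAccess10, 1, 0);
    printf("GPU0 -> GPU1: %s\n", canAccess01 ? "Yes" : "No");
    printf("GPU1 -> GPU0: %s\n", canAccess10 ? "Yes" : "No");

    if (canAccess01 && canAccess10) {
        // Enable both directions; each call is issued from the accessing device.
        cudaSetDevice(0);
        cudaDeviceEnablePeerAccess(1, 0);  // second argument (flags) must be 0
        cudaSetDevice(1);
        cudaDeviceEnablePeerAccess(0, 0);
        // From here on, cudaMemcpyPeer() and direct dereferences in kernels
        // go over PCIe without staging through host memory.
    }
    return 0;
}
```

This is the same `cudaDeviceCanAccessPeer` query whose result shows up as the "Yes"/"No" lines in the simpleP2P output below.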

I am working on a machine with two Tesla C2075s (Fermi) on an X9DR3-F motherboard with two Xeon CPUs. I’ve installed Ubuntu 12.04 and CUDA 6.5. Here is the output of the simpleP2P sample program:

[./simpleP2P] - Starting...
Checking for multiple GPUs...
CUDA-capable device count: 2
> GPU0 = "    Tesla C2075" IS  capable of Peer-to-Peer (P2P)
> GPU1 = "    Tesla C2075" IS  capable of Peer-to-Peer (P2P)

Checking GPU(s) for support of peer to peer memory access...
> Peer-to-Peer (P2P) access from Tesla C2075 (GPU0) -> Tesla C2075 (GPU1) : No
> Peer-to-Peer (P2P) access from Tesla C2075 (GPU1) -> Tesla C2075 (GPU0) : No
Two or more GPUs with SM 2.0 or higher capability are required for ./simpleP2P.
Peer to Peer access is not available between GPU0 <-> GPU1, waiving test.

Is there any reason why I can’t use peer-to-peer memory access? I’ll also attach the results of lspci -tv:

[...]
 +-[0000:80]-+-01.0-[81]--
 |           +-02.0-[82]--
 |           +-03.0-[83]----00.0  NVIDIA Corporation GF110GL [Tesla C2050 / C2075]
[...]
\-[0000:00]-+-00.0  Intel Corporation Ivytown DMI2
             +-01.0-[01]--
             +-01.1-[02-03]--+-00.0  Intel Corporation I350 Gigabit Network Connection
             |               \-00.1  Intel Corporation I350 Gigabit Network Connection
             +-02.0-[04]----00.0  NVIDIA Corporation GF110GL [Tesla C2050 / C2075]

If the problem is with the hardware, does that mean the motherboard needs to be replaced?

Best regards

OK, so in case anyone ever needs this information:

I contacted the system administrator to have a look at the physical connections of the GPUs. This is the motherboard diagram:

It turns out the first Tesla was in slot 2 and the second in slot 4. For P2P to work, both cards must be connected to the same CPU, avoiding QPI links, so the card in slot 2 was moved to slot 6. It’s a bit confusing because slots 4 and 6 are physically the first two slots of the motherboard, but the technician preferred to plug the cards at the bottom of the board rather than at the top.
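After reseating the cards, one way to confirm the fix without re-running simpleP2P is to print each GPU's PCI location and re-check peer access for every pair. This is a small sketch (not from the original post): both cards should now report bus IDs under the same root complex, and every direction should say "Yes".

```cuda
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int n = 0;
    cudaGetDeviceCount(&n);

    // Print each GPU's PCI location; after the move, both cards should sit
    // under the same root complex (i.e. attached to the same CPU).
    for (int i = 0; i < n; ++i) {
        cudaDeviceProp p;
        cudaGetDeviceProperties(&p, i);
        printf("GPU%d: %s at %04x:%02x:%02x\n",
               i, p.name, p.pciDomainID, p.pciBusID, p.pciDeviceID);
    }

    // Re-check every device pair.
    for (int i = 0; i < n; ++i)
        for (int j = 0; j < n; ++j)
            if (i != j) {
                int ok = 0;
                cudaDeviceCanAccessPeer(&ok, i, j);
                printf("GPU%d -> GPU%d: %s\n", i, j, ok ? "Yes" : "No");
            }
    return 0;
}
```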

Hope it can help someone.