GPU sharing PCIe bus

dgs · March 30, 2010, 5:14pm

Dear all,

In the context of multi-GPU computing using OpenCL, is there any way to tell whether two GPUs are sharing the same PCIe channel?

Best regards,

Daniele

seibert · March 30, 2010, 8:59pm

This is complicated, since the ways that PCI-Express resources can be shared vary:

Some motherboards cut the number of lanes to some slots in half when multiple cards are inserted. Then each card gets an x8 link, which has half the bandwidth of the normal x16, and that bandwidth is fixed at that half-max level whether or not both devices are active at the same time.
Some motherboards use a PCI-Express switch (like the NF200) to share an x16 link between two cards. In this case, each card can use the full x16 bandwidth when it is the only one transferring, but that bandwidth can be cut in half if both devices are using the link at the same time. This is the closest approximation to “bus-style” sharing you see in PCI-e; however, I’m not aware of any motherboard that shares an x16 link among more than two slots.
If you are using a GTX 295, then the two GPUs are already using an NF200 to share the single slot, much like above.

Figuring out which scenario you are in is probably OS-specific, and requires some way to query the low-level PCI-Express topology of the system.
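As a rough illustration of what such a query could look like on Linux: the resolved sysfs path of a PCI device lists every bridge between the root complex and the device, so GPUs whose paths start with the same root port sit behind the same upstream link. The sketch below assumes you already know the PCI addresses of the GPUs (for example from `lspci`); the helper names `pci_chain`, `group_by_root_port` and `resolve` are made up for this example, and it does not tell the lane-splitting case apart from the switch case.

```python
import os
import re
from collections import defaultdict

# A PCI address as it appears in sysfs: domain:bus:device.function
PCI_ADDR = re.compile(r"^[0-9a-fA-F]{4}:[0-9a-fA-F]{2}:[0-9a-fA-F]{2}\.[0-7]$")


def pci_chain(resolved_path):
    """Return the PCI addresses found in a resolved sysfs path, root side first."""
    return [part for part in resolved_path.split("/") if PCI_ADDR.match(part)]


def group_by_root_port(resolved_paths):
    """Group device addresses by the first PCI address in their sysfs path.

    resolved_paths maps a device address to its resolved sysfs path.
    A device attached directly to the root complex ends up in a group of its own.
    """
    groups = defaultdict(list)
    for addr, path in resolved_paths.items():
        chain = pci_chain(path)
        if not chain:
            raise ValueError("no PCI address found in path: " + path)
        groups[chain[0]].append(addr)
    return {port: sorted(devs) for port, devs in sorted(groups.items())}


def resolve(addresses):
    """Resolve the sysfs symlinks for the given PCI addresses (Linux only)."""
    base = "/sys/bus/pci/devices"
    return {a: os.path.realpath(os.path.join(base, a)) for a in addresses}
```

On a real machine you would call `group_by_root_port(resolve([...]))` with your own GPU addresses. Any group with more than one GPU is a candidate for shared bandwidth, though sharing above the root port (for example between sockets) is not visible this way.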

dgs · March 30, 2010, 9:48pm

Some of the nodes in our cluster have up to 6 GPUs, connected pair-wise through PCIe.

I wonder whether OpenCL offers a way, one I don’t know of, to find out which GPUs are connected through the same PCIe link, so that I can select decoupled GPUs when using 2 or 4 GPUs instead of 6.

Daniele
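Once the pairing is known by whatever means, picking decoupled GPUs is simple bookkeeping. Here is a minimal sketch, assuming a mapping from each shared link to the GPUs behind it is already available; the function name `pick_spread` and the labels `link0`, `gpu0` and so on are placeholders, not anything OpenCL reports.

```python
from itertools import zip_longest


def pick_spread(groups, n):
    """Pick n devices, taking one from every group before reusing any group.

    groups maps a shared-link label to the list of devices behind that link.
    """
    total = sum(len(devs) for devs in groups.values())
    if n > total:
        raise ValueError("asked for %d devices, only %d available" % (n, total))
    # One column per link, in a stable order so the result is reproducible.
    columns = [groups[key] for key in sorted(groups)]
    picked = []
    # Each round takes at most one device from every link.
    for one_per_link in zip_longest(*columns):
        for dev in one_per_link:
            if dev is not None and len(picked) < n:
                picked.append(dev)
    return picked


# Hypothetical node: 6 GPUs connected pair-wise, i.e. three links with two GPUs each.
node = {
    "link0": ["gpu0", "gpu1"],
    "link1": ["gpu2", "gpu3"],
    "link2": ["gpu4", "gpu5"],
}
two = pick_spread(node, 2)   # two GPUs on different links
four = pick_spread(node, 4)  # only one link ends up carrying two GPUs
```

With three pairs, a request for 2 GPUs never lands on a shared link, while a request for 4 has to double up on exactly one link.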

