Data transfer between two GPUs

wenshere · September 3, 2009, 2:33pm

If a machine has two GPUs, how does data transfer between the two GPUs’ device memories work? Are they connected by PCI-E two?
What speed can I expect?

LSChien · September 3, 2009, 3:59pm

Two GPUs cannot communicate with each other directly. You must bind two host threads to the two GPUs respectively; then you can exchange data between the two GPUs through the host threads.

[codebox]host thread 0 <------> GPU 0
  ^
  |
  |
  v
host thread 1 <------> GPU 1
[/codebox]
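The host-staged path in the diagram can be sketched roughly as follows. This is a minimal sketch, not the poster's code: it assumes a newer CUDA runtime in which a single host thread may switch devices with `cudaSetDevice` (with the 2009-era runtime discussed here, each GPU would need its own host thread, as described above). The names `copy_via_host` and `staged_copy_demo` are hypothetical.

```cuda
#include <vector>
#include <cuda_runtime.h>

// Copy `bytes` bytes from `src` (allocated on device src_dev) to `dst`
// (allocated on device dst_dev) by staging through a host buffer.
cudaError_t copy_via_host(void* dst, int dst_dev,
                          const void* src, int src_dev, size_t bytes) {
    std::vector<unsigned char> staging(bytes);  // ordinary pageable host memory
    cudaError_t err = cudaSetDevice(src_dev);
    if (err != cudaSuccess) return err;
    err = cudaMemcpy(staging.data(), src, bytes, cudaMemcpyDeviceToHost);
    if (err != cudaSuccess) return err;
    err = cudaSetDevice(dst_dev);
    if (err != cudaSuccess) return err;
    return cudaMemcpy(dst, staging.data(), bytes, cudaMemcpyHostToDevice);
}

// Hypothetical demo. Returns 0 on success, 1 if two CUDA devices are not
// available, -1 on a CUDA error or a data mismatch.
int staged_copy_demo() {
    int count = 0;
    if (cudaGetDeviceCount(&count) != cudaSuccess || count < 2) return 1;

    const size_t n = 1 << 20;
    const size_t bytes = n * sizeof(float);
    std::vector<float> in(n, 3.0f), out(n, 0.0f);
    float* d0 = nullptr;
    float* d1 = nullptr;

    // Source buffer lives on GPU 0.
    cudaSetDevice(0);
    if (cudaMalloc((void**)&d0, bytes) != cudaSuccess) return -1;
    cudaError_t err = cudaMemcpy(d0, in.data(), bytes, cudaMemcpyHostToDevice);

    // Destination buffer lives on GPU 1.
    cudaSetDevice(1);
    if (cudaMalloc((void**)&d1, bytes) != cudaSuccess) {
        cudaSetDevice(0);
        cudaFree(d0);
        return -1;
    }

    if (err == cudaSuccess) err = copy_via_host(d1, 1, d0, 0, bytes);
    if (err == cudaSuccess) {
        cudaSetDevice(1);
        err = cudaMemcpy(out.data(), d1, bytes, cudaMemcpyDeviceToHost);
    }

    cudaSetDevice(0);
    cudaFree(d0);
    cudaSetDevice(1);
    cudaFree(d1);

    if (err != cudaSuccess) return -1;
    return (out == in) ? 0 : -1;
}
```

Calling `staged_copy_demo()` from `main` and building with `nvcc` should be enough to try it. Every byte crosses the PCI-E bus twice (device to host, then host to device), which is the cost of this approach.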

parallelis · September 3, 2009, 7:13pm

In fact, two compute capability 1.3 devices can communicate with each other asynchronously using mapped pinned memory (host memory accessible by all the GPUs). This is the most interesting way to have two GPUs exchange data asynchronously, without stopping kernel execution on either side and without the host CPU being directly involved.

Note: it also works on the MCP79 IGP (GeForce 9300M/9400M). The 1.3 devices are the G200 GPUs: GTX 260 and above (the GTX 260M mobile GPU is NOT a 1.3 device but a rebranded 9800M!).

cheers :-)
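A rough sketch of the mapped pinned memory idea: one host buffer is allocated with `cudaHostAllocMapped | cudaHostAllocPortable`, each GPU obtains its own device pointer to it with `cudaHostGetDevicePointer`, and kernels on both GPUs read and write it directly. The kernel names and `zero_copy_demo` are hypothetical. For simplicity this sketch runs the two kernels one after the other with a synchronize in between; the fully concurrent exchange described above would need additional synchronization between the running kernels, which is not shown.

```cuda
#include <cuda_runtime.h>

// Runs on GPU 0: writes into the shared host buffer.
__global__ void produce(int* buf, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) buf[i] = i;
}

// Runs on GPU 1: reads what GPU 0 wrote and updates it in place.
__global__ void consume(int* buf, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) buf[i] *= 2;
}

// Hypothetical demo. Returns 0 on success, 1 if two devices that can map
// host memory are not available, -1 on a CUDA error or a data mismatch.
int zero_copy_demo() {
    int count = 0;
    if (cudaGetDeviceCount(&count) != cudaSuccess || count < 2) return 1;

    for (int dev = 0; dev < 2; ++dev) {
        cudaDeviceProp prop;
        if (cudaGetDeviceProperties(&prop, dev) != cudaSuccess) return -1;
        if (!prop.canMapHostMemory) return 1;
        cudaSetDevice(dev);
        // Older runtimes require this flag before the context is created.
        // Best effort: the result is ignored and any error is cleared.
        cudaSetDeviceFlags(cudaDeviceMapHost);
        cudaGetLastError();
    }

    const int n = 1 << 16;
    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;

    // Pinned host memory, mapped into device address space and usable by
    // every CUDA context (portable).
    int* host = nullptr;
    if (cudaHostAlloc((void**)&host, n * sizeof(int),
                      cudaHostAllocMapped | cudaHostAllocPortable) != cudaSuccess)
        return -1;

    int* p0 = nullptr;
    int* p1 = nullptr;

    cudaSetDevice(0);
    cudaError_t err = cudaHostGetDevicePointer((void**)&p0, host, 0);
    if (err == cudaSuccess) {
        produce<<<blocks, threads>>>(p0, n);
        err = cudaDeviceSynchronize();
    }

    if (err == cudaSuccess) {
        cudaSetDevice(1);
        err = cudaHostGetDevicePointer((void**)&p1, host, 0);
    }
    if (err == cudaSuccess) {
        consume<<<blocks, threads>>>(p1, n);
        err = cudaDeviceSynchronize();
    }

    // The host sees the result directly; no cudaMemcpy was issued.
    bool ok = (err == cudaSuccess);
    for (int i = 0; ok && i < n; ++i) ok = (host[i] == 2 * i);

    cudaFreeHost(host);
    return ok ? 0 : -1;
}
```

Note that the data still lives in host memory, so each GPU access travels over PCI-E; the gain is that no explicit copy calls or host-side staging code are needed.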

SPWorley · September 3, 2009, 8:25pm

A GTS260m is a 1.2 compute device and will indeed handle zero-copy memory. These are so new that they’re hard (impossible??) to find; their launch was more of an OEM availability announcement.

I’d rather have a GTS260m (with 96 SPs) than the GTX260m (with 112 SPs) just because of the 1.2 device support!
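Rather than relying on the product name, whether a given device supports zero-copy can be checked at run time. A small sketch, assuming the `canMapHostMemory` field of `cudaDeviceProp` is what you want to test; the function name `report_zero_copy_support` is hypothetical.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Print compute capability and zero-copy (mapped host memory) support for
// each CUDA device. Returns the number of devices found (0 if the runtime
// reports an error).
int report_zero_copy_support() {
    int count = 0;
    if (cudaGetDeviceCount(&count) != cudaSuccess) return 0;

    for (int dev = 0; dev < count; ++dev) {
        cudaDeviceProp prop;
        if (cudaGetDeviceProperties(&prop, dev) != cudaSuccess) continue;
        std::printf("device %d: %s, compute capability %d.%d, zero-copy: %s\n",
                    dev, prop.name, prop.major, prop.minor,
                    prop.canMapHostMemory ? "yes" : "no");
    }
    return count;
}
```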

wenshere · September 4, 2009, 6:13am

How about the GTX 295? Can the two GPUs on a GTX 295 not communicate with each other? Is the 1792 MB of memory shared by the two GPUs?

avidday · September 4, 2009, 6:38am

In CUDA, the two GPUs on the GTX 295 can’t communicate with each other. And the memory size is 2 x 896 MB, not 1792 MB, i.e. each GPU has its own discrete memory space that knows nothing about the other GPU.
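The separate memory spaces show up directly in the runtime API: memory is reported per device, not per card. A minimal sketch using `cudaMemGetInfo`; the function name `report_device_memory` is hypothetical.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Print free and total memory for each CUDA device separately.
// Returns the number of devices successfully queried.
int report_device_memory() {
    int count = 0;
    if (cudaGetDeviceCount(&count) != cudaSuccess) return 0;

    int reported = 0;
    for (int dev = 0; dev < count; ++dev) {
        size_t free_bytes = 0, total_bytes = 0;
        if (cudaSetDevice(dev) != cudaSuccess) continue;
        // Reports memory of the current device only.
        if (cudaMemGetInfo(&free_bytes, &total_bytes) != cudaSuccess) continue;
        std::printf("device %d: %.0f MB free of %.0f MB\n", dev,
                    free_bytes / (1024.0 * 1024.0),
                    total_bytes / (1024.0 * 1024.0));
        ++reported;
    }
    return reported;
}
```

On a dual-GPU card such as the one discussed here, this would be expected to list two devices, each with its own total.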

parallelis · September 9, 2009, 6:24pm

You have to think of the GTX 295 not as one card with two GPUs, but as logically two cards behind a PCI-Express bridge, each GPU having 896 MB of local memory.

And the interesting point is that these two logical cards can communicate through host memory (mapped pinned memory), asynchronously, without interfering with the CPU and without any need to stop kernel execution, exactly as if you had two GTX 275s in one computer :-)

Mapped pinned memory is a great tool for symmetrical or asymmetrical GPU<->GPU<->CPU communication!!!
