Multiple memcpy HostToDevice in parallel? Or how to fake a broadcast to several GPUs

Suppose I have several GPUs connected, each serviced by its own host thread and each with its own private GPU address space.

To broadcast a big buffer to the GPUs, I can:

0- Dream of using a broadcast API from CUDA, which is still missing…

1- Ask each service thread to upload the buffer to its own private space. Because they are separate threads, all the memcpy calls will run in parallel (see the sketch after this list).

2- Synchronize the service threads after each upload, hence doing the memcpy calls one after the other.
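
To make 1 concrete, here is roughly what I have in mind for each service thread (a minimal sketch with the CUDA runtime API; the worker name `upload_worker` is mine, and error checking is omitted):

```cpp
// Option 1: one host thread per GPU, each binding to its device and issuing
// its own HostToDevice memcpy. upload_worker is a made-up name for the sketch.
#include <cuda_runtime.h>
#include <thread>
#include <vector>

static void upload_worker(int dev, const void* host_buf, size_t bytes) {
    cudaSetDevice(dev);            // bind this thread to its GPU
    void* dev_buf = nullptr;
    cudaMalloc(&dev_buf, bytes);   // private copy in this GPU's address space
    // Each thread issues its own copy; whether the DMAs really overlap
    // depends on how the GPUs share the PCIe links.
    cudaMemcpy(dev_buf, host_buf, bytes, cudaMemcpyHostToDevice);
    cudaFree(dev_buf);             // kept only so the sketch is self-contained
}

int main() {
    int n = 0;
    cudaGetDeviceCount(&n);
    std::vector<char> host_buf(256u << 20);   // 256 MB test buffer

    std::vector<std::thread> workers;
    for (int dev = 0; dev < n; ++dev)         // option 1: all uploads in flight
        workers.emplace_back(upload_worker, dev,
                             host_buf.data(), host_buf.size());
    for (auto& t : workers) t.join();         // option 2 would instead launch
                                              // and join one thread at a time
    return 0;
}
```

As far as I understand, with pinned host memory (`cudaMallocHost`) the copies become true DMA transfers, while with pageable memory the driver stages them through an internal pinned buffer, which already serializes part of the work.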

Questions:

  • Will 0 (broadcast) be supported in some future release of CUDA?

  • Is 1 (concurrent DMA) safe? Or should I expect some crashes…

  • Is 1 actually faster than 2? The bus is the bottleneck anyway, so there may be no gain in sending several concurrent DMA transfers.

  • Yes, but if “several” is 12 (e.g. 3 S870 units connected through 3 PCIe x16 slots), I do have several buses that should be able to do concurrent DMAs.
    So the real question is: when is 1 faster than 2?
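
To answer that last question empirically, a rough timing harness like the one below could compare the two variants directly: run all uploads concurrently (1), then one after the other (2), and compare wall-clock times. Pinned host memory via `cudaMallocHost` is assumed so the copies are real DMA transfers; error checking is again omitted.

```cpp
// Rough benchmark: concurrent uploads (option 1) vs serialized (option 2).
#include <cuda_runtime.h>
#include <chrono>
#include <cstdio>
#include <thread>
#include <vector>

int main() {
    int n = 0;
    cudaGetDeviceCount(&n);
    const size_t bytes = 256u << 20;
    void* src = nullptr;
    cudaMallocHost(&src, bytes);    // pinned memory => DMA-capable source

    auto upload = [&](int dev) {
        cudaSetDevice(dev);
        void* dst = nullptr;
        cudaMalloc(&dst, bytes);
        cudaMemcpy(dst, src, bytes, cudaMemcpyHostToDevice);
        cudaFree(dst);
    };

    using clk = std::chrono::steady_clock;

    auto t0 = clk::now();                       // option 1: all at once
    std::vector<std::thread> ts;
    for (int d = 0; d < n; ++d) ts.emplace_back(upload, d);
    for (auto& t : ts) t.join();

    auto t1 = clk::now();                       // option 2: one after the other
    for (int d = 0; d < n; ++d) upload(d);
    auto t2 = clk::now();

    auto ms = [](clk::time_point a, clk::time_point b) {
        return std::chrono::duration_cast<std::chrono::milliseconds>(b - a).count();
    };
    std::printf("concurrent: %lld ms, serialized: %lld ms\n",
                (long long)ms(t0, t1), (long long)ms(t1, t2));
    cudaFreeHost(src);
    return 0;
}
```

If the 12 GPUs really sit behind 3 independent PCIe x16 links, I would expect the concurrent time to approach a third of the serialized one; if everything funnels through a single link, the two times should come out roughly equal.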