concurrency of device to device copy


xnov December 17, 2012, 5:12am 1

The CUDA docs (Programming Guide :: CUDA Toolkit Documentation) say that memory copies between two addresses in the same device memory are always concurrent.

So my question is: can device-to-device copies run concurrently with each other, the way several independent kernels can (that is, can two or more device-to-device copies be in flight at the same time)? Or is that still dictated by the asynchronous copy engine?
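To make the question concrete, here is a minimal sketch of the pattern I have in mind: two independent device-to-device copies issued with `cudaMemcpyAsync` on two different streams. The function name `run_two_d2d_copies`, the `CHECK` macro, and the buffer sizes are just placeholders for illustration. Putting the copies on separate streams only permits them to overlap; whether they actually do is exactly what I am asking, and this code only checks that both copies produce correct results.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical helper: bail out of the function on the first CUDA error.
// (For brevity, the early return does not free earlier allocations.)
#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            std::fprintf(stderr, "CUDA error: %s\n",                  \
                         cudaGetErrorString(err_));                   \
            return false;                                             \
        }                                                             \
    } while (0)

// Issues two independent device-to-device copies on separate streams and
// returns true if both destination buffers match their sources.
bool run_two_d2d_copies(size_t n) {
    const size_t bytes = n * sizeof(int);
    std::vector<int> hostA(n, 1), hostB(n, 2), outA(n, 0), outB(n, 0);
    int *srcA = nullptr, *dstA = nullptr, *srcB = nullptr, *dstB = nullptr;

    CHECK(cudaMalloc(&srcA, bytes));
    CHECK(cudaMalloc(&dstA, bytes));
    CHECK(cudaMalloc(&srcB, bytes));
    CHECK(cudaMalloc(&dstB, bytes));

    // Fill the source buffers (synchronous host-to-device copies).
    CHECK(cudaMemcpy(srcA, hostA.data(), bytes, cudaMemcpyHostToDevice));
    CHECK(cudaMemcpy(srcB, hostB.data(), bytes, cudaMemcpyHostToDevice));

    cudaStream_t s0, s1;
    CHECK(cudaStreamCreate(&s0));
    CHECK(cudaStreamCreate(&s1));

    // Two device-to-device copies with no dependency between them.
    // Different streams allow, but do not guarantee, overlap.
    CHECK(cudaMemcpyAsync(dstA, srcA, bytes, cudaMemcpyDeviceToDevice, s0));
    CHECK(cudaMemcpyAsync(dstB, srcB, bytes, cudaMemcpyDeviceToDevice, s1));

    CHECK(cudaStreamSynchronize(s0));
    CHECK(cudaStreamSynchronize(s1));

    // Copy the results back and compare them with the sources.
    CHECK(cudaMemcpy(outA.data(), dstA, bytes, cudaMemcpyDeviceToHost));
    CHECK(cudaMemcpy(outB.data(), dstB, bytes, cudaMemcpyDeviceToHost));
    const bool ok = (outA == hostA) && (outB == hostB);

    CHECK(cudaStreamDestroy(s0));
    CHECK(cudaStreamDestroy(s1));
    CHECK(cudaFree(srcA));
    CHECK(cudaFree(dstA));
    CHECK(cudaFree(srcB));
    CHECK(cudaFree(dstB));
    return ok;
}

int main() {
    const bool ok = run_two_d2d_copies(1 << 20);
    std::printf("both copies correct: %s\n", ok ? "yes" : "no");
    return ok ? 0 : 1;
}
```

Running this under a profiler timeline should show whether the two copies overlap or get serialized on a given GPU.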

thanks
