Asynchronous memory copy from Host to Device

Squall211 · June 11, 2008, 7:27pm

Hello,

My application requires constantly streaming data from a live camera feed. I’m therefore really interested in the asynchronous memory copy from host to device.

Are there any NVIDIA graphics cards currently out that support this capability? I know that cudaMemcpyAsync is supported in the new 2.0 SDK. However, I’ve heard from some colleagues that it will not actually run on current hardware.

MisterAnderson42 · June 11, 2008, 8:16pm

It works on all compute 1.1 hardware, which is all CUDA-capable hardware that is not G80 (8800 GTX, Tesla).

Hasmanean · June 11, 2008, 9:37pm

I have Async memcpys working on an 8800GTX, Compute Cap 1.0.

Streams do not work on the 8800 GTX, but async kernel calls and async memcpys do.

MisterAnderson42 · June 12, 2008, 1:15am

Yes, you can do async memcpys. But on 1.0 hardware you can’t overlap them with kernel execution, which I assumed is what the OP was asking about.
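
Rather than inferring overlap support from the compute capability number, it can be queried per device at runtime. The following is a minimal sketch, assuming a toolkit recent enough to have the `asyncEngineCount` field of `cudaDeviceProp`; the runtime has long exposed a `deviceOverlap` flag for the same purpose, which is now deprecated in favor of `asyncEngineCount`.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int count = 0;
    if (cudaGetDeviceCount(&count) != cudaSuccess || count == 0) {
        std::printf("No CUDA device found\n");
        return 1;
    }

    for (int dev = 0; dev < count; ++dev) {
        cudaDeviceProp prop;
        if (cudaGetDeviceProperties(&prop, dev) != cudaSuccess) {
            continue;  // skip devices whose properties cannot be read
        }
        // asyncEngineCount > 0 means the device can run a copy
        // concurrently with a kernel; 0 means copies and kernels serialize.
        std::printf("Device %d: %s, compute %d.%d, copy/kernel overlap: %s\n",
                    dev, prop.name, prop.major, prop.minor,
                    prop.asyncEngineCount > 0 ? "yes" : "no");
    }
    return 0;
}
```

A device that reports "no" still accepts cudaMemcpyAsync calls; the copy simply does not run at the same time as a kernel.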

santyhyammer · June 12, 2008, 1:49am

I use async streams with 2x GF8500GT… but it’s curious… cudaStreamCreate() always returns a stream handle of “1”… even with multiple threads and GPUs plugged in.

Squall211 · June 12, 2008, 3:38pm

Thanks for all the help so far.

So let me try to see if I understand:

Compute 1.0 capability: Kernel will not execute while async memcopy is working

Compute 1.1 capability: Kernel WILL execute while async memcopy is working

Do I have this right?
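
For reference, the streaming pattern under discussion could be sketched roughly as below. This is an illustration under stated assumptions, not anyone's actual application: the kernel `processFrame`, the frame size, the buffer count, and the fake frame data standing in for the camera feed are all hypothetical. The parts that matter are the page-locked host buffers from cudaMallocHost (the copy is only asynchronous from pinned memory) and issuing each copy and its kernel into the same non-default stream. Whether a copy in one stream actually overlaps a kernel in the other depends on the hardware.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical per-frame kernel: doubles every sample.
__global__ void processFrame(float* data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2.0f;
}

#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            std::fprintf(stderr, "%s failed: %s\n", #call,            \
                         cudaGetErrorString(err_));                   \
            return 1;                                                 \
        }                                                             \
    } while (0)

int main() {
    const int n = 1 << 20;        // samples per frame (assumed)
    const int nBuffers = 2;       // double buffering
    const int nFrames = 8;        // frames to push through (assumed)
    const size_t bytes = n * sizeof(float);

    float* hostBuf[nBuffers];
    float* devBuf[nBuffers];
    cudaStream_t stream[nBuffers];

    for (int b = 0; b < nBuffers; ++b) {
        CHECK(cudaMallocHost((void**)&hostBuf[b], bytes));  // pinned host memory
        CHECK(cudaMalloc((void**)&devBuf[b], bytes));
        CHECK(cudaStreamCreate(&stream[b]));
    }

    for (int f = 0; f < nFrames; ++f) {
        const int b = f % nBuffers;

        // Make sure earlier work on this buffer has finished
        // before the host overwrites it with the next frame.
        CHECK(cudaStreamSynchronize(stream[b]));

        // Stand-in for grabbing a frame from the camera.
        for (int i = 0; i < n; ++i) hostBuf[b][i] = (float)f;

        // Returns immediately; the copy is queued in stream[b].
        CHECK(cudaMemcpyAsync(devBuf[b], hostBuf[b], bytes,
                              cudaMemcpyHostToDevice, stream[b]));

        // Queued behind the copy in the same stream, so it sees the new data.
        processFrame<<<(n + 255) / 256, 256, 0, stream[b]>>>(devBuf[b], n);
        CHECK(cudaGetLastError());
    }

    CHECK(cudaDeviceSynchronize());  // wait for all queued work

    for (int b = 0; b < nBuffers; ++b) {
        CHECK(cudaStreamDestroy(stream[b]));
        CHECK(cudaFree(devBuf[b]));
        CHECK(cudaFreeHost(hostBuf[b]));
    }
    std::printf("Processed %d frames\n", nFrames);
    return 0;
}
```

On hardware without copy/kernel overlap the same code still runs correctly; the copy for one buffer just waits for the kernel on the other buffer instead of running alongside it.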
