Memcopy while Kernels Running? Performance hit?

trex · June 5, 2008, 2:21pm

Assuming the kernel in question does not use the memory address being copied from, am I able to run a kernel while also copy data from the device- to host- memory, and vice versa? I mean will there be any major performance penalty? Any other considerations (other than ensuring not copying data being worked on by kernels)?

kristleifur · June 5, 2008, 2:53pm

Yes you can, basically. What you’re looking for is are called async functions, such as cudaMemcpyAsync, and you want to use CUDA Streams. The CUDA streams are a way to help synchronise ansynchronous stuff. Hope that gets you going in the right direction.

trex · June 5, 2008, 3:05pm

Cheers. :)

Topic		Replies	Views
Accesing memory from both kernel and host side CUDA Programming and Performance	1	3031	February 17, 2008
Async kernel execution and data copy CUDA Programming and Performance	0	4228	November 10, 2009
Memory Copy Threads CUDA Programming and Performance	2	1997	July 27, 2007
load/store host data inside kernel? CUDA Programming and Performance	2	728	February 11, 2015
Asynchronicity of kernel execution and cuMemcpy CUDA Programming and Performance	2	3280	March 23, 2009
Send data and calculate at the same time? CUDA Programming and Performance	1	1246	June 30, 2008
during the copy, can cpu and gpu work? CUDA Programming and Performance	6	5214	June 11, 2008
Asynchronous H2D transfer while kernel execution CUDA Programming and Performance	2	5135	April 26, 2011
Asynchronous memory copy from Host to Device CUDA Programming and Performance	5	3063	June 12, 2008
cudaMemcpy during kernel execution asynchronous kernel launch CUDA Programming and Performance	2	3084	July 20, 2007

Memcopy while Kernels Running? Performance hit?

Related topics