Asynchronicity of kernel execution and cuMemcpy

chrismc · March 23, 2009, 9:29am

Is it possible for a cuMemcpy from device to host to run at the same time as a kernel is executing on that device?

If so, how can the host and device communicate so that the kernel does not write to the memory the cuMemcpy is reading until the cuMemcpy has finished?

seibert · March 23, 2009, 12:54pm

If the kernel execution and the memcpy are in the same stream (if you don’t specify a stream, they are implicitly in stream 0), then you don’t need to worry. cuMemcpy automatically performs the synchronization required to ensure consistency.
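A minimal sketch of the same-stream case, for illustration only. It uses the runtime API (cudaMemcpy) rather than the driver API call named in the question, on the assumption that the ordering behaviour in the default stream is the same. The kernel name `fill` and the buffer names are hypothetical.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: each thread writes its global index into the buffer.
__global__ void fill(int *out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = i;
}

int main() {
    const int n = 1 << 20;
    int *h_buf = new int[n];
    int *d_buf = nullptr;

    if (cudaMalloc(&d_buf, n * sizeof(int)) != cudaSuccess) {
        fprintf(stderr, "cudaMalloc failed\n");
        delete[] h_buf;
        return 1;
    }

    // The launch is asynchronous: control returns to the host right away.
    // No stream is given, so the kernel runs in the default stream (stream 0).
    fill<<<(n + 255) / 256, 256>>>(d_buf, n);

    // Also in the default stream, so the copy is ordered after the kernel.
    // cudaMemcpy returns only once the data has arrived in h_buf.
    cudaError_t err = cudaMemcpy(h_buf, d_buf, n * sizeof(int),
                                 cudaMemcpyDeviceToHost);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaMemcpy failed: %s\n", cudaGetErrorString(err));
    } else {
        // Safe to read: the kernel finished before the copy started.
        printf("last element = %d\n", h_buf[n - 1]);
    }

    cudaFree(d_buf);
    delete[] h_buf;
    return err == cudaSuccess ? 0 : 1;
}
```

No explicit synchronization call appears between the launch and the copy; the ordering comes from both operations being issued to the same stream.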

chrismc · March 23, 2009, 2:15pm

Streams are for me the next thing to understand in CUDA.
