CUDA sync and async memcpy

s002wjh · February 1, 2016, 10:05pm

Is the synchronous cudaMemcpy blocking? If so, I would like to use async memory copies with a stream for better performance. When do I need to synchronize to get the result: once everything on the GPU is done and the result is handed back to the CPU? For example, would I use async copy -> fft -> mult -> ifft -> sync stream -> copy result back to host?

episteme · February 2, 2016, 6:34am

async copy -> fft -> mult -> ifft -> async copy result back -> cudaStreamSynchronize (or cudaStreamQuery)

cudaStreamQuery does not block: if it returns cudaSuccess, all work in the stream has finished; if it returns cudaErrorNotReady, work is still pending.
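
A minimal sketch of that ordering, assuming a single stream, a 1D complex-to-complex cuFFT plan, and a hypothetical `scale` kernel standing in for the "mult" step (the original posts do not say what the multiply does). The host buffer is allocated with cudaMallocHost because cudaMemcpyAsync generally needs pinned host memory to overlap with host work. Error checking is left out for brevity.

```cuda
#include <cstdio>
#include <cuda_runtime.h>
#include <cufft.h>

// Hypothetical "mult" step: scale every complex sample by a real factor.
__global__ void scale(cufftComplex* data, int n, float s) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        data[i].x *= s;
        data[i].y *= s;
    }
}

int main() {
    const int N = 1 << 16;                       // illustrative size
    const size_t bytes = N * sizeof(cufftComplex);

    // Pinned host buffer so the async copies do not fall back to staging.
    cufftComplex* h_buf = nullptr;
    cudaMallocHost(&h_buf, bytes);
    for (int i = 0; i < N; ++i) {
        h_buf[i].x = 1.0f;
        h_buf[i].y = 0.0f;
    }

    cufftComplex* d_buf = nullptr;
    cudaMalloc(&d_buf, bytes);

    cudaStream_t stream;
    cudaStreamCreate(&stream);

    // Bind the plan to the stream so the FFTs are ordered with the copies.
    cufftHandle plan;
    cufftPlan1d(&plan, N, CUFFT_C2C, 1);
    cufftSetStream(plan, stream);

    // Everything below is only enqueued; work in one stream runs in order.
    cudaMemcpyAsync(d_buf, h_buf, bytes, cudaMemcpyHostToDevice, stream);
    cufftExecC2C(plan, d_buf, d_buf, CUFFT_FORWARD);
    // cuFFT transforms are unnormalized, so scale by 1/N before the inverse.
    scale<<<(N + 255) / 256, 256, 0, stream>>>(d_buf, N, 1.0f / N);
    cufftExecC2C(plan, d_buf, d_buf, CUFFT_INVERSE);
    cudaMemcpyAsync(h_buf, d_buf, bytes, cudaMemcpyDeviceToHost, stream);

    // Option 1: poll without blocking and do CPU work in the meantime.
    while (cudaStreamQuery(stream) == cudaErrorNotReady) {
        // ... other host work could go here ...
    }

    // Option 2: block until the stream is idle (returns at once here,
    // because the polling loop above already saw the stream finish).
    cudaStreamSynchronize(stream);

    // h_buf is only safe to read after one of the two checks above.
    printf("h_buf[0] = (%f, %f)\n", h_buf[0].x, h_buf[0].y);

    cufftDestroy(plan);
    cudaStreamDestroy(stream);
    cudaFree(d_buf);
    cudaFreeHost(h_buf);
    return 0;
}
```

This should build with something like `nvcc pipeline.cu -lcufft` (the file name is arbitrary). In real code you would pick either the polling loop or cudaStreamSynchronize, not both; they are shown together only to contrast the non-blocking and blocking checks.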
