memcpy priority of cuda streams

k.adithya1990 · March 9, 2020, 5:14pm

I understand cudaMemcpyAsync can be used to launch memcpy on specific cuda streams, and that different cuda streams can be created with different priorities.
I also understand that cudaMemcpyAsyncs launched on a specific cuda stream execute in FIFO order.
My question is, for a one way memcpy in a single direction (i.e. htod or dtoh), how are the memcpys scheduled across different cuda streams?

Is the prioritization specification only for the kernel execution, or is it also for the DMA engine?

As an illustrative microbenchmark that does memcpy on two different streams launched from a single host thread, I noticed an interesting behavior on nsight.

Stream 16 is assigned a higher priority than stream 15 (using the cudaStreamCreateWithPriority API). The first request on stream 16 goes through, after which it falls back to stream 15.

Is this expected behavior?
Is there a way to control memcpy prioritization across cuda streams?

Topic		Replies	Views
cuda MemCopy memory consistency issue (across streams) CUDA Programming and Performance	0	4075	June 1, 2010
Syncronization with cuda Streams CUDA Programming and Performance cuda	8	418	October 12, 2021
CUDA stream priority across processes(Windows OS) CUDA Programming and Performance	1	708	May 31, 2019
I want to synchronize CUDA streams CUDA Programming and Performance	5	709	January 5, 2024
Execution order between Cuda Stream 0 and other streams CUDA Programming and Performance	0	356	July 9, 2020
cudaMemcpyAsync clarification required & help needed CUDA Programming and Performance	0	1749	October 17, 2009
Help with CUDA streams CUDA Programming and Performance	1	1599	April 2, 2010
cuda (Newbie question) when using streams, does the order of the Async calls make a difference? CUDA Programming and Performance	1	527	December 5, 2010
Unable to Make p2p cudaMemcpyAsync calls between GPU processes parallel CUDA Programming and Performance cuda	2	1373	March 1, 2022
Memset/memcpyDtoD implicitly synchronizes all streams -- a way to disable it? CUDA Programming and Performance	5	537	August 23, 2023

memcpy priority of cuda streams

Related topics