CUDA graphs express concurrency and asynchrony through dependencies between graph nodes, and you can control those dependencies. This control is most explicit if you build the graph with the graph API (adding nodes and their dependencies yourself), but if you use the stream capture method, the dependencies are still defined at that point — they are inferred from stream ordering and event waits during capture.
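As a minimal sketch of the stream capture method (the kernel names `kA` and `kB` are hypothetical, and this uses the 5-argument `cudaGraphInstantiate` form from CUDA 11.x): because `kB` is launched after `kA` into the same stream during capture, the resulting graph records B as dependent on A.

```cuda
#include <cuda_runtime.h>

__global__ void kA() {}
__global__ void kB() {}

int main() {
    cudaStream_t s;
    cudaStreamCreate(&s);

    cudaGraph_t graph;
    cudaStreamBeginCapture(s, cudaStreamCaptureModeGlobal);
    kA<<<1, 1, 0, s>>>();            // becomes node A
    kB<<<1, 1, 0, s>>>();            // becomes node B, dependent on A via stream order
    cudaStreamEndCapture(s, &graph); // dependencies are now baked into the graph

    cudaGraphExec_t exec;
    cudaGraphInstantiate(&exec, graph, nullptr, nullptr, 0);
    cudaGraphLaunch(exec, s);
    cudaStreamSynchronize(s);

    cudaGraphExecDestroy(exec);
    cudaGraphDestroy(graph);
    cudaStreamDestroy(s);
    return 0;
}
```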
No graph item will execute before its dependencies are complete. Other than that, CUDA graphs will attempt to schedule work efficiently to maximize performance, and you have no direct control over this scheduling.
Let’s say we have a graph node B that depends on A, and a graph node C that also depends on A. Once A completes, CUDA graphs will (generally speaking, using streams under the hood) allow B and C to execute concurrently, as quickly as possible.
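That fork pattern can be sketched with the explicit graph API — a minimal example under the same assumptions as above (hypothetical no-argument kernels, CUDA 11.x `cudaGraphInstantiate` signature), where the dependency arrays passed to `cudaGraphAddKernelNode` encode "B depends on A" and "C depends on A":

```cuda
#include <cuda_runtime.h>

__global__ void kA() {}
__global__ void kB() {}
__global__ void kC() {}

int main() {
    cudaGraph_t graph;
    cudaGraphCreate(&graph, 0);

    cudaKernelNodeParams p = {};
    p.gridDim = dim3(1);
    p.blockDim = dim3(1);
    p.kernelParams = nullptr;   // kernels take no arguments

    cudaGraphNode_t nA, nB, nC;
    p.func = (void *)kA;
    cudaGraphAddKernelNode(&nA, graph, nullptr, 0, &p);  // A: no dependencies
    p.func = (void *)kB;
    cudaGraphAddKernelNode(&nB, graph, &nA, 1, &p);      // B depends on A
    p.func = (void *)kC;
    cudaGraphAddKernelNode(&nC, graph, &nA, 1, &p);      // C depends on A

    // after A completes, the runtime is free to run B and C concurrently
    cudaGraphExec_t exec;
    cudaGraphInstantiate(&exec, graph, nullptr, nullptr, 0);

    cudaStream_t s;
    cudaStreamCreate(&s);
    cudaGraphLaunch(exec, s);
    cudaStreamSynchronize(s);
    return 0;
}
```

Note that nothing in the graph says *which* stream B or C will run on; only the dependency edges are yours to specify.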
Regarding your question 2, you don’t have control over the detailed scheduling of activity, other than declaring dependencies.