Dynamic Control Flow in CUDA Graphs with Conditional Nodes

Originally published at: https://developer.nvidia.com/blog/dynamic-control-flow-in-cuda-graphs-with-conditional-nodes/

CUDA Graphs can provide a significant performance increase, as the driver is able to optimize execution using the complete description of tasks and dependencies. Graphs provide incredible benefits for static workflows where the overhead of graph creation can be amortized over many successive launches. However, nearly all problems involve some form of decision-making, which can…

Nice feature! Are there any plans to support nodes with different contexts? Also the post seems to focus on the runtime API but are the memops nodes from the driver API allowed?