CUDA Callback function context

avishorp · August 5, 2018, 4:01pm

What is the context (thread) in which callback functions registered with cudaStreamAddCallback are called? Is it different than the main program thread? Is there a way to wait for any callback to execute without consuming CPU time?

Robert_Crovella · August 5, 2018, 9:13pm

Yes, its a different thread than the main program. It is run in a thread created by the CUDA driver.

[url]cuda - What thread runs the callback passed to cudaStreamAddCallback? - Stack Overflow

Since the complete context is not described, this should be treated as an abstract, implementation-defined methodology for handling a callback. Therefore I would be cautious about trying to create direct synchronization between it and your program code.

The way to wait for a callback in a CUDA-aware way would be to put an event into the stream the callback was issued into after the callback, then issue cudaEventSynchronize() (on that event) prior to the code you want to wait on the callback. This would be the programming-model-aware method, in my view.

At that point, the question about whether or not that cudaEventSynchronize() call uses CPU time would be a function of how you have the synchronization flags set.

[url]https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__DEVICE.html#group__CUDART__DEVICE_1g69e73c7dda3fc05306ae7c811a690fac[/url]

avishorp · August 8, 2018, 2:23pm

Thanks, very helpful.

Topic		Replies	Views
How to hang up a stream waiting for a CPU thread? CUDA Programming and Performance	1	844	September 22, 2015
Using cudaEvents to synchronise with cudaStreamCallback CUDA Programming and Performance cuda	5	654	May 9, 2024
cudaLaunchHostFunc API example CUDA Programming and Performance	31	5931	February 8, 2025
Cuda context and cudaDeviceSynchronize CUDA Programming and Performance	1	688	February 27, 2023
Threading and streams cudaStreamSynchronize CUDA Programming and Performance	4	3587	July 16, 2008
About the behavior of cudaStreamSynchronize() CUDA Programming and Performance cuda	3	2785	April 25, 2023
Threads sharing cuda events CUDA Programming and Performance	8	2082	August 19, 2016
Cuda syncronize APIs CUDA Programming and Performance	9	33	January 28, 2025
What is the recommended way to flag secondary threads CUDA Programming and Performance cuda	10	28	March 12, 2025
unable to get the cpu and gpu to run in parallel CUDA Programming and Performance	34	23205	October 7, 2010

CUDA Callback function context

Related topics