The driver API spec says “Blocks until the device has completed all preceding requested tasks.” Does that mean “blocks until the device has completed all preceding requested tasks in the streams belonging to the context”? Or does it mean waiting for all tasks of all streams on the device?
It should wait for all tasks in the context, i.e. on streams in the context.
Thanks, epk! My question is: does it wait for tasks in other contexts on the same device? I thought it would not, but a little bird told me that it would, so I want to get clarification here.
does it wait for tasks in other contexts on the same device?
No, it doesn’t. But you could just write a sample program and check.
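For example, such a check could use the raw Driver API directly: create two contexts on the same device, launch a long-running kernel in the first, then call cuCtxSynchronize() while the second (idle) context is current and time how long it blocks. This is an untested sketch, not a complete program; the spin kernel and its cuModuleLoad/cuLaunchKernel plumbing are elided, and the CHECK macro is just a hypothetical helper:

```cuda
#include <cuda.h>
#include <cstdio>
#include <chrono>

// Hypothetical error-checking helper for Driver API calls.
#define CHECK(call) do { \
    CUresult r_ = (call); \
    if (r_ != CUDA_SUCCESS) { \
        const char* msg_; cuGetErrorString(r_, &msg_); \
        std::fprintf(stderr, "%s failed: %s\n", #call, msg_); \
        return 1; \
    } } while (0)

int main() {
    CHECK(cuInit(0));
    CUdevice dev;
    CHECK(cuDeviceGet(&dev, 0));

    CUcontext ctx1, ctx2;
    CHECK(cuCtxCreate(&ctx1, 0, dev));  // each cuCtxCreate also makes the
    CHECK(cuCtxCreate(&ctx2, 0, dev));  // new context current on this thread

    // Launch a long-running kernel (say, ~1 second of busy-waiting) in ctx1.
    CHECK(cuCtxSetCurrent(ctx1));
    // ... cuModuleLoad / cuModuleGetFunction / cuLaunchKernel elided here ...

    // Now make the idle context current and time cuCtxSynchronize().
    // If it only waits for work in the *current* context, this should
    // return almost immediately; if it waited device-wide, it would
    // block for roughly the kernel's duration.
    CHECK(cuCtxSetCurrent(ctx2));
    auto t0 = std::chrono::steady_clock::now();
    CHECK(cuCtxSynchronize());
    auto t1 = std::chrono::steady_clock::now();
    std::printf("cuCtxSynchronize() on the idle context blocked for %lld ms\n",
        (long long) std::chrono::duration_cast<std::chrono::milliseconds>(t1 - t0).count());

    CHECK(cuCtxDestroy(ctx2));
    CHECK(cuCtxDestroy(ctx1));
    return 0;
}
```

Running this on a CUDA-capable machine and comparing the reported blocking time against the spin kernel's duration would answer the question empirically.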
Shameless self-plug: … and your program would be pretty short if you used the upcoming version 0.5 of my Modern-C++ CUDA API wrappers, which supports the combined Driver + Runtime API, with RAII objects and exception-protected calls, e.g.:
auto context_1 = cuda::context::create(device);
auto context_2 = cuda::context::create(device);
auto stream_1 = context_1.create_stream(cuda::stream::nonblocking);
auto stream_2 = context_2.create_stream(cuda::stream::nonblocking);
stream_1.enqueue.kernel_launch(my_kernel, my_launch_config, some_args, would_go_here);
stream_2.enqueue.kernel_launch(my_kernel, my_launch_config, different_args, perhaps);
// no need for destroying any of the contexts or streams here, just exit the scope or function.
etc. That’s not a full program of course, but it’s the gist of what you could write to check.
and the wrappers trigger exceptions on failed API calls, so you don’t need to check status codes by hand.