Threading and streams cudaStreamSynchronize

TL1 · July 16, 2008, 10:02am

Does [font=“Courier”]cudaStreamSynchronize[/font] have to be called from the same thread that launched the kernel / memcpy? Or will it work from any thread?

If it has to be the same thread, how can you wait for a stream to complete without blocking the host from issuing new requests in other streams.

Thanks.

MisterAnderson42 · July 16, 2008, 11:32am

It must be in the same CUDA context, so yes it must be in the same thread. If you need to synchronize to certain locations in the stream without blocking you can mark the location with a CUDA event and then perform a non-blocking check with cudaEventQuery.

TL1 · July 16, 2008, 12:57pm

So [font=“Courier”]cudaEventSynchronize[/font] can be called from any thread?

MisterAnderson42 · July 16, 2008, 3:41pm

No. All cuda* functions operate in the current CUDA context. There is one context per host thread and contexts cannot share any resources. cudaEventQuery can be used for a non-blocking check if an event has occured.

TL1 · July 16, 2008, 4:46pm

Hmm. Maybe I am going about this the wrong way then.

I have a bunch of host threads that require work done. Each thread represents a separate client.

I planned to also have a separate thread for each CUDA device. Then for each client I would select a device and route the CUDA calls through the appropriate thread for the device. This way, clients can change devices as necessary, to balance loads. But it seems that there is no way to block the client thread until processing is complete without blocking the thread that issues the CUDA calls to the device as well.

I want multiple separate threads to be able to make optimal use of the devices without blocking them, and with load balancing. And each originating thread needs to know when its compute job is done. What is the best way to achieve this?

Topic		Replies	Views
Behaviour of Multithreaded programs with cudaThreadSynchronize() The semantics of cudaThreadSynchron CUDA Programming and Performance	1	7250	January 9, 2012
About the behavior of cudaStreamSynchronize() CUDA Programming and Performance cuda	3	4236	April 25, 2023
Is the driver API thread safe? Specifically the cuStreamSynchronize and cuEventSynchronize functions CUDA Programming and Performance	0	1322	January 7, 2009
cudaDeviceSynchronize - blocks only GPU for the host (CPU) thread in which it is called, or does it CUDA Programming and Performance	3	4239	January 12, 2014
Question about cudaDeviceScheduleBlockingSync CUDA Programming and Performance	0	485	March 24, 2021
cudaThreadSynchronize() in MultyThreade Application CUDA Programming and Performance	3	4818	December 17, 2010
Why does cudaEventSynchronize block other streams? CUDA Programming and Performance cuda	1	531	February 2, 2023
cudaDeviceSynchronize() doesn't wait for kernels launched by other CPU threads, why? CUDA Programming and Performance synchronization	7	2434	October 12, 2021
How to block a single host thread on a CUDA event multi-threaded CUDA application CUDA Programming and Performance	5	2280	September 8, 2011
synchronization between the host and the stream CUDA Programming and Performance	3	1050	June 29, 2009

Threading and streams cudaStreamSynchronize

Related topics