Behaviour of Multithreaded programs with cudaThreadSynchronize() The semantics of cudaThreadSynchron

hdpatel · January 8, 2012, 4:25pm

Hello,

I wish to understand the semantics of cudaThreadSynchronize() when used in a multithreaded program. (I understand this is deprecated in CUDA 4, but I’m using 3.2).

Suppose I have a multithreaded application that creates two pthreads P1 and P2. Within P1 and P2, I perform a kernel invocation. Now, suppose I make a call to cudaThreadSynchronize() in the threads as well before readig the computed results. Does one invocation to cudaThreadSynchronize() force a block on both pthreads’ kernel invocations?

Example:

P1’s kernel invocation denoted by P1.k1.
P1’s invocation to cudaThreadSynchronize() by P1.cts1.
P2’s kernel invocation denoted by P2.k2.
P2’s invocation to cudaThreadSynchronize() by P2.cts2.

Suppose the execute occurs in the following manner:

P1.k1, P2.k2
P2.cts2

Now, does (2) ensure that P1.k1 completes before proceeding?

Thanks.

DrAnderson42 · January 9, 2012, 4:29pm

In CUDA 3.2, each thread has it’s own CUDA context and thus its own separate stream of kernel launches. In your example, the cudaThreadSynchronization in P2 only ensures that the kernel P2.k2 call is complete.

Note that is is generally better to insert events and use cudaStreamWaitEvent or other fine-grained sync mechanisms than big hammer cudaThread(Device)Synchronize.

Topic		Replies	Views
Waiting for kernel CUDA Programming and Performance	6	1448	September 8, 2010
cudaThreadSynchronize() and multiple kernels when is it necessary to sync? CUDA Programming and Performance	2	8335	June 20, 2008
Question regarding cudaThreadSynchronize() Does it act like a barrier? CUDA Programming and Performance	1	1142	September 16, 2008
Using GPU and CPU at the same time CUDA Programming and Performance	5	6955	March 4, 2009
can i use multiple kernels for my program CUDA Programming and Performance	2	4252	January 8, 2009
When multiple CPU threads launch their own kernels, do they share the same CUDA context? CUDA Programming and Performance	3	920	October 12, 2021
problem about cudaThreadSynchronize() CUDA Programming and Performance	3	7839	November 25, 2007
No need to check cudaThreadSynchronize() in release mode? CUDA Programming and Performance	9	6338	April 21, 2009
cudaThreadSynchronize() after kernel call? CUDA Programming and Performance	5	11480	November 29, 2010
"cudaThreadSynchronize()" and "__syncthreads()" CUDA Programming and Performance	1	9642	March 22, 2008

Behaviour of Multithreaded programs with cudaThreadSynchronize() The semantics of cudaThreadSynchron

Related topics