Hi -
I just started programming in CUDA and was wondering whether there is a way to synchronize between thread blocks. From my understanding (and from stepping through emulation mode), __syncthreads() only synchronizes threads within a block.
I am basically trying to implement a function that iterates over a matrix. Each block handles a submatrix. The catch is each block has to finish the current computation before any of the other blocks can go on to the next iteration. Something like a join():
Code:
for( int i=0;i<N;i++ )
{
// 1. each block does its computation
// 2. wait for each block to finish, ie: join()
// (tried a __syncthreads() here but only syncs threads inside the block)
}
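To make the structure concrete, here is a minimal sketch of what I mean (the array name and the `+= 1.0f` update are just stand-ins, not my real computation):

```cuda
__global__ void iterate(float *d_mat, int n, int N)
{
    int idx = blockIdx.x * blockDim.x + threadIdx.x;

    for (int i = 0; i < N; i++) {
        if (idx < n)
            d_mat[idx] += 1.0f;   // stand-in for the real submatrix update

        __syncthreads();          // barrier for threads in THIS block only;
                                  // other blocks are not held back here
    }
}
```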
I’m pretty sure the computation part is correct, since if I use just a single block everything comes out as it should. When I use multiple blocks, things break.
One way I thought of doing it was to put the iteration loop on the host and have the kernel take the iteration number as an argument. Something like:
Code:
for( int i=0;i<N;i++ )
{
myFunc<<< grid, block >>>( i );
}
But won't this be slower, with all the kernel-launch and host/device overhead?
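In case it helps, here is roughly what the host-driven version would look like (the sizes, names, and the per-iteration update are placeholders, not my actual code):

```cuda
#include <cuda_runtime.h>

__global__ void myFunc(float *d_mat, int n, int i)
{
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    if (idx < n)
        d_mat[idx] += (float)i;   // stand-in for the iteration-i update
}

int main(void)
{
    const int n = 1024, N = 16;
    float *d_mat;
    cudaMalloc((void **)&d_mat, n * sizeof(float));
    cudaMemset(d_mat, 0, n * sizeof(float));

    dim3 block(256);
    dim3 grid((n + block.x - 1) / block.x);

    // Kernel launches issued to the same stream execute in order,
    // so all blocks of iteration i finish before iteration i+1 starts.
    for (int i = 0; i < N; i++)
        myFunc<<<grid, block>>>(d_mat, n, i);

    cudaThreadSynchronize();   // wait for the final iteration to complete
    cudaFree(d_mat);
    return 0;
}
```

My worry is just whether launching the kernel N times costs too much compared to one launch with an in-kernel loop.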
Any help and pointers would be greatly appreciated.