Synchronize Blocks Within CUDA kernel Your ipinion

system · July 22, 2011, 10:24pm

Will NVIDIA be implementing a new method to synchronize all the blocks executing a kernel? Is it even possible with current hardware?

tmurray · July 22, 2011, 11:06pm

No and no.

LSChien · July 22, 2011, 11:35pm

The blocks belong to following three states at any instant,

done, or
execute on some SM, or
wait because of no resources.

You cannot synchronize for ALL blocks. More precisely, you cannot synchronize for blocks of state 3.

But you can synchronize blocks of state 1 and 2.

jorgec · July 25, 2011, 8:40pm

I have an algorithm that requires read and write to global memory in an iterative process and need to sync blocks between each iteration. What would be the best way to do this? might __threadfence()?

Currently, I call the kernel from the host in each iteration:

for(…)
Cuda-kernel;

thanks in advance

tmurray · July 25, 2011, 8:57pm

Your method is the best method. You don’t want to use __threadfence().

elect · June 30, 2012, 9:42pm

Isnt there any overhead in this way or?

Topic		Replies	Views
cuda block synchronization CUDA Programming and Performance	1	984	June 19, 2011
Synchronize all blocks in CUDA CUDA Programming and Performance	12	45671	October 25, 2013
question about __syncthreads(); CUDA Programming and Performance	9	8620	March 17, 2008
cuda block synchronization CUDA Programming and Performance	4	8401	June 20, 2011
sync over blocks age old question CUDA Programming and Performance	2	2877	September 9, 2008
synchronization between blocks CUDA Programming and Performance	2	747	December 5, 2014
synchronisation between blocks CUDA Programming and Performance	2	1478	June 11, 2009
device synchronization inside cuda kernels CUDA Programming and Performance	2	3404	October 1, 2016
Thread sync CUDA Programming and Performance	2	794	May 9, 2011
More than 512 threads sync. CUDA Programming and Performance	4	2842	June 3, 2009

Synchronize Blocks Within CUDA kernel Your ipinion

Related topics