Question about divergence and branch granularity

albertf · April 25, 2012, 12:31am

Can someone help me understand how the following example used everywhere to go from divergent code to non-divergent code? If there were 16 threads in a block, In the first if statement, only threads 0 and 1 will execute the body of the if statement. However, in the second if statement, it looks like all the threads will execute the body of the if statement. How can this work if I only wanted selective threads in a block to copy something from the device memory to the shared memory? Also, is there an implicit barrier at the end of the if statement?

if (threadIdx.x < 2) { 

}

is the same as

if (threadIdx.x/WARP_SIZE < 2) { 

   // do something

}

pasoleatis · April 25, 2012, 5:39am

The threads are executed in groups of 32. In the first case 2 threads will execute the if while the other 30 will not. This means that the warp is executed 2 times, onnce for th ebranch with 2 threads doing the if and once for the other not doing. After this the warp converges back at least in the warp everything is executed in the same time.

In the second case the first 2 warps execute the if. In this case there is no branching.

Topic		Replies	Views
Question about divergence and branch granularity CUDA Programming and Performance	1	3067	May 25, 2013
Avoid branching ... CUDA Programming and Performance	3	3602	May 19, 2010
Is there efficient way to deal with if/else in the kernel CUDA Programming and Performance	4	13886	June 14, 2009
Block Divergence CUDA Programming and Performance	5	1094	December 8, 2009
Thread Divergence CUDA Programming and Performance	2	2730	September 27, 2008
Branching? CUDA Programming and Performance	7	3157	March 16, 2012
About divergent warps CUDA Programming and Performance	3	1589	September 22, 2009
Thread divergence due to IF CUDA Programming and Performance	3	6853	September 13, 2007
Question about divergent branching CUDA Programming and Performance	3	6430	May 21, 2009
Shift direction and divergence CUDA Programming and Performance	7	381	November 13, 2020

Question about divergence and branch granularity

Related topics