Avoid branching ...

Hi,

I’m a CUDA beginner and I am currently reading the Programming Guide. In the section “Control Flow Instructions” (5.4.2) I found the following paragraph:

I don’t understand why the code does not branch when using the sample condition. How are the threads scheduled? Does the scheduler select only those threads to execute in a warp for which (threadIdx / warpSize) is equal?

Regards,

Cosmo

That is pretty much correct. Divergence between different warps generally does not affect performance. Branch divergence within a warp can be a total performance killer.
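For illustration, here is a minimal sketch (the kernel and buffer names are just placeholders, not from the guide) contrasting a condition that is aligned with warp boundaries with one that diverges inside a warp:

    // Contrast a warp-aligned branch with an intra-warp divergent branch.
    __global__ void branchExample(float *out, int n)
    {
        int tid = blockIdx.x * blockDim.x + threadIdx.x;
        if (tid >= n) return;

        // Warp-aligned: (threadIdx.x / warpSize) has the same value for all
        // 32 threads of a warp, so every thread in the warp takes the same
        // path. Different warps may take different paths, but that costs
        // essentially nothing.
        if ((threadIdx.x / warpSize) % 2 == 0)
            out[tid] = 1.0f;
        else
            out[tid] = 2.0f;

        // Divergent: (threadIdx.x % 2) differs between neighbouring threads
        // of the same warp, so the warp has to execute both paths serially.
        if (threadIdx.x % 2 == 0)
            out[tid] += 10.0f;
        else
            out[tid] += 20.0f;
    }

Both branches compute something per thread; only the second one pays the divergence penalty, because the two paths are taken within the same warp.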

So I can’t control which threads will be selected for execution in a warp? Is that correct?

I tended to think that the threads are bundled into warps according to their IDs, meaning threads 0-31 are executed in the first warp, threads 32-63 in the second warp, and so on.

That is how it works. Only when there is intra-warp divergence will you start seeing penalties from branching.
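If it helps, here is a small sketch (again just an illustrative kernel of my own, assuming a single 1D block launch) of how consecutive thread IDs map onto warps and lanes:

    // Record which warp and which lane within the warp each thread belongs to.
    __global__ void warpIdExample(int *warpIdOut, int *laneIdOut)
    {
        int linear = threadIdx.x;               // 1D block: linear thread index
        warpIdOut[linear] = linear / warpSize;  // threads 0-31 -> warp 0, 32-63 -> warp 1, ...
        laneIdOut[linear] = linear % warpSize;  // position (lane) within the warp
    }

Launched as, say, warpIdExample<<<1, 128>>>(d_warp, d_lane), threads 0-31 report warp 0, threads 32-63 report warp 1, and so on, which is exactly the bundling described above.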