About if-else between warps

Shaquille · July 26, 2023, 6:27am

hello, NV experts
the performance will be very poor, if there is if-else in warp, like this:

val= global[lane_idx];
if(val >= 16){
    ......
    ......
}else{
    ......
    ......
}

now, the if-else is not occured in warps, it appeared between warps, like this:

int warp_idx = threadIdx.x / 32;
if(warp_idx > 1){
    ......
    function for warp 0&1
    ......
}else{
    ......
    function for warp 2&3
    ......
}

I am not sure the effection of above code.
I found there is not any poor effection in my application, I found only the code’s size become bigger.
I’m not sure how it behave on other CUDA-ARCH(my arch is 8.6, ampere)。
So, how to evaluate above code?

njuffa · July 26, 2023, 7:18am

A data-drivenif-then-else does not necessarily cause a performance problem. The compiler may apply if-conversion, or if the branch is retained, cases of actual branch divergence may be rare.

My usual recommendation is to write CUDA code in a natural fashion, and start worrying about branch divergence only when the CUDA profiler indicates it is a non-trivial detractor from application-level performance.

Shaquille · July 26, 2023, 7:30am

thank you

Topic		Replies	Views
Question about divergent branching CUDA Programming and Performance	3	6439	May 21, 2009
Is there efficient way to deal with if/else in the kernel CUDA Programming and Performance	4	14071	June 14, 2009
Thread divergence due to IF CUDA Programming and Performance	3	6864	September 13, 2007
If loops in kernel a problem? CUDA Programming and Performance	3	1747	February 26, 2009
Must all threads execute the same code? "Branch divergence occurs only within a warp" CUDA Programming and Performance	5	2968	December 28, 2008
Thread divergence when block size is equal to warp size CUDA Programming and Performance	2	604	June 5, 2019
Avoid branching ... CUDA Programming and Performance	3	3615	May 19, 2010
Question about divergence and branch granularity CUDA Programming and Performance	1	885	April 25, 2012
Thread question CUDA Programming and Performance	5	1882	December 2, 2008
Will this loop cause warp divergence? CUDA Programming and Performance kernel	0	250	December 10, 2023

About if-else between warps

Related topics