Hi. I found there is an instruction “bra.uni” in PTX besides “bra”, to mark a branching point is non-divergent. So is there some equivalent way to specify this in CUDA’s C/C++ programming language just like the “__builtin_expect()” in gcc for branching prediction? Thanks.
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Bra.uni optimization Compiler optimization | 3 | 9136 | October 27, 2010 | |
Branch Predication in CUDA Any example in SDK | 1 | 1331 | December 25, 2009 | |
Branch or not | 7 | 2421 | February 28, 2018 | |
CUDA compiler needs too much help in order to use select instead of branch | 6 | 540 | October 12, 2021 | |
Is there any idiom to indicate that branches are uniform? | 0 | 479 | June 16, 2016 | |
Branching Performance Hit | 15 | 2665 | June 30, 2009 | |
about the divergent branching | 1 | 1201 | February 3, 2009 | |
branchless exchange based on condition ? | 1 | 992 | February 9, 2009 | |
branch predication | 0 | 3748 | November 12, 2009 | |
[Solved] PTX ISA predicated execution and the warp divergence issue | 6 | 2944 | January 14, 2014 |