Thanks, but I’m wondering how this happens. This means that runtime system checks control divergence at every branch instruction, and it means that control diverge check routine in in a critical path of GPU H/W. :angel:
Yep, it checks for divergence at every branch instruction. Wild, when you think about it! But remember the hardware and scheduling software has been DESIGNED to do this so it’s extremely efficient.
Nonetheless, you should still avoid divergence if you can, but the nice part is how it Just Works when you do use it.