Hello,
I would like to know if there is any documentation regarding/is it feasible to/ find out how thread divergent affects the overall performances ?
For example can we predict how many % the maximum throughput can be achieved given X number of thread divergences (collected by compute profiling ?)
Same thing for texture cache miss, L2 miss would be useful too.
Thanks in advance