Modeling CUDA performance under non-ideal events


I would like to know whether there is any documentation on how thread divergence affects overall performance, or whether it is feasible to work this out.

For example, can we predict what percentage of the maximum throughput can be achieved given X thread divergences (as collected by the compute profiler)?
The same kind of estimate for texture cache misses and L2 misses would be useful too.
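
To make the question concrete, here is a rough sketch of the kind of first-order estimate I have in mind for divergence. It assumes a warp of 32 threads, a single two-way branch whose two paths are executed one after the other when the warp diverges, and it ignores latency hiding, scheduling, and all memory effects. The function names and the per-path instruction counts are hypothetical, not something the profiler reports in this form.

```python
WARP_SIZE = 32  # threads per warp on current NVIDIA GPUs


def simt_efficiency(active_threads_per_issue):
    """Fraction of SIMT lanes doing useful work over a list of issued
    warp instructions, given the active thread count for each one."""
    issues = list(active_threads_per_issue)
    if not issues:
        raise ValueError("need at least one issued instruction")
    return sum(issues) / (WARP_SIZE * len(issues))


def divergent_branch_efficiency(threads_taking_if, if_cost, else_cost):
    """Lane efficiency of one warp on a two-way branch.

    Assumption: when both paths are taken, the warp runs the 'if' path
    (if_cost instructions) and then the 'else' path (else_cost
    instructions), with only the matching threads active on each.
    """
    threads_taking_else = WARP_SIZE - threads_taking_if
    issues = []
    if threads_taking_if > 0:
        issues += [threads_taking_if] * if_cost
    if threads_taking_else > 0:
        issues += [threads_taking_else] * else_cost
    return simt_efficiency(issues)


if __name__ == "__main__":
    # Half the warp takes each path, both paths equally long.
    print(divergent_branch_efficiency(16, 10, 10))
    # No divergence: every thread takes the 'if' path.
    print(divergent_branch_efficiency(32, 10, 10))
```

Under these assumptions an evenly split warp with equal-length paths reaches only half of the peak instruction throughput for that region. What I am unsure about is how far such a simple lane-utilization figure is from measured performance, and whether anything comparable exists for cache misses.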

Thanks in advance