Measure warp context switching time

As we know, when a kernel stalls on a global memory access or a read-after-write dependency, the scheduler switches execution to another warp to hide the latency and maximize throughput.
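One caveat: on NVIDIA GPUs all resident warps keep their registers and state on-chip, so the warp "switch" itself is performed by the hardware scheduler with essentially no overhead; what you can actually measure is the memory latency that gets exposed when there are too few warps to switch to. Below is a minimal sketch of the classic `clock64()` microbenchmark approach under that assumption: the kernel name, the chain of 4 dependent loads, and the launch configurations are all illustrative choices, not a definitive methodology.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Time a chain of dependent global loads with clock64(). Each load's address
// depends on the previous result, so the warp must stall on every load.
// Comparing a 1-warp launch against a many-warp launch shows how much of
// that latency the scheduler can hide by switching among ready warps.
__global__ void latency_probe(const int* __restrict__ data,
                              long long* cycles, int* sink)
{
    int v = 0;
    long long start = clock64();
    v = data[v];  // dependent load chain: data is zero-filled,
    v = data[v];  // so every load reads index 0 and the chain
    v = data[v];  // cannot be reordered or issued in parallel
    v = data[v];
    long long stop = clock64();

    if (threadIdx.x == 0 && blockIdx.x == 0) {
        *cycles = stop - start;
    }
    *sink = v;  // keep the compiler from optimizing the loads away
}

int main()
{
    const int n = 1024;
    int *d_data, *d_sink;
    long long *d_cycles;
    cudaMalloc(&d_data, n * sizeof(int));
    cudaMalloc(&d_sink, sizeof(int));
    cudaMalloc(&d_cycles, sizeof(long long));
    cudaMemset(d_data, 0, n * sizeof(int));  // data[0] == 0 keeps the chain at index 0

    // One warp: the stalls are fully exposed.
    latency_probe<<<1, 32>>>(d_data, d_cycles, d_sink);
    long long cycles = 0;
    cudaMemcpy(&cycles, d_cycles, sizeof(long long), cudaMemcpyDeviceToHost);
    printf("1 warp  : %lld cycles for 4 dependent loads\n", cycles);

    // 32 warps: the scheduler has other warps to run while each one waits.
    latency_probe<<<1, 1024>>>(d_data, d_cycles, d_sink);
    cudaMemcpy(&cycles, d_cycles, sizeof(long long), cudaMemcpyDeviceToHost);
    printf("32 warps: %lld cycles for 4 dependent loads\n", cycles);

    cudaFree(d_data); cudaFree(d_sink); cudaFree(d_cycles);
    return 0;
}
```

Note that `clock64()` reads a per-SM counter, so per-warp timings are only comparable within one SM; for whole-kernel throughput comparisons you would use CUDA events or a profiler instead.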

I was wondering if anyone knows how to measure the time of warp switching.

Any suggestions or comments are welcome.

It seems like you’ve already asked this, and Greg Smith gave you some pretty useful comments:

cuda - Measure the overhead of context switching in GPU - Stack Overflow