Looking at the conversation above, I saw use of a “globaltimer” that can be used for timing. Some questions follow:
how can I access this in cuda in C/C++
is this clock synchronized across all SMs. As I understand it clock() and clock64() don’t return a clock value that is from a “synchronized” clock value across the SMs.
at what rate does the timer tick? On x86 it’s ~20MHz and it increments by a clock rate that doesn’t vary with pstate. This is nice because you can use the TSC (time stamp clock) to determine time expired.
can I modify this clock rate in CUDA with some calls to apis, or some other functionality.
The post above says NSIGHT ramps this timer from 1MHz to 30+MHz… which is nice. Is there a way of doing this without NSIGHT.
Many thanks for any helpful feedback.
Many thanks Robert. I can not at least monitor the freq of operation to some degree of accuracy. A shame you can’t increase the update freq on the globaltimer from 1MHz to 30+.