I am using Nsight Compute for performance measurement, and I have a question about how to read the item.
It uses the GPU of A100 and used Nsight Compute with the following settings.
If “Clock Control” is “None”, the frequency is fixed at 1410 Mhz. Why is this value not constant?
In the first place, I think “SM Frequency” refers to the frequency, but what does “SM Frequency” refer to?
Regarding the question, if I set “Clock Control” to “Base”, will this value be 765 Mhz?
“None” means there is no clock control, therefore the clock starts out at the boost frequency and during kernel execution, the frequency will vary, depending on workload, temperature etc.
“Base” means the clock is locked at the base frequency and will not vary, enabling repeatability between profile runs.
For measurement over “several thousand of times in a loop”, as outlined in this thread, instrumentation and average/median of the result is probably more realistic.