GPU cache hit fluctuation problem

Ahuan · September 26, 2021, 11:23am

Ncu mainly uses the pmu unit for statistics. The pmu unit is limited, so multiple replays are required to measure all the parameters. You can use operator serial, clock frequency lock, and clear the cache during multiple runs to avoid the impact of the replay process.But the results we measured with ncu still have various abnormal cache hit fluctuations, and there is no way to explain.I asked a person from nvidia on the forum before. He said that as long as the graphics card is not being used by others, the cache hit rate theoretically will not fluctuate, but we just can’t reproduce it.
How can I explain this fluctuation in cache hits?
And, what does the “steady state” in the official document actually mean?
https://docs.nvidia.com/nsight-compute/ProfilingGuide/index.html#range-and-precision

Topic		Replies	Views
GPU cache hit rate fluctuation problem Nsight Compute cuda	0	467	October 21, 2021
CUDA precision of desktop GPU CUDA Programming and Performance	9	2632	January 22, 2013
Nsight Compute: The frequency is not fixed Nsight Compute	4	1117	May 19, 2024
Confusion about NSight Compute profiler results Nsight Compute cuda , kernel , nvbugs	1	519	June 5, 2020
Computation crash = stuck at 574mhz CUDA Programming and Performance	9	1277	August 4, 2015
Unstable performance measured by cuda event CUDA Programming and Performance	3	450	December 6, 2022
Ncu problems Nsight Compute	6	911	December 3, 2022
Nvprof and Nsight returning different results for L1 and L2 cache hit rates Nsight Compute	4	645	August 13, 2019
Question about GPU L2 cache memory access。 Nsight Compute cuda , kernel	5	1036	February 21, 2024
Computation crash = stuck at 574mhz CUDA Programming and Performance	0	480	August 2, 2015

GPU cache hit fluctuation problem

Related topics