nvprof: Question about the sm_efficiency metric

geoffxy · April 8, 2019, 12:38am

I had a few questions about the sm_efficiency metric. My understanding from the profiler documentation is that the sm_efficiency metric reports the percentage of time where there is at least one active warp on an SM and that “active warps” include warps that are stalled. Is this interpretation correct?

Is the sm_efficiency a percentage of the kernel’s total runtime?
Since sm_efficiency is not 100% in every case, what causes the period of time where no SMs have any active warps? In other words, what is happening on the GPU when the kernel is still “running”, but no SMs have any active warps?

SagarAgrawal · April 8, 2019, 5:58am

The formula for sm_efficiency is (active_cycles / elapsed_cycles_sm) * 100. Both of these events can be profiled using the “-e” option in nvprof.

sm_efficiency tells you for what percentage of the elapsed cycles on the SMs (elapsed_cycles_sm) there was any work happening on an SM (active_cycles).
There can be multiple reasons for low sm_efficiency. One of them is that the kernel was not launched with a configuration that fully occupies the SMs. For example, if the GPU has 10 SMs but the launch configuration is such that warps are only launched on 1 SM, then the other 9 SMs will be idle, and in that case you will get a low sm_efficiency value.
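
To make the 10-SM example concrete, here is a rough sketch of the arithmetic in Python. It assumes that both event counts are summed over all SMs, and the cycle count is a made-up number for illustration, not real profiler output. The function name `sm_efficiency` is just a helper for this sketch and not part of any NVIDIA tool.

```python
def sm_efficiency(active_cycles, elapsed_cycles_sm):
    """Return sm_efficiency as a percentage, given the two event counts."""
    if elapsed_cycles_sm <= 0:
        raise ValueError("elapsed_cycles_sm must be positive")
    return 100.0 * active_cycles / elapsed_cycles_sm


# Hypothetical scenario: 10 SMs, a kernel that runs for 1,000,000 cycles,
# and a launch configuration that only puts warps on 1 SM.
num_sms = 10
busy_sms = 1
kernel_cycles = 1_000_000  # made-up value, for illustration only

# Assumption: both counters are aggregated across all SMs.
active_cycles = busy_sms * kernel_cycles      # only the busy SM counts active cycles
elapsed_cycles_sm = num_sms * kernel_cycles   # every SM counts elapsed cycles

print(sm_efficiency(active_cycles, elapsed_cycles_sm))
```

Under these assumptions, only one tenth of the total SM cycles have an active warp, so the value comes out at about 10% even though the kernel is "running" the whole time.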
