Difference between the nvprof metrics eligible_warps_per_cycle, sm_efficiency, and achieved_occupancy?

yuzou · May 6, 2018, 6:40pm

I’m using nvprof to profile my CUDA code. What I care about is the number of clock cycles during which the SM is completely idle. So I measured three nvprof metrics: eligible_warps_per_cycle, sm_efficiency, and achieved_occupancy. But the results are confusing.

Kernel: scatterDevice(float*, edgeBlk_t*, msgBlk_t*)
1 eligible_warps_per_cycle 0.032613
1 sm_efficiency_instance 100.00%
1 sm_efficiency 100.00%
1 achieved_occupancy 0.485068

If I understand correctly, eligible_warps_per_cycle is the average number of warps that are eligible to issue per active cycle, and sm_efficiency is the percentage of time during which at least one warp is active. So why is sm_efficiency 100% while eligible_warps_per_cycle is only 0.03?
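
To make the two definitions concrete, here is a toy model in Python. It is only a sketch, not how nvprof computes anything internally: it applies the definitions quoted above to a made-up per-cycle trace. It assumes that an "active" warp is one that is resident on the SM, whether or not it is stalled, which is my reading of the profiler documentation. The function name `summarize`, the 64-warp limit, and the trace numbers are all hypothetical and chosen only to land near the values above.

```python
# Toy model only: applies the metric definitions to a made-up trace.
MAX_WARPS_PER_SM = 64  # assumed limit; the real value depends on the GPU architecture


def summarize(trace, max_warps=MAX_WARPS_PER_SM):
    """trace holds one (active_warps, eligible_warps) pair per elapsed cycle."""
    elapsed = len(trace)
    # An "active cycle" is one in which at least one warp is resident on the SM.
    active = [(a, e) for a, e in trace if a > 0]
    n_active = len(active)
    if elapsed == 0 or n_active == 0:
        return {"sm_efficiency": 0.0,
                "achieved_occupancy": 0.0,
                "eligible_warps_per_cycle": 0.0}
    return {
        # Fraction of elapsed cycles with at least one resident warp.
        "sm_efficiency": n_active / elapsed,
        # Average resident warps per active cycle, relative to the hardware limit.
        "achieved_occupancy": sum(a for a, _ in active) / (n_active * max_warps),
        # Average number of warps ready to issue per active cycle.
        "eligible_warps_per_cycle": sum(e for _, e in active) / n_active,
    }


# Hypothetical trace: 31 warps are resident in every one of 1000 cycles,
# but a warp is ready to issue in only 33 of those cycles.
trace = [(31, 1)] * 33 + [(31, 0)] * 967
metrics = summarize(trace)
print(metrics)
```

In this toy trace the SM always has resident warps, so sm_efficiency is 100%, yet nothing can issue in 967 of the 1000 cycles because every resident warp is stalled. Under that reading, sm_efficiency says nothing about whether warps are making progress, while a low eligible_warps_per_cycle suggests the warps spend most of their time waiting (for example on memory or synchronization).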

Thank you in advance.
