What does Achieved Active Warps Per SM in Nsight mean and how do I calculate it?

Chad.Ding · June 17, 2020, 2:23pm

Section: Occupancy

Metric                            Unit         Value
Block Limit SM                    block        32
Block Limit Registers             block        4
Block Limit Shared Mem            block        inf
Block Limit Warps                 block        2
Achieved Active Warps Per SM      warp         48.50
Achieved Occupancy                %            75.78
Theoretical Active Warps per SM   warp/cycle   64
Theoretical Occupancy             %            100

There is a metric called Achieved Active Warps Per SM. What does it mean, and which kernel parameters affect it (block size, grid size, and so on)? Also, can I calculate it without running the kernel, using only the code and the launch configuration?

Chad.Ding · June 22, 2020, 3:18am

Can anyone help me?

felix_dt · June 22, 2020, 9:36am

You cannot statically compute Achieved Active Warps or Achieved Occupancy without running the kernel. The Theoretical Active Warps and Theoretical Occupancy metrics, by contrast, are available using only the kernel launch parameters, the GPU, and the CUDA cache configuration settings, and can be computed statically using the CUDA Occupancy Calculator (see the CUDA Toolkit Documentation).
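As a rough illustration of the static calculation, here is a minimal Python sketch that derives the theoretical numbers from the block limits in the report above. It is not the Occupancy Calculator itself. The function name is hypothetical, and the values of 32 warps per block and 64 maximum warps per SM are assumptions inferred from the reported figures (2 resident blocks giving 64 warps at 100% occupancy); the post does not state the block size or the GPU.

```python
import math


def theoretical_active_warps(block_limits, warps_per_block, max_warps_per_sm):
    """Return (resident_blocks, active_warps, occupancy_percent) for one SM.

    The number of blocks that can be resident at once is the smallest
    of the individual block limits.
    """
    resident_blocks = min(block_limits.values())
    active_warps = min(resident_blocks * warps_per_block, max_warps_per_sm)
    occupancy = 100.0 * active_warps / max_warps_per_sm
    return resident_blocks, active_warps, occupancy


# Block limits copied from the Occupancy section in the question.
limits = {"sm": 32, "registers": 4, "shared_mem": math.inf, "warps": 2}

# ASSUMPTION: 32 warps per block and 64 warps per SM, inferred from the report.
blocks, warps, occ = theoretical_active_warps(
    limits, warps_per_block=32, max_warps_per_sm=64
)
print(blocks, warps, occ)
```

With these assumed inputs the tightest limit is Block Limit Warps, which matches the reported Theoretical Active Warps per SM of 64.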

The achieved metrics depend on the actual workload (i.e. your code). Achieved Active Warps Per SM is the average number of warps in flight per active cycle over the runtime of the kernel, as suggested by the underlying metric name (sm__warps_active.avg.per_cycle_active).
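To make the relationship between the reported numbers concrete, the following Python sketch shows how an average over active cycles and the occupancy percentage could be computed. The function names and the per-cycle trace are hypothetical, and treating a cycle with zero resident warps as inactive is a simplifying assumption about what "per_cycle_active" means; the real values come from hardware counters collected by Nsight Compute.

```python
def average_active_warps(warps_per_cycle):
    """Average resident warps over cycles where at least one warp is resident.

    ASSUMPTION: a cycle with zero resident warps counts as inactive
    and is excluded from the average.
    """
    active = [w for w in warps_per_cycle if w > 0]
    return sum(active) / len(active) if active else 0.0


def achieved_occupancy(achieved_active_warps, max_warps_per_sm):
    """Achieved occupancy in percent: achieved warps relative to the SM maximum."""
    return 100.0 * achieved_active_warps / max_warps_per_sm


# Hypothetical trace: resident warps sampled on six consecutive cycles.
trace = [64, 64, 32, 32, 0, 0]
toy_average = average_active_warps(trace)

# Values from the report in the question; 64 is the assumed SM maximum.
reported = round(achieved_occupancy(48.50, 64), 2)
print(toy_average, reported)
```

Dividing the reported 48.50 achieved warps by the assumed maximum of 64 reproduces the reported Achieved Occupancy of 75.78%.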

Also note this part of the Nsight Compute description of this section: "Large discrepancies between the theoretical and the achieved occupancy during execution typically indicates highly imbalanced workloads."
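A toy model can show why imbalance lowers the achieved figure. The Python sketch below is purely illustrative: the function name and the block durations are hypothetical, and it assumes that all blocks start together on one SM and that each block keeps all of its warps resident until it finishes. Real kernels are more complicated, since warps within a block can also exit at different times.

```python
def avg_active_warps(block_durations, warps_per_block):
    """Time-averaged resident warps on one SM.

    ASSUMPTIONS: every block starts at cycle 0, holds warps_per_block
    warps for its whole duration, and the SM is active until the
    longest block finishes.
    """
    active_cycles = max(block_durations)
    warp_cycles = warps_per_block * sum(block_durations)
    return warp_cycles / active_cycles


# Two resident blocks of 32 warps each (assumed block size).
balanced = avg_active_warps([100, 100], warps_per_block=32)
imbalanced = avg_active_warps([100, 50], warps_per_block=32)
print(balanced, imbalanced)
```

In this model, two blocks that finish together keep all 64 warps resident, while one block finishing halfway through drops the average to 48 warps, even though the theoretical occupancy is unchanged.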
