Although I collected all stall related metrics, here, it seems that the sum of average (or maximum) values is far less than 100%.
Metric Name Metric Unit Minimum Maximum Average
--------------------------------------------------------------- ----------- --------- --------- ---------
smsp__warp_issue_stalled_barrier_per_warp_active.pct % 4.186474 7.639925 6.399756
smsp__warp_issue_stalled_dispatch_stall_per_warp_active.pct % 2.238771 2.710122 2.538968
smsp__warp_issue_stalled_drain_per_warp_active.pct % 0.002744 0.005988 0.003129
smsp__warp_issue_stalled_imc_miss_per_warp_active.pct % 0.089172 0.197238 0.104320
smsp__warp_issue_stalled_lg_throttle_per_warp_active.pct % 0.000000 0.000004 0.000000
smsp__warp_issue_stalled_long_scoreboard_per_warp_active.pct % 0.804119 1.863121 0.945215
smsp__warp_issue_stalled_math_pipe_throttle_per_warp_active.pct % 1.998483 2.132997 2.027976
smsp__warp_issue_stalled_membar_per_warp_active.pct % 0.000000 0.000000 0.000000
smsp__warp_issue_stalled_mio_throttle_per_warp_active.pct % 0.282937 0.387940 0.299651
smsp__warp_issue_stalled_misc_per_warp_active.pct % 0.000061 0.000136 0.000071
smsp__warp_issue_stalled_no_instruction_per_warp_active.pct % 6.311340 9.950329 7.975228
smsp__warp_issue_stalled_not_selected_per_warp_active.pct % 28.167789 34.228746 31.538357
smsp__warp_issue_stalled_short_scoreboard_per_warp_active.pct % 5.872918 6.517629 6.016746
smsp__warp_issue_stalled_sleeping_per_warp_active.pct % 0.000000 0.000000 0.000000
smsp__warp_issue_stalled_tex_throttle_per_warp_active.pct % 0.000000 0.000000 0.000000
smsp__warp_issue_stalled_wait_per_warp_active.pct % 19.641737 20.246015 19.795740
Is something missing here?