System: Ubuntu 16.04
Driver: 418.56 (installed through apt-get PPA)
CUDA toolkit: 10.1.105 with Nsight Compute 2019.3
Benchmark: VectorAdd in CUDA samples
When I ran nv-nsight-cu-cli --query-metrics, I was able to see metrics of the form smsp__sass_thread_inst_executed_op_*. However, when I tried capturing those metrics with nv-nsight-cu-cli --metrics <smsp...> ./vectorAdd, the profiler reported “(!)n/a” for them. When I ran without --metrics, or with the predefined section files in the Nsight package, I was able to see other performance counters with numerical results printed.
Are the smsp__sass_thread_inst_executed_op_* metrics actually available in Nsight Compute?