I’m new to CUDA.
I start Nsight Compute as root using
My code has 2 kernels.
The first one profiles correctly for all (!) sections, but the second one doesn’t profile for the following sections:
- Instruction Statistics
- Source Counters
With any of the above sections selected, I get:
The profiler returned an error code: 1 (0x1)
The first errors in the report are:
[Error] Rule Bottleneck returned an error: Metric launch__waves_per_multiprocessor not found
[Error] <built-in function IAction_metric_by_name> returned a result with an error set
/opt/nvidia/nsight-compute/2020.2.1/target/linux-desktop-glibc_2_11_3-x64/../../sections/SpeedOfLight.py:45
/opt/nvidia/nsight-compute/2020.2.1/target/linux-desktop-glibc_2_11_3-x64/../../sections/NvRules.py:365
[Error] Rule Roofline Analysis returned an error: Metric sm__sass_thread_inst_executed_op_ffma_pred_on.sum.peak_sustained not found
Is there something in my code that prevents these sections from being profiled?