I am using
nsys profile --gpu-metrics-device all ./my_executable to trace the metrics of GPU, but I didn’t get anything about tensor core metrics as written in the nsight-systems UserGuide
How can I get SM instructions/Tensor Active row in the nsight GUI?
The architecture of the GPU is Hopper.
@pkovalenko can you help out here?
Please let me know the following:
- NSys version
- GPU driver version
- Output of
nsys profile --gpu-metrics-device help
Is there anything mentioning GPU Metrics in Diagnostic Summary page?
Hi, the environment is as follows:
The messages in diagnostic summary page are as follows:
2024.1.1 is our latest release. Are you sure you’re running that? The diagnostic summary page looks quite old. The GUI looks new, but nsys CLI that collected your report is definitely not 2024.1.1.
In any case, the error message explains the issue. You have to either run nsys as superuser, or allow access to perf counters to all users. Please refer to https://developer.nvidia.com/ERR_NVGPUCTRPERM for detailed guidance.
Sorry, the GUI version of nsys is
2024.1.1, while the nsys CLI version is
I ran nsys as superuser and collected the the metrics about tensor core utilization. Thank you for your answer.
This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.