RTX 3080 LHR Missing gpu__dram_throughput CUDA metric

patel · January 14, 2022, 4:25pm

As part of a machine learning project, we are optimizing some custom CUDA kernels.

We are trying to profile them with Nsight Compute, but we hit the following error on an RTX 3080 (LHR) when profiling a simple wrapper around the CUDA kernel:


> ==ERROR== Failed to access the following 4 metrics: dram__cycles_active.avg.pct_of_peak_sustained_elapsed, dram__cycles_elapsed.avg.per_second, gpu__compute_memory_throughput.avg.pct_of_peak_sustained_elapsed, gpu__dram_throughput.avg.pct_of_peak_sustained_elapsed
>
> ==ERROR== Failed to profile kernel "kernel" in process 20204

Diffing the metrics available on an RTX 3080 Ti (non-LHR) against those on an RTX 3080 (LHR), as reported by nv-nsight-cu-cli --devices 0 --query-metrics, we notice that the following metrics are missing on the RTX 3080 LHR:

gpu__compute_memory_request_throughput
gpu__compute_memory_throughput
gpu__dram_throughput

All of these are required for even basic memory profiling with Nsight Compute. Every other metric is present. Is this a limitation of LHR cards? Why would these metrics be absent?
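For anyone who wants to repeat the comparison, here is a minimal Python sketch of the diff step. It assumes the --query-metrics output from each card has already been saved as text, and that each metric line starts with a name containing a double underscore (as in gpu__dram_throughput); that layout is an assumption, so adjust the parsing if your Nsight Compute version prints something different. The helper names and the sample strings are hypothetical placeholders, not real tool output.

```python
from __future__ import annotations


def parse_metric_names(query_output: str) -> set[str]:
    """Collect metric names from saved --query-metrics text.

    Assumption: a metric line begins with the metric name, and metric
    names contain a double underscore (e.g. gpu__dram_throughput).
    Header and separator lines are skipped by that same rule.
    """
    names = set()
    for line in query_output.splitlines():
        tokens = line.split()
        if tokens and "__" in tokens[0]:
            names.add(tokens[0])
    return names


def missing_metrics(reference_output: str, candidate_output: str) -> list[str]:
    """Return metrics listed for the reference card but not the candidate."""
    reference = parse_metric_names(reference_output)
    candidate = parse_metric_names(candidate_output)
    return sorted(reference - candidate)


# Illustrative stand-ins for the two saved outputs (not real tool output).
reference_sample = """\
dram__cycles_active  placeholder description
gpu__compute_memory_request_throughput  placeholder description
gpu__compute_memory_throughput  placeholder description
gpu__dram_throughput  placeholder description
"""
candidate_sample = """\
dram__cycles_active  placeholder description
"""

for name in missing_metrics(reference_sample, candidate_sample):
    print(name)
```

In practice you would read the two saved files into strings and pass the non-LHR output as the reference and the LHR output as the candidate.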

Details:

CUDA version: 11.5
Driver version: 497.29
Windows 10

patel · January 19, 2022, 4:21pm

+Bumping up!

jmarusarz · January 19, 2022, 4:50pm

Thanks for submitting this issue. We are actively investigating what’s going on. I will update you as soon as I have more information.

jmarusarz · January 20, 2022, 3:36pm

We recently released CUDA 11.6. Are you able to install the newest version and check whether the issue still reproduces? We are still trying to reproduce the issue in our lab. Thanks.

patel · January 20, 2022, 7:36pm

Upgrading to CUDA 11.6 resolved the issue. Profiling now works correctly.

Working configuration:
Gigabyte RTX 3080 10G Turbo (LHR)
CUDA 11.6
Driver version 511.23
Windows 10
