Tracing cuDNN library version 90.6 is currently not supported

nkem · January 26, 2025, 6:16pm

First time using Nsight Systems to learn how to profile my code, and I noticed a warning about cuDNN 9 in the Diagnostics Summary - does Nsight Systems not support cuDNN 9?

I should point out that I can see _5x_cudnn_ampere_scudnn_128x64_relu_xregs_large_nn_v1 showing up as kernel in the Timeline and Events View. Does that mean everything is working correctly and I can ignore the warning?

System information:

Ubuntu 24.04 (Docker)
Python 3.11.11
Pytorch 2.5.1 (built from source):
- CUDA 12.6
- cuDNN 9.6.0.74-1
NVIDIA Nsight Systems 2025.1.1.103-251135427971v0

Full Diagnostics Summary:

Source	Process ID	Time	Description
Daemon		-00:00.001	Unable to configure the collection of CPU IP/backtrace samples, context switch data, or event sampling data. Try the ‘nsys status --environment’ command to learn more.
Analysis		00:00.000	Profiling has started.
Daemon	1447	00:00.002	Process was launched by the profiler, see /tmp/nvidia/nsight_systems/quadd_session_101435/streams/pid_1447_stdout.log and stderr.log for program output
Injection	1447	00:00.020	Common injection library initialized successfully.
Analysis		00:00.025	No MMAP events were received for process with pid 1447 before attempting to resolve symbol. This might cause symbols to remain unresolved for the process.
Injection	1447	00:00.028	OS runtime libraries injection initialized successfully.
Analysis		00:00.036	Scheduling information is absent. The thread activity is deduced based on OS runtime libraries traces. This is inaccurate and does not take into account asynchronous interrupts and exception faults.
Analysis	1447	00:00.036	No NVTX events collected. Does the process use NVTX?
Analysis	1447	00:00.036	Number of CUDA events collected: 2.084.
Analysis	1447	00:00.036	Number of OS runtime libraries events collected: 4.095.
Analysis	1447	00:00.036	cuDNN profiling might have not been started correctly.
Analysis	1447	00:00.036	No cuDNN events collected. Does the process use cuDNN?
Injection	1447	00:00.606	Tracing cuDNN library version 90.6 is currently not supported.Loading ‘/opt/nvidia/nsight-systems-cli/2025.1.1/target-linux-x64/libToolsInjectionCuDNN64_90.so’ failed: dlopen hook: ‘/opt/nvidia/nsight-systems-cli/2025.1.1/target-linux-x64/libToolsInjectionCuDNN64_90.so’: cannot open shared object file: No such file or directory.
Injection	1447	00:00.607	cuDNN symbols found in /lib/x86_64-linux-gnu/libcudnn_graph.so.9.6.0 symbol table. No cuDNN trace will be generated from that library. Was cuDNN statically linked?
Injection	1447	00:01.746	Buffers holding CUDA trace data will be flushed on CudaProfilerStop() call. See --flush-on-cudaprofilerstop to control this behavior.
Injection	1447	00:01.761	Loaded CUPTI library: /opt/nvidia/nsight-systems-cli/2025.1.1/target-linux-x64/libcupti.so.12.8
Injection	1447	00:02.071	Enabling trace for device graph launch
Injection	1447	00:02.080	CUDA injection initialized successfully.
Injection	1447	00:02.298	cuDNN symbols found in /lib/x86_64-linux-gnu/libcudnn_graph.so.9.6.0 symbol table. No cuDNN trace will be generated from that library. Was cuDNN statically linked?
Injection	1447	00:02.299	cuDNN symbols found in /lib/x86_64-linux-gnu/libcudnn_graph.so.9.6.0 symbol table. No cuDNN trace will be generated from that library. Was cuDNN statically linked?
Injection	1447	00:02.304	cuDNN symbols found in /lib/x86_64-linux-gnu/libcudnn_graph.so.9.6.0 symbol table. No cuDNN trace will be generated from that library. Was cuDNN statically linked?
Injection	1447	00:03.154	Number of CUPTI events produced: 2.161, CUPTI buffers: 50.
Analysis		00:03.498	Profiling has stopped.

nsys command:
nsys profile -t cuda,nvtx,osrt,cudnn,cublas -x true -o cudnn_test --cuda-event-trace=false python cudnn.py

cudnn.py

import torch

input_tensor = torch.randn(64, 3, 224, 224).cuda()

for _ in range(100):
_ = torch.nn.functional.conv2d(input_tensor, torch.randn(64, 3, 3, 3).cuda())

(Also I tried to tag this topic with cudnn, but it wouldn’t let me)

hwilper · January 27, 2025, 5:13pm

@liuyis can you give more info about the current status of cuDNN 9?

liuyis · January 28, 2025, 3:56am

I checked our support - Nsys indeed does not support CuDNN 9+ trace yet. The latest version we support is CuDNN 8

The reason you see _5x_cudnn_ampere_scudnn_128x64_relu_xregs_large_nn_v1 in the report is because of the CUDA trace feature (--trace=cuda). CuDNN trace, if supported, will provide additional CuDNN API trace.

Topic		Replies	Views
Nsight systems cuDNN support Profiling Linux Targets	3	970	August 28, 2019
No cuDNN info in nsys traces Profiling Linux Targets nsight , pytorch	2	1534	May 16, 2022
Nsight --trace can not trace cublas and cudnn ... in windows10 Profiling x86 Windows Targets	1	1083	January 22, 2022
Broken Backtraces Profiling Linux Targets cudnn	2	361	April 11, 2025
Profling a simple deep learning code : no python backtrace + cannot use cudnn trace Profiling x86 Windows Targets cudnn	19	1417	December 13, 2023
How to profile an application with Cuda 12.1 driver? Profiling Linux Targets	19	2951	July 18, 2023
Profiling DCGan Tutorial Spins forever Nsight Compute	13	1374	June 7, 2020
Nsys does not show CUDA kernels Profiling Linux Targets	6	1480	December 12, 2022
Nsight nsys not collecting any CUDA kernel data (2023.1.2.43-32377213v0) Profiling Linux Targets	19	3054	September 14, 2023
Generating CUPTI_* tables with nsys Profiling Linux Targets cuda	25	2049	January 12, 2023

Tracing cuDNN library version 90.6 is currently not supported

Related topics