Profiling failure due to CUDNN_STATUS_INTERNAL_ERROR

iz314 · November 8, 2018, 9:59am

Hi,
I’m running the Nsight Compute profiler on a CNN with Pytorch, and it fails with the following message:
“RuntimeError: cuDNN error: CUDNN_STATUS_INTERNAL_ERROR”
End of trace:
‘lib/python2.7/site-packages/torch/nn/modules/conv.py", line 313, in forward’

I tried both Tensorflow and Pytorch, on several machines. I’m using GTX-1080Ti, tried both CUDA 10.0 and 9.0, and I’m following all minimal requirements.

How can I fix this?

Thanks.

felix_dt · November 12, 2018, 9:38am

Could you please describe in more detail what exact steps or commands you tried to profile with Nsight Compute? Does your usage of Pytorch work fine when not profiling with Nsight Compute, or are you seeing issues there, too? Note also that Nsight Compute 1.0 does not support profiling child processes, so if your usage of CUDA or a CUDA-accelerated library is not directly within the process launched via Nsight Compute, you will not be able to profile it.

rbischof · March 4, 2019, 1:47am

Please note that Nsight Compute 2019.1 has been release in the CUDA Toolkit 10.1
and as a stand-alone download: [url]https://developer.nvidia.com/gameworksdownload#?dn=nsight-compute-2019-1[/url]
This version has an option to profile child processes. Let us know if this fixes your issue.

Also, as Felix mentioned, let us know if the problem goes away when you are not profiling.

Topic		Replies	Views
Nsight cuDNN error with CNN but not normal NN Profiling Linux Targets	1	540	September 26, 2023
Ncu return LaunchFailed Nsight Compute	1	613	August 25, 2023
Cannot profile CUDA kernel using NC : Run Bottleneck returned an error Nsight Compute	4	524	October 12, 2021
Python Tensorflow Windows 10 CUDA_ERROR_UNKNOWN error Nsight Compute	2	977	January 7, 2020
NSight Compute CUPTI_ERROR_MULTIPLE_SUBSCRIBERS_NOT_SUPPORTED Nsight Compute cudnn	5	1036	January 29, 2024
A100 nsight compute profiling error "cuDNN error: CUDNN_STATUS_INTERNAL_ERROR" Nsight Compute cuda	2	1519	September 20, 2021
Hitting "Profile in NVIDIA Nsight Compute" in Visual Profiler always returns error message "Nsight Compute failed to generate report" Nsight Compute	5	1048	January 17, 2022
Tracing cuDNN library version 90.6 is currently not supported Profiling Linux Targets ubuntu , pytorch , cudnn	3	50	January 28, 2025
Nsight Compute Error Jetson AGX Orin tensorrt , nsight , pytorch	12	1102	December 21, 2022
Cuda_error_invalid_context Nsight Compute	4	216	July 30, 2024

Profiling failure due to CUDNN_STATUS_INTERNAL_ERROR

Related topics