Got Cudnn Error in executeConv: 3 (CUDNN_STATUS_BAD_PARAM)

877437674 · February 18, 2022, 10:45am

Inferencing with a super simple model but got an error when trying to inference in C++

I have some PyTorch neural networks and I convert them to tensorRT .plan file, I successfully deploy the model to Triton Server, then I write a C++ program to run an inference with an HTTP request.
The problem is, I can successfully convert and run some models like YoloV5, but when I try to do the same thing to a very simple model, I got an error on the triton server side:
E0218 09:59:10.719600 1 logging.cc:43] …/rtSafe/cuda/cudaConvolutionRunner.cpp (483) - Cudnn Error in executeConv: 3 (CUDNN_STATUS_BAD_PARAM)
E0218 09:59:10.724232 1 logging.cc:43] FAILED_EXECUTION: std:exception

I convert the model multiple times, then I found the error does not occur every time, it occurs most of the time, and the inference works sometimes.

The model visualization, model onnx file, and full log of Triton Server is attached below

Environment

TensorRT Version: 7.2.1.6
GPU Type: NVIDIA Corporation Device 1f82 (rev a1), TU117 [GeForce GTX 1650]
Nvidia Driver Version: 470.57.02
CUDA Version: 11.1 but nvidia-smi shows 11.4
CUDNN Version: 8.0.4
Operating System + Version: Ubuntu 18.04
Python Version (if applicable): 3.6
TensorFlow Version (if applicable): not used
PyTorch Version (if applicable): 1.8.0+cu111
** python library ONNX version **: 1.6.0
** python library onnxruntime-gpu version **: 1.4.0
Triton Server: nvcr.io/nvidia/tritonserver:20.10-py3

Relevant Files

visualization:
simplesimple

onnx file:
test.onnx (804 Bytes)

full triton server log:
full_log.txt (23.4 KB)

Steps To Reproduce

You can just define a model in PyTorch like the one above with the above environment, I did not train it, just use the initial weights, export it to onnx model.
You can also directly use the onnx model I attached here
Use the official trtexec file in TensorRT-7.2.1.6, or write python code to convert the onnx model to tensorRT plan.
Deploy the model in triton server 20.10
Write C++ code to send HTTP inference request to the triton server.

Can anyone solve my problem? Thank you.

spolisetty · February 21, 2022, 10:54am

Hi,

Looks like you’re using an old version of the TensorRT, We recommend you please install the latest version of the TensorRT 8.4 EA and try again. If you still face the issue please let us know.
https://developer.nvidia.com/nvidia-tensorrt-8x-download

Thank you.

Topic		Replies	Views
../rtSafe/cuda/cudaConvolutionRunner.cpp (483) - Cudnn Error in executeConv: 3 (CUDNN_STATUS_BAD_PARAM) TensorRT	3	705	November 2, 2022
Error Code 1: Cudnn (CUDNN_STATUS_EXECUTION_FAILED) TensorRT cuda	3	2181	May 31, 2022
PyTorch FCN-ResNet50 --> ONNX --> TensorRT TensorRT	3	980	February 17, 2022
ONNX fails to load TensorRT cudnn , inference-server-triton	2	146	September 23, 2024
Error occurred while running the Tensorrt samples: [reformat.cpp::executeCutensor::385] TensorRT tensorrt	3	1194	December 12, 2023
safeContext.cpp (184) - Cudnn Error in configure: 7 (CUDNN_STATUS_MAPPING_ERROR) TensorRT	10	3564	July 9, 2021
:nvinfer1::rt::ExecutionContext::enqueueInternal::330, condition: bindings[x] != nullptr TensorRT tensorrt	1	1883	February 15, 2022
CUDNN_STATUS_BAD_PARAM when infer with dynamic shape TensorRT	4	1501	June 25, 2021
Error during inference TensorRT	7	971	October 12, 2021
Convet onnx to trt engine got error TensorRT	3	1198	January 7, 2022

Got Cudnn Error in executeConv: 3 (CUDNN_STATUS_BAD_PARAM)

Inferencing with a super simple model but got an error when trying to inference in C++

Environment

Relevant Files

Steps To Reproduce

Related topics