Error Code 1: Cudnn (CUDNN_STATUS_EXECUTION_FAILED)

779660843 · May 31, 2022, 2:43am

Description

I want to try the TensorRT C++ implementation of ByteTrack on Windows. However, the project only documents a Linux workflow, so I converted its model to ONNX and then converted the ONNX file to a TensorRT (TRT) engine with the trtexec command.

trtexec.exe --onnx=bytetrack.onnx --saveEngine=bytetrack.engine --workspace=16384 --buildOnly --explicitBatch --verbose
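For repeatable builds, the same invocation can be scripted. The following is a minimal sketch, assuming trtexec.exe is on PATH; the helper names build_trtexec_cmd and run_build are hypothetical, and only the flags already shown in the command above are used.

```python
import subprocess


def build_trtexec_cmd(onnx_path, engine_path, workspace_mb=16384,
                      verbose=True, exe="trtexec.exe"):
    """Return the trtexec argument list from this post (hypothetical helper)."""
    cmd = [
        exe,
        f"--onnx={onnx_path}",
        f"--saveEngine={engine_path}",
        f"--workspace={workspace_mb}",
        "--buildOnly",
        "--explicitBatch",
    ]
    if verbose:
        cmd.append("--verbose")
    return cmd


def run_build(cmd, log_path="verbose.txt"):
    """Run the build and write stdout and stderr to one log file."""
    with open(log_path, "w") as log:
        return subprocess.run(cmd, stdout=log, stderr=subprocess.STDOUT).returncode


if __name__ == "__main__":
    # Print the command only; call run_build(cmd) to actually start the build.
    cmd = build_trtexec_cmd("bytetrack.onnx", "bytetrack.engine")
    print(" ".join(cmd))
```

Writing the output to a file keeps the verbose log available for later inspection or sharing.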

But when I try to run inference with the project, which I built in VS2019 from CMakeLists.txt as the tutorial describes, I encounter the error shown in the following figure. How can I fix it?

Environment

TensorRT Version: 8.2.4.2
GPU Type: RTX
Nvidia Driver Version: 511.65
CUDA Version: 11.6
CUDNN Version: 8.4.0.27
Operating System + Version: Windows 10
Visual Studio: 2019

NVES · May 31, 2022, 3:07am

Hi,
Please share the ONNX model and the script, if you have not already, so that we can assist you better.
Meanwhile, you can try a few things:

1) Validate your model with the snippet below.

check_model.py

import sys
import onnx
filename = sys.argv[1]  # path to your ONNX model
model = onnx.load(filename)
onnx.checker.check_model(model)  # raises an exception if the model is invalid

2) Try running your model with the trtexec command.

If you are still facing the issue, please share the trtexec "--verbose" log for further debugging.
Thanks!

779660843 · May 31, 2022, 3:38am

Thanks for answering!

My results are below.

1. ONNX model

: The model is too large to upload, so I am only showing the ByteTrack-ONNXRuntime Python script used to export it.

from loguru import logger

import torch
from torch import nn

from yolox.exp import get_exp
from yolox.models.network_blocks import SiLU
from yolox.utils import replace_module

import argparse
import os


def make_parser():
    parser = argparse.ArgumentParser("YOLOX onnx deploy")
    parser.add_argument(
        "--output-name", type=str, default="bytetrack_s.onnx", help="output name of models"
    )
    parser.add_argument(
        "--input", default="images", type=str, help="input node name of onnx model"
    )
    parser.add_argument(
        "--output", default="output", type=str, help="output node name of onnx model"
    )
    parser.add_argument(
        "-o", "--opset", default=11, type=int, help="onnx opset version"
    )
    parser.add_argument("--no-onnxsim", action="store_true", help="use onnxsim or not")
    parser.add_argument(
        "-f",
        "--exp_file",
        default=None,
        type=str,
        help="experiment description file",
    )
    parser.add_argument("-expn", "--experiment-name", type=str, default=None)
    parser.add_argument("-n", "--name", type=str, default=None, help="model name")
    parser.add_argument("-c", "--ckpt", default=None, type=str, help="ckpt path")
    parser.add_argument(
        "opts",
        help="Modify config options using the command-line",
        default=None,
        nargs=argparse.REMAINDER,
    )

    return parser


@logger.catch
def main():
    args = make_parser().parse_args()
    logger.info("args value: {}".format(args))
    exp = get_exp(args.exp_file, args.name)
    exp.merge(args.opts)

    if not args.experiment_name:
        args.experiment_name = exp.exp_name

    model = exp.get_model()
    if args.ckpt is None:
        file_name = os.path.join(exp.output_dir, args.experiment_name)
        ckpt_file = os.path.join(file_name, "best_ckpt.pth.tar")
    else:
        ckpt_file = args.ckpt

    # load the model state dict
    ckpt = torch.load(ckpt_file, map_location="cpu")

    model.eval()
    if "model" in ckpt:
        ckpt = ckpt["model"]
    model.load_state_dict(ckpt)
    model = replace_module(model, nn.SiLU, SiLU)
    model.head.decode_in_inference = False

    logger.info("loading checkpoint done.")
    dummy_input = torch.randn(1, 3, exp.test_size[0], exp.test_size[1])
    torch.onnx._export(
        model,
        dummy_input,
        args.output_name,
        input_names=[args.input],
        output_names=[args.output],
        opset_version=args.opset,
    )
    logger.info("generated onnx model named {}".format(args.output_name))

    if not args.no_onnxsim:
        import onnx

        from onnxsim import simplify

        # use onnxsim to remove redundant nodes from the model.
        onnx_model = onnx.load(args.output_name)
        model_simp, check = simplify(onnx_model)
        assert check, "Simplified ONNX model could not be validated"
        onnx.save(model_simp, args.output_name)
        logger.info("generated simplified onnx model named {}".format(args.output_name))


if __name__ == "__main__":
    main()
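Because the script exports with a fixed dummy input of shape (1, 3, H, W), the resulting engine expects exactly that input shape, and the input buffer allocated by the C++ inference code should match it. The sketch below computes the expected float32 buffer size so it can be cross-checked against the C++ side. The helper names are hypothetical and the (608, 1088) size is only an assumed example, not a value confirmed in this thread; whether a size mismatch is involved in this particular error is also not established here.

```python
from math import prod

FLOAT32_BYTES = 4  # torch.randn produces float32, so the exported input is float32


def nchw_shape(test_size, batch=1, channels=3):
    """Shape of the dummy input built by the export script: (1, 3, H, W)."""
    height, width = test_size
    return (batch, channels, height, width)


def buffer_bytes(shape, elem_bytes=FLOAT32_BYTES):
    """Bytes needed for a dense tensor of the given shape."""
    return prod(shape) * elem_bytes


if __name__ == "__main__":
    # (608, 1088) is an assumed test_size for illustration only;
    # read the real value from exp.test_size in your exp file.
    shape = nchw_shape((608, 1088))
    print(shape, buffer_bytes(shape), "bytes")
```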

2. check_model.py

: My ONNX model passes onnx.checker.check_model(model) without errors.

3. trtexec log

: This is my trtexec output with the --verbose option.
verbose.txt (2.3 MB)
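A verbose log of this size is hard to read by eye, so it can help to pull out only the warning and error lines first. This is a minimal sketch, assuming the log marks severities with bracketed tags such as [W] and [E]; the function names are hypothetical, and the pattern should be adjusted if your log uses a different layout.

```python
import re
import sys
from collections import Counter

# Assumed severity tags: [E] for errors, [W] for warnings.
SEVERITY = re.compile(r"\[(E|W)\]")


def filter_log(lines):
    """Yield (severity, line) for every line tagged [E] or [W]."""
    for line in lines:
        match = SEVERITY.search(line)
        if match:
            yield match.group(1), line.rstrip("\n")


def summarize(lines):
    """Return (Counter of severities, list of matching (severity, line) pairs)."""
    hits = list(filter_log(lines))
    counts = Counter(severity for severity, _ in hits)
    return counts, hits


if __name__ == "__main__":
    # Usage: python filter_log.py verbose.txt
    if len(sys.argv) > 1:
        with open(sys.argv[1], encoding="utf-8", errors="replace") as fh:
            counts, hits = summarize(fh)
        print(dict(counts))
        for _, line in hits:
            print(line)
```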

spolisetty · May 31, 2022, 2:35pm

Hi,

This issue appears to be related to the environment setup.
Please make sure CUDA and TensorRT are set up correctly.
Here is a similar issue for your reference:

github.com/NVIDIA/TensorRT

[TensorRT] ERROR: ../rtSafe/cuda/cudaConvolutionRunner.cpp (303) - Cudnn Error in execute: 8 (CUDNN_STATUS_EXECUTION_FAILED)

opened 05:10PM - 04 Jan 20 UTC

closed 05:39PM - 04 Jan 20 UTC

Hsintao

## Description

## Environment

**TensorRT Version**: 6.0.1.5
**GPU T…ype**: 2080ti
**Nvidia Driver Version**: 418.67
**CUDA Version**: 10.0
**CUDNN Version**: 7.6.x
**Operating System + Version**: ubuntu16.04
**Python Version (if applicable)**:
**TensorFlow Version (if applicable)**:
**PyTorch Version (if applicable)**:
**Baremetal or Container (if container which image + tag)**:

## Relevant Files

## Steps To Reproduce

sorry to disturb. im just learning to use the tensorrt. i build the tensorrt successfully and can run the sample program. and i also can convert my onnx model to trt engine with any error. but when i try to do inference with the .trt, i get the output beyond my expection. with the error `[TensorRT] ERROR: ../rtSafe/cuda/cudaConvolutionRunner.cpp (303) - Cudnn Error in execute: 8 (CUDNN_STATUS_EXECUTION_FAILED)`. i have try cudnn7.6.0~7.6.5 and i havent address the reason.

here is part of my code `input_name = ['input'] output_name = ['output'] input = Variable(torch.randn(1, 3, 224, 224)).cuda() model = torchvision.models.resnet50(pretrained=True).cuda() torch.onnx.export(model, input, 'alexnet.onnx', input_names=input_name, output_names=output_name, verbose=True)`

then `onnx2trt resnet50.onnx -o resnet50.onnx -b 1`

can u give me some suggestion?

Thank you.
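One way to start checking the environment on Windows is to see which copies of the CUDA, cuDNN, and TensorRT DLLs are reachable through PATH, since several installed versions can shadow one another. The sketch below is only a starting point: the DLL names are assumptions for CUDA 11.x, cuDNN 8.x, and TensorRT 8.x, the helper find_on_path is hypothetical, and PATH is only one part of the Windows DLL search order.

```python
import os

# Assumed DLL names for CUDA 11.x, cuDNN 8.x and TensorRT 8.x on Windows;
# adjust them to match your installation.
DEFAULT_DLLS = ("cudart64_110.dll", "cudnn64_8.dll", "nvinfer.dll")


def find_on_path(names, path_value=None):
    """Map each file name to every PATH directory containing it, in search order."""
    if path_value is None:
        path_value = os.environ.get("PATH", "")
    found = {name: [] for name in names}
    for directory in path_value.split(os.pathsep):
        if not directory:
            continue
        for name in names:
            candidate = os.path.join(directory, name)
            if os.path.isfile(candidate) and directory not in found[name]:
                found[name].append(directory)
    return found


if __name__ == "__main__":
    for name, dirs in find_on_path(DEFAULT_DLLS).items():
        if not dirs:
            print(f"{name}: not found on PATH")
            continue
        print(f"{name}: {dirs[0]}")
        for extra in dirs[1:]:
            # A second copy later on PATH may indicate mixed versions.
            print(f"  also in: {extra}")
```

If a DLL shows up in more than one directory, it is worth confirming that the first match belongs to the versions the engine was built with.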
