Yolov5 Engine Inference error

y14uc339 · October 10, 2021, 2:51pm

Description

A clear and concise description of the bug or issue.

Environment

TensorRT Version: 8.0.3
GPU Type: dGPU
Nvidia Driver Version: 470
CUDA Version: 11.4
CUDNN Version: 8.2.1
Operating System + Version: Ubuntu 20.04
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag): nvcr.io/nvidia/tensorrt:21.09-py3

Relevant Files

Please attach or include links to any models, data, files, or scripts necessary to reproduce your issue. (Github repo, Google Drive, Dropbox, etc.)

Steps To Reproduce

Please include:

Exact steps/commands to build your repro
Exact steps/commands to run your repro
Full traceback of errors encountered

Generated python tensorrt engine from BlueMirrors/Yolov5-TensorRT. Correct results with TensorRT python.
Desrialized python engine in C++. Runs perfectly fine. Successful deserialization.
When I run inference on the deserialized TensorRT engine it throws this error.

C++ Code to deserialize python TensorRT engine:
CMakeLists.txt (1003 Bytes)
inference.cpp (9.2 KB)
logging.h (16.4 KB)
utils.hpp (4.8 KB)

Error when running inference:


   [10/10/2021-10:52:04] [I] [TRT] [MemUsageChange] Init CUDA: CPU +328, GPU +0, now: CPU 363, GPU 204 (MiB)
[10/10/2021-10:52:04] [I] [TRT] Loaded engine size: 24 MB
[10/10/2021-10:52:04] [I] [TRT] [MemUsageSnapshot] deserializeCudaEngine begin: CPU 363 MiB, GPU 204 MiB
[10/10/2021-10:52:05] [I] [TRT] [MemUsageChange] Init cuBLAS/cuBLASLt: CPU +498, GPU +214, now: CPU 869, GPU 436 (MiB)
[10/10/2021-10:52:05] [I] [TRT] [MemUsageChange] Init cuDNN: CPU +169, GPU +204, now: CPU 1038, GPU 640 (MiB)
[10/10/2021-10:52:05] [I] [TRT] [MemUsageChange] Init cuBLAS/cuBLASLt: CPU +0, GPU +0, now: CPU 855, GPU 422 (MiB)
[10/10/2021-10:52:05] [I] [TRT] [MemUsageSnapshot] deserializeCudaEngine end: CPU 855 MiB, GPU 422 MiB
deserialized engine successfully.
[10/10/2021-10:52:05] [I] [TRT] [MemUsageSnapshot] ExecutionContext creation begin: CPU 830 MiB, GPU 422 MiB
[10/10/2021-10:52:06] [I] [TRT] [MemUsageChange] Init cuBLAS/cuBLASLt: CPU +184, GPU +210, now: CPU 1014, GPU 632 (MiB)
[10/10/2021-10:52:06] [I] [TRT] [MemUsageChange] Init cuDNN: CPU +0, GPU +8, now: CPU 1014, GPU 640 (MiB)
[10/10/2021-10:52:06] [I] [TRT] [MemUsageSnapshot] ExecutionContext creation end: CPU 1014 MiB, GPU 668 MiB
0
1
[10/10/2021-10:52:06] [E] [TRT] 1: [reformat.cpp::executeCutensor::384] Error Code 1: CuTensor (Internal cuTensor permutate execute failed)
Cuda failure: 700
Aborted (core dumped)

NVES · October 11, 2021, 2:36am

Hi,
Can you try running your model with trtexec command, and share the “”–verbose"" log in case if the issue persist
https://github.com/NVIDIA/TensorRT/tree/master/samples/opensource/trtexec

You can refer below link for all the supported operators list, in case any operator is not supported you need to create a custom plugin to support that operation

github.com

onnx/onnx-tensorrt/blob/main/docs/operators.md

<!--- SPDX-License-Identifier: Apache-2.0 -->

# Supported ONNX Operators

TensorRT 8.4 supports operators up to Opset 17. Latest information of ONNX operators can be found [here](https://github.com/onnx/onnx/blob/master/docs/Operators.md)

TensorRT supports the following ONNX data types: DOUBLE, FLOAT32, FLOAT16, INT8, and BOOL

> Note: There is limited support for INT32, INT64, and DOUBLE types. TensorRT will attempt to cast down INT64 to INT32 and DOUBLE down to FLOAT, clamping values to `+-INT_MAX` or `+-FLT_MAX` if necessary.

See below for the support matrix of ONNX operators in ONNX-TensorRT.

## Operator Support Matrix

| Operator                  | Supported  | Supported Types | Restrictions                                                                                                           |
|---------------------------|------------|-----------------|------------------------------------------------------------------------------------------------------------------------|
| Abs                       | Y          | FP32, FP16, INT32 |
| Acos                      | Y          | FP32, FP16 |
| Acosh                     | Y          | FP32, FP16 |
| Add                       | Y          | FP32, FP16, INT32 |

This file has been truncated. show original

Also, request you to share your model and script if not shared already so that we can help you better.

Meanwhile, for some common errors and queries please refer to below link:

Thanks!

y14uc339 · November 4, 2021, 2:57pm

NVES:

> Note: There is limited support for INT32, INT64, and DOUBLE types. TensorRT will attempt to cast down INT64 to INT32 and DOUBLE down to FLOAT, clamping values to `+-INT_MAX` or `+-FLT_MAX` if necessary.

See below for the support matrix of ONNX operators in ONNX-TensorRT.

I am trying to run the engine generated with python API to work in c++, this works just fine in python but throws the above error in C++.
Sharing the model … c++ code is shared above already. PLease have a look if you can helpyolov5s.engine (23.9 MB)

trild-vietnam · May 6, 2022, 10:27am

Any update on this topic?

Topic		Replies	Views
Tensorrt inference error TensorRT	2	535	November 19, 2020
Unable to perform inference using yolov3 in tensorrt samples TensorRT	1	885	October 14, 2019
Python serialized TensorRT engine output wrong data at TensorRT C++ runtime TensorRT	4	1085	April 20, 2020
error while using TensorRT TensorRT	1	1302	January 10, 2020
Inference error at engine.cpp::enqueue::293 TensorRT	4	2388	January 31, 2019
TensorRT-7.1.3.4 Deserialize the cuda engine failed TensorRT cuda	9	8344	March 28, 2024
Help with TensorRT errors when building an engine TensorRT cudnn	2	251	February 22, 2025
Error Code 1: Cask (Cask convolution execution) TensorRT tensorrt , cuda	3	2073	March 4, 2024
Error Code 1: Serialization (Serialization assertion magicTagRead == magicTag failed.Magic tag does not match) [pp_infer-1] trt_infer: 4: [runtime.cpp TensorRT	1	102	February 1, 2025
Yolov8 Model Crashing TensorRT TensorRT tensorrt , cuda , yolo , cudnn	1	1087	January 29, 2024