Error code 4 internal error unnamed layer

Description

I successfully converted a TensorFlow model to ONNX with tf2onnx [1]. The successful conversion was reported in the terminal.

Now I want to convert the ONNX model to a TensorRT engine. I downloaded and started the latest Docker image [2]. Then I ran trtexec [3] and got the error message “Found invalid input type of UINT8”, which I was able to fix with the help of a small script I found here [4]. So far, so good.

I ran trtexec again with the same command [3] and got another error that I am not able to solve; see the output below. What does this error mean, and how can I solve it? I have attached the model for investigation.

Thank you, and I hope you have a wonderful start to 2024.

[01/01/2024-20:07:38] [W] [TRT] onnx2trt_utils.cpp:374: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.
[01/01/2024-20:07:38] [W] [TRT] onnx2trt_utils.cpp:400: One or more weights outside the range of INT32 was clamped
[01/01/2024-20:07:38] [E] Error[4]: [graph.cpp::symbolicExecute::539] Error Code 4: Internal Error ((Unnamed Layer* 76) [LoopOutput]: an ILoopOutputLayer cannot be used to compute a shape tensor)
[01/01/2024-20:07:38] [E] [TRT] ModelImporter.cpp:771: While parsing node number 354 [Range -> "StatefulPartitionedCall/Postprocessor/BatchMultiClassNonMaxSuppression/MultiClassNonMaxSuppression/range_2:0"]:
[01/01/2024-20:07:38] [E] [TRT] ModelImporter.cpp:772: --- Begin node ---
[01/01/2024-20:07:38] [E] [TRT] ModelImporter.cpp:773: input: "StatefulPartitionedCall/Postprocessor/BatchMultiClassNonMaxSuppression/MultiClassNonMaxSuppression/range_1/start:0"
input: "StatefulPartitionedCall/Postprocessor/BatchMultiClassNonMaxSuppression/MultiClassNonMaxSuppression/Select_2:0"
input: "StatefulPartitionedCall/Postprocessor/BatchMultiClassNonMaxSuppression/MultiClassNonMaxSuppression/range_2/delta:0"
output: "StatefulPartitionedCall/Postprocessor/BatchMultiClassNonMaxSuppression/MultiClassNonMaxSuppression/range_2:0"
name: "StatefulPartitionedCall/Postprocessor/BatchMultiClassNonMaxSuppression/MultiClassNonMaxSuppression/range_2"
op_type: "Range"

[01/01/2024-20:07:38] [E] [TRT] ModelImporter.cpp:774: --- End node ---
[01/01/2024-20:07:38] [E] [TRT] ModelImporter.cpp:777: ERROR: ModelImporter.cpp:195 In function parseGraph:
[6] Invalid Node - StatefulPartitionedCall/Postprocessor/BatchMultiClassNonMaxSuppression/MultiClassNonMaxSuppression/range_2
[graph.cpp::symbolicExecute::539] Error Code 4: Internal Error ((Unnamed Layer* 76) [LoopOutput]: an ILoopOutputLayer cannot be used to compute a shape tensor)
[01/01/2024-20:07:38] [E] Failed to parse onnx file
[01/01/2024-20:07:38] [I] Finished parsing network model. Parse time: 0.0658785

[1]
python -m tf2onnx.convert --saved-model /home/playground/export_v1/saved_model/ --output /home/playground/model.onnx

[2]
sudo docker run -v /home/ubuntu/eric:/home --gpus all -it --rm nvcr.io/nvidia/tensorrt:23.12-py3 /bin/bash

[3]
trtexec --onnx=/home/playground/model.onnx --saveEngine=engine.trt

[4]

Environment

TensorRT Version: 8.6.1
GPU Type:
Nvidia Driver Version:
CUDA Version:
CUDNN Version:
Operating System + Version:
Python Version (if applicable): 3.10.12
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):
PIP packages

Package Version


appdirs 1.4.4
graphsurgeon 0.4.6
install 1.3.5
Mako 1.3.0
MarkupSafe 2.1.3
numpy 1.23.5
nvidia-pyindex 1.0.9
onnx 1.15.0
onnx-graphsurgeon 0.3.27
Pillow 10.1.0
pip 23.3.1
platformdirs 4.0.0
polygraphy 0.49.3
protobuf 4.25.1
pycuda 2023.1
pytools 2023.1.1
setuptools 59.6.0
tensorrt 8.6.1
typing_extensions 4.8.0
uff 0.6.9
wheel 0.37.1

Relevant Files


Steps To Reproduce


Hi @eric_langner,
You may want to consider the pointers below:

  1. Please create the network with the EXPLICIT_BATCH flag.
  2. The input to FullyConnected layers is expected to have at least 4 dimensions.
    Please refer to the link:
    onnx-tensorrt/README.md at main · onnx/onnx-tensorrt · GitHub
    Thanks

Hi @AakankshaS

This means I would have to retrain my network. I am wondering whether there is also a solution that does not require retraining the network?

I couldn’t get any further at this point and solved my problem in another way.
Thank you.

Hi Eric,

How did you solve it in another way? I’m getting a similar error.

Would be much appreciated!
Tim

@timf34 Unfortunately, I didn’t get any further at that point. To avoid the error, I tried modifying the network, without success (I have no experience with that). After discussing it with my colleagues, we concluded that it could possibly have been a buggy version of TensorRT (as far as I can remember). I hope that helps.

1 Like

Thanks for replying Eric! I managed to solve my issue thankfully!

FYI for anyone else who sees this: my fix involved isolating the problematic line of code in the network definition by looking at the verbose output when creating the ONNX model, and then using GPT-4 and online forums to help replace that line with something suitable.

@timf34 From your comment I understand that you replaced the problematic layer in your network before training and then trained it with this change, right? I have the same issue, except I don’t have the chance to modify the model before training, so it is crucial for me to understand whether this is even possible. Thanks!

Hi, yes, you’re correct. I had to modify the layers used in the model (the torch layers).
Best,
Tim

1 Like