Reproducible step-by-step ONNX to TensorRT issue: Unsupported ONNX data type: UINT8

Aspects of this issue have been seen here, here, here, and here, but there is no clear solution.

To try to push for a solution, here is a step-by-step reproducible example, so that interested parties can identify the issue.

We start with a TensorFlow saved_model.pb and export it to run in TensorRT. We download an official TensorFlow model and convert it with tf2onnx.convert. The steps:

$ wget http://download.tensorflow.org/models/object_detection/tf2/20200711/ssd_mobilenet_v2_fpnlite_640x640_coco17_tpu-8.tar.gz
$ tar xvfz ssd_mobilenet_v2_fpnlite_640x640_coco17_tpu-8.tar.gz
$ python3 -m tf2onnx.convert --saved-model ssd_mobilenet_v2_fpnlite_640x640_coco17_tpu-8/saved_model/ --opset 11 --output ssd_model.onnx
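As a quick sanity check (a minimal sketch, assuming the onnx Python package is installed), we can print the data type of each graph input of the exported model; for this model the image input comes out as UINT8, which is what the TensorRT parser rejects below:

import onnx

model = onnx.load("ssd_model.onnx")
for inp in model.graph.input:
    # Print the input name and its ONNX element type, e.g. UINT8.
    elem_type = inp.type.tensor_type.elem_type
    print(inp.name, onnx.TensorProto.DataType.Name(elem_type))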

To run inference, I make a very slight adaptation of the ONNX ResNet50 example found at /usr/src/tensorrt/samples/python/introductory_parser_samples/onnx_resnet50.py

All of the files are available at this gist I’m providing. For this next step we’ll need the onnx_to_tensorrt.py file, as well as common.py, coco_labels.txt and data_processing.py.
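For context, the adaptation is essentially the sample's engine-building routine pointed at the SSD model. A rough sketch of that part, using the TensorRT Python API (the exact code is in the gist):

import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

def build_engine(onnx_path):
    # The ONNX parser requires an explicit-batch network definition.
    builder = trt.Builder(TRT_LOGGER)
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    parser = trt.OnnxParser(network, TRT_LOGGER)
    config = builder.create_builder_config()
    config.max_workspace_size = 1 << 30  # 1 GiB

    with open(onnx_path, "rb") as f:
        if not parser.parse(f.read()):
            # This is where "Unsupported ONNX data type: UINT8" surfaces.
            for i in range(parser.num_errors):
                print(parser.get_error(i))
            return None

    return builder.build_engine(network, config)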

Run $ python3 onnx_to_tensorrt.py 'ssd_model.onnx'. You’ll see the error: Unsupported ONNX data type: UINT8.

Following this official advice from NVIDIA, we can sidestep the issue by running python3 graph_surgeon.py (this file is also in the gist). Ensure you install ONNX GraphSurgeon according to the linked instructions. This should, apparently, fix our incorrect data types.
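For reference, the core of such a script is just rewriting the input tensor's data type. Something along these lines, using onnx-graphsurgeon (a sketch; the gist's graph_surgeon.py may differ in details):

import numpy as np
import onnx
import onnx_graphsurgeon as gs

graph = gs.import_onnx(onnx.load("ssd_model.onnx"))

# Assumption: the only UINT8 tensor we need to touch is the image input.
for inp in graph.inputs:
    if inp.dtype == np.uint8:
        inp.dtype = np.float32

onnx.save(gs.export_onnx(graph), "updated_ssd_model.onnx")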

We can verify this by running again with the updated model produced by GraphSurgeon.

Run: python3 onnx_to_tensorrt.py 'updated_ssd_model.onnx'.

However, it seems that the GraphSurgeon approach didn't completely solve the issue, and you will see a new error:

Beginning ONNX file parsing
[TensorRT] WARNING: onnx2trt_utils.cpp:220: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.
[TensorRT] WARNING: onnx2trt_utils.cpp:246: One or more weights outside the range of INT32 was clamped
[TensorRT] ERROR: INVALID_ARGUMENT: getPluginCreator could not find plugin NonMaxSuppression version 1

This error also occurs with other, proprietary models I have tried; this publicly available model shows the same behaviour.

I have no solution beyond this point. I have tried converting my model to UFF and failed, and I have tried converting it to ONNX and have failed along this path as well. Does anyone have any thoughts?

Hi,

[TensorRT] ERROR: INVALID_ARGUMENT: getPluginCreator could not find plugin NonMaxSuppression version 1

This error indicates that TensorRT doesn't support the NonMaxSuppression operation, which is used in your TensorFlow model.
So you will need to add support for it to TensorRT with our plugin API, documented below:
https://docs.nvidia.com/deeplearning/tensorrt/api/c_api/classnvinfer1_1_1_i_plugin.html
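As a quick check, you can also list the plugin creators registered in your TensorRT installation; if none of them covers NonMaxSuppression, the ONNX parser fails exactly as above. A minimal sketch with the TensorRT Python bindings:

import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

# Register the plugins that ship with TensorRT, then list what is available.
trt.init_libnvinfer_plugins(TRT_LOGGER, "")
registry = trt.get_plugin_registry()
for creator in registry.plugin_creator_list:
    print(creator.name, creator.plugin_version)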

Thanks.

Thanks for the help! I'm not that familiar with TensorRT yet, so I didn't realise that this meant there was an unsupported operation.

I guess TF-TRT is probably the way to go here, if I don’t have the development budget to add the operation?

I think I've seen some information suggesting that TF-TRT has a performance penalty relative to pure TensorRT? Is there any data or info on that?

Hi,

If the plugin is not an option, you can give TF-TRT a try.
But please note that the set of layers TensorRT supports in TF-TRT differs from standalone TensorRT.
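A minimal TF-TRT conversion of the same saved model looks roughly like this (a sketch for TensorFlow 2.x; the output directory name is just an example):

from tensorflow.python.compiler.tensorrt import trt_convert as trt

converter = trt.TrtGraphConverterV2(
    input_saved_model_dir="ssd_mobilenet_v2_fpnlite_640x640_coco17_tpu-8/saved_model")
converter.convert()

# Unsupported ops (such as NonMaxSuppression) stay in TensorFlow;
# only the supported subgraphs are replaced by TensorRT engines.
converter.save("ssd_trt_saved_model")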

You can find the list in the document below:

Thanks.