Use pre-trained TF2 object detection models with TensorRT via ONNX

Description

Hi,

I’m trying to use a pre-trained object detection model from the TF2 Model Zoo with TensorRT, but I’m stuck due to errors during engine building.
In particular, I would like to convert the TF2 saved_model.pb to ONNX format, optimize it with TensorRT, and perform inference with the TensorRT engine.

I followed these steps:

  1. Download model from TF2 Model Zoo: Faster R-CNN ResNet50 V1 640x640

  2. Convert saved_model.pb to ONNX with this command:
    python3 -m tf2onnx.convert --saved-model resnet/saved_model/ --opset 13 --output resnet.onnx

  3. Check the ONNX file with this code:
    import onnx

    model_path = "resnet.onnx"  # the ONNX file exported in step 2
    try:
        onnx_model = onnx.load(model_path)
        onnx.checker.check_model(onnx_model)
        print("Model is valid")
    except Exception as e:
        print(e)

  4. Use buildEngine.py to build the TRT engine. I used the code posted here: NVIDIA Tutorial (TensorFlow 2 code example); a rough sketch of what that script does is included right after this list.
    I ran the command:
    python3 buildEngine.py --onnx_file resnet.onnx
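
For context, this is not the tutorial's exact buildEngine.py, just a minimal sketch of the TensorRT 7.x engine-building flow it follows (paths and workspace size are placeholder assumptions):

    import tensorrt as trt

    TRT_LOGGER = trt.Logger(trt.Logger.WARNING)
    EXPLICIT_BATCH = 1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)

    def build_engine(onnx_path):
        # Create builder, explicit-batch network and ONNX parser (TensorRT 7.x API)
        with trt.Builder(TRT_LOGGER) as builder, \
             builder.create_network(EXPLICIT_BATCH) as network, \
             trt.OnnxParser(network, TRT_LOGGER) as parser:
            builder.max_workspace_size = 1 << 30  # 1 GB workspace
            # Parse the ONNX file and print parser errors if parsing fails
            with open(onnx_path, "rb") as f:
                if not parser.parse(f.read()):
                    for i in range(parser.num_errors):
                        print(parser.get_error(i))
                    return None
            return builder.build_cuda_engine(network)

    engine = build_engine("resnet.onnx")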

On the first run I got this error:
Unsupported ONNX data type: UINT8

Then I used the solution proposed here, which amounts to changing the graph input data type from UINT8 to FLOAT.
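
A minimal ONNX-GraphSurgeon sketch of that kind of input-dtype change (file names match the ones used in this thread, but this is not necessarily the exact script from the linked post):

    import numpy as np
    import onnx
    import onnx_graphsurgeon as gs

    # Load the graph and retype the (uint8) image input to float32
    graph = gs.import_onnx(onnx.load("resnet.onnx"))
    for inp in graph.inputs:
        inp.dtype = np.float32
    onnx.save(gs.export_onnx(graph), "resnet_f32.onnx")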

So I ran again:
python3 buildEngine.py --onnx_file resnet_f32.onnx

and now I have this error:

[TensorRT] WARNING: onnx2trt_utils.cpp:220: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.
[TensorRT] WARNING: onnx2trt_utils.cpp:246: One or more weights outside the range of INT32 was clamped
[TensorRT] WARNING: onnx2trt_utils.cpp:246: One or more weights outside the range of INT32 was clamped
Traceback (most recent call last):
  File "buildEngine.py", line 31, in <module>
    main(args)
  File "buildEngine.py", line 22, in main
    engine = eng.build_engine(onnx_path, shape=shape)
  File "engine.py", line 16, in build_engine
    parser.parse(model.read())
IndexError: Attribute not found: axes

So my questions:

  • How can I solve this?

  • Is it possible to use TF2 object detection pre-trained models with TensorRT?

Thank you very much for support.

Environment

TensorRT Version: 7.2.2.3
GPU Type: NVIDIA GeForce RTX 2070 SUPER, 8GB
Nvidia Driver Version: 460.73.01
CUDA Version: 11.0.3
CUDNN Version: 8.0.5.39
Operating System + Version: Ubuntu 20.04.2 LTS
Python Version (if applicable): 3.8
ONNX Version: 1.9.0
ONNX-GraphSurgeon Version: 0.2.6
TF2ONNX Version: 1.8.4
Baremetal or Container (if container which image + tag): Baremetal

Hi @cint.lorenzo,

We request you to share an ONNX model and script that reproduce the issue, so we can try them on our end for better assistance.
Meanwhile, we also recommend that you try trtexec.
For your reference,
https://github.com/NVIDIA/TensorRT/tree/master/samples/opensource/trtexec

Thank you.

Hi @spolisetty, and thanks for the reply.

Here you can download the ONNX model:
https://we.tl/t-mekUpBBN8N

Here is the script from the NVIDIA tutorial:
buildEngine.py (897 Bytes)

You can run it with:
python3 buildEngine.py --onnx_file resnet50.onnx

I also tried trtexec, with this command:
trtexec --onnx=resnet50.onnx
and I got the same error.

Here is the trtexec log:
trtexec_log.txt (3.9 KB)

Thank you very much.

Hi @cint.lorenzo,

We could reproduce the same error. It looks like you are using opset 13, which is currently unsupported by TRT 7.2.2.3. Can you try exporting your model with a lower opset (e.g. opset 11)?

We also recommend that you check out onnx-simplifier.
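
For reference, a minimal way to run onnx-simplifier from Python (file names are placeholders):

    import onnx
    from onnxsim import simplify

    model = onnx.load("resnet_opset11.onnx")
    # Simplify the graph (constant folding, shape inference, redundant node removal)
    model_simp, check = simplify(model)
    assert check, "Simplified ONNX model could not be validated"
    onnx.save(model_simp, "resnet_opset11_sim.onnx")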

Thank you.

Hi @spolisetty and thanks again for your support.

I tried to export the model with opset 11 and now I get these errors when running buildEngine.py:

[TensorRT] WARNING: onnx2trt_utils.cpp:220: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.
[TensorRT] WARNING: onnx2trt_utils.cpp:246: One or more weights outside the range of INT32 was clamped
[TensorRT] WARNING: onnx2trt_utils.cpp:246: One or more weights outside the range of INT32 was clamped
[TensorRT] ERROR: Network must have at least one output
[TensorRT] ERROR: Network validation failed.

I also tried onnx-simplifier, but there I got an error as well:

onnx.onnx_cpp2py_export.checker.ValidationError: Nodes in a graph must be topologically sorted, however input '__inference_Preprocessor_ResizeToRange_cond_false_13101_567_const_zero__42:0' of node:
name: OpType: Slice is not output of any previous nodes.
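
For reference, nodes can be re-sorted topologically with ONNX-GraphSurgeon before simplifying; a minimal sketch (I have not verified that it resolves this particular model):

    import onnx
    import onnx_graphsurgeon as gs

    graph = gs.import_onnx(onnx.load("resnet_opset11.onnx"))
    # Re-sort nodes so every node's inputs are produced before they are used,
    # then drop any dangling nodes/tensors
    graph.toposort()
    graph.cleanup()
    onnx.save(gs.export_onnx(graph), "resnet_opset11_sorted.onnx")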

Thank you very much.

Hi @cint.lorenzo,

Could you please share the new ONNX model (generated with opset 11) with us, so we can try it on our end for better assistance.

Thank you.

Hi @spolisetty,

here you can find the ONNX model generated with opset 11:
https://we.tl/t-ZAiEyftqDD

Thank you again for your support.

Hi @cint.lorenzo,

Sorry for the delayed response. We recommend that you try Polygraphy, and also that you use the latest TensorRT version, 8.0 EA.
Please run polygraphy surgeon sanitize --fold-constants model.onnx -o folded.onnx before running trtexec.
Let us know if you still face this issue.

Thank you.

Hi @spolisetty and thank you again for your support.

Unfortunately, I still get errors.
I downloaded TensorRT version 8.0 EA and applied Polygraphy as suggested.

This is the trtexec output on the folded model:
trtexec_log.txt (6.0 KB)

Here you can find the model:
folded_fix.onnx

These are the steps to reproduce:

  1. Original model: http://download.tensorflow.org/models/object_detection/tf2/20200711/faster_rcnn_resnet50_v1_1024x1024_coco17_tpu-8.tar.gz

  2. Conversion to ONNX with:
    python3 -m tf2onnx.convert --saved-model resnet/saved_model/ --opset 11 --output resnet.onnx

  3. Apply Polygraphy: polygraphy surgeon sanitize --fold-constants resnet.onnx -o folded.onnx

  4. If I run trtexec with the model from point 3, I get this error:
    trtexec_log_uint.txt (4.5 KB)

  5. Use a script for format conversion from uint to float:
    fix_onnx_model.py (217 Bytes)

  6. Run on model from point 5: trtexec --onnx=folded_fix.onnx

  7. I get the error attached at the start of this message. Here is the verbose version:
    trtexec_log_verbose.txt (268.0 KB)

I hope this includes all the information needed.

Thank you.

Hi @cint.lorenzo,

"NonMaxSuppression" is currently not supported by the ONNX parser. We may support it in future releases.
Please refer How to use NMS with Pytorch model (that was converted to ONNX -> TensorRT) · Issue #795 · NVIDIA/TensorRT · GitHub

You need to create a custom plugin for any unsupported layer in your model. Please refer to the samples below:

Custom plugin for ONNX:
https://github.com/NVIDIA/TensorRT/issues/6#issuecomment-603683069

Thank you.