I have encountered some errors when trying to convert an ONNX model to TensorRT.
I am using a pretrained SSD Lite MobileNet V2 model that I have retrained.
Firstly, I converted my saved_model to ONNX with a tf2onnx command line of the following general form (the paths and opset shown here are placeholders for my actual values):
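python -m tf2onnx.convert --saved-model ./saved_model --output output.onnx --opset 13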
Hi,
Request you to share the ONNX model and the script if not shared already so that we can assist you better.
Alongside, you can try a few things:
1) Validate your model with the below snippet:
check_model.py
import onnx

# Path to your ONNX file (placeholder name)
filename = "yourONNXmodel.onnx"
model = onnx.load(filename)
# check_model raises an exception if the model is structurally invalid
onnx.checker.check_model(model)
2) Try running your model with the trtexec command.
In case you are still facing issues, request you to share the trtexec "--verbose" log for further debugging.
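For example (the model name here is a placeholder):
trtexec --onnx=yourONNXmodel.onnx --verbose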
Thanks!
According to your check_model.py snippet, both ONNX models are valid.
Due to confidentiality issues, I cannot share my models with you, but you can find attached the verbose log files from the TensorRT conversion of my UINT8 model (output.onnx) and my FLOAT32 model (output_float32.onnx).
Yes, I have seen other topics about the UINT8 data issue.
I have also seen on this topic that an update addressing it is planned:
Do you know when it will be available, please?
Because of the UINT8 data issue, I tried to change my model input from UINT8 to FLOAT32, but I got the following error (the full log is in the output_float32_onnx_trt_conversion_output.txt file):
[graphShapeAnalyzer.cpp::nvinfer1::builder::anonymous-namespace'::ShapeNodeRemover::processCheck::587] Error Code 4: Internal Error ((Unnamed Layer* 43) [LoopOutput]_output: tensor volume exceeds (2^31)-1, dimensions are [2147483647,3])
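For reference, the input change I attempted was along these lines (a simplified sketch, not my exact script; filenames are placeholders):

import onnx

# Load the original UINT8-input model (filename is a placeholder)
model = onnx.load("output.onnx")

# Rewrite the graph input's element type from UINT8 to FLOAT32;
# downstream nodes that assumed UINT8 may need adjusting as well
model.graph.input[0].type.tensor_type.elem_type = onnx.TensorProto.FLOAT

onnx.save(model, "output_float32.onnx")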
Do you have an idea how to fix errors for my FLOAT32 model please?
Currently, we do not have an approximate ETA.
As mentioned earlier, TRT currently does not support tensors with more than 2^31-1 elements, and we do not have a workaround other than modifying the network. In your log, the failing tensor has dimensions [2147483647, 3], i.e. (2^31-1) × 3 elements, roughly three times over the limit.
I would like to share some updates regarding the conversion of my model to TensorRT.
I modified the overall steps like below:
1) I took my saved model and specified its input shape as [1,576,720,3] (see the note after this list)
→ when converting from ONNX to TensorRT, this removed the "tensor volume exceeds (2^31)-1" error
2) I converted its input to FLOAT32 (with the script I shared above)
3) I converted the NMS layers for TRT with a graphsurgeon script (outlined after this list)
4) I converted this model to TRT with the following command line:
trtexec --onnx=ssd_lite_mobilenet_v2_input_shape_ops_13_float32_BatchedNMSDynamic_TRT.onnx --saveEngine=output.trt
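For step 1, I fixed the shape at conversion time; with tf2onnx, that kind of override looks roughly like this (the input tensor name here is a placeholder, not necessarily my real one):

python -m tf2onnx.convert --saved-model ./saved_model --output ssd_lite_mobilenet_v2_input_shape.onnx --opset 13 --inputs serving_default_input_tensor:0[1,576,720,3]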
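For step 3, the graphsurgeon script is not reproduced in full here; in outline it follows this pattern (tensor names and plugin attribute values are placeholders for my actual ones):

import numpy as np
import onnx
import onnx_graphsurgeon as gs

# Load the FLOAT32 model with the fixed input shape
graph = gs.import_onnx(onnx.load("ssd_lite_mobilenet_v2_input_shape_ops_13_float32.onnx"))
tensors = graph.tensors()

# The two tensors feeding the original NMS (names are placeholders)
boxes = tensors["boxes"]
scores = tensors["scores"]

# Outputs produced by the BatchedNMSDynamic_TRT plugin
outputs = [
    gs.Variable("num_detections", dtype=np.int32),
    gs.Variable("nmsed_boxes", dtype=np.float32),
    gs.Variable("nmsed_scores", dtype=np.float32),
    gs.Variable("nmsed_classes", dtype=np.float32),
]

# Insert the plugin node and make its outputs the graph outputs;
# cleanup() then prunes the original NMS subgraph
nms = gs.Node(
    op="BatchedNMSDynamic_TRT",
    attrs={
        "shareLocation": True,
        "backgroundLabelId": -1,   # placeholder
        "numClasses": 91,          # placeholder
        "topK": 100,               # placeholder
        "keepTopK": 100,           # placeholder
        "scoreThreshold": 0.3,     # placeholder
        "iouThreshold": 0.6,       # placeholder
        "isNormalized": True,
        "clipBoxes": True,
    },
    inputs=[boxes, scores],
    outputs=outputs,
)
graph.nodes.append(nms)
graph.outputs = outputs
graph.cleanup().toposort()
onnx.save(gs.export_onnx(graph), "ssd_lite_mobilenet_v2_input_shape_ops_13_float32_BatchedNMSDynamic_TRT.onnx")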
The conversion of the ONNX model to TRT failed with the following error:
Could you please share ssd_lite_mobilenet_v2_input_shape_ops_13_float32.onnx?
The graphsurgeon script specifies the 2 required inputs; however, the processed model has only one.
Thank you for your answers.
Actually, the "ssd_lite_mobilenet_v2_input_shape_ops_13_float32.onnx" model corresponds to "test_nvidia_ops_13_input_shape_float32_BatchedNMSDynamic_TRT.onnx"; I just renamed it before sharing it with you.
I did not modify the number of inputs myself; is the input count changed by the ONNX conversion, or somewhere else in the scripts I shared with you?
Could you please check and confirm again?
The above model has the Plugin nodes inserted by ONNX-graphsurgeon. We are looking for the source SSD model without the plugin nodes.
Based on the graphsurgeon script, we can see that ssd_lite_mobilenet_v2_input_shape_ops_13_float32_BatchedNMSDynamic_TRT.onnx is the output model (which you likely renamed). We need the input model below: