Mismatch between Tensorflow + ONNX versus TensorRT

solarflarefx · July 5, 2020, 8:47pm

Description

I have a keras/tensorflow model I am working with and it seems I have a mismatch in the output when using TensorRT.

To be more specific, since I am working in Windows I am doing the following conversion process (here is a link to tf2onnx):
TensorFlow → TF2ONNX → ONNX Model → Import to TensorRT

I have tested the ONNX output using OnnxRuntime and it matches the tensorflow model. But when I import the ONNX model into the TensorRT C++ API, the output is no longer correct.

I have been probing different parts of the network and I found the node where the two models start to mismatch.
Essentially I have the following:

Initial part of network →
Conv2DTranspose → Concatenate → Conv2D → ReLU → Conv2D → ReLU → Add →
Conv2DTranspose → Concatenate → Conv2D → ReLU → Conv2D → ReLU → Add →
Conv2DTranspose → Concatenate → Conv2D → ReLU → Conv2D → ReLU → Add

If you notice, there are repeating layers here (Conv2DTranspose to Add).

I noticed the following:
If I make the first Add an output during the ONNX conversion, then the TensorRT output matches the onnxruntime output. However, if I make the second Add node the output, then the outputs start to mismatch. I believe this is because TensorRT is seeing that there are repeating layers and is trying to do an optimization. Unfortunately in my case it seems that this attempted optimization is causing an incorrect output.

There are a few questions:

Any idea what optimization TensorRT is trying to do and why it is failing?
When I create ONNX models, I use Netron to view the layers and nodes. Is there some way to similarly view TensorRT models? Or at the very least is there a way to print the model with all the layers and connections using either the command line tool or C++ API, if I am importing an ONNX model? I would like to see what TensorRT does with the problem set of layers.
Since I am using Windows, it seems my only way to import TensorFlow models is to convert to ONNX using TF2ONNX and then import this ONNX model into TensorRT. In the end I will do inference on a Windows machine. If for testing purposes I were to use an Linux machine to use TF-TRT, could I transfer the output of this tool to my Windows machine. In my case I cannot have the same target GPU on the Linux machine.

Environment

TensorRT Version: 7
CUDA Version: 10.2
CUDNN Version: 7.6
TensorRT API: C++
Operating System + Version: Windows 10 64-bit

AakankshaS · July 6, 2020, 5:09am

Hi @solarflarefx,

There is no such tool to visualize TRT model, atleast that i am aware of.
However I request you to share your model and script so that we can help you better.
Thanks!

Topic		Replies	Views
TensorRT 8 : C++ inference gives different results compared to tensorflow python inference TensorRT	7	1362	October 5, 2021
Recurrent convolution TensorRT	9	1102	October 12, 2021
Differences between tensorflow model inference and tensorRT model inference TensorRT tensorrt , tensorflow	6	1766	September 14, 2022
TensorRT python API inference is inconsistent with trtexec inference TensorRT tensorrt	1	1005	February 28, 2023
Problem converting TensorFlow 2-> ONNX model to TensorRT Engine (efficientdet_d0) TensorRT	8	1401	November 17, 2022
ONNX model and TensorRT engine works differently TensorRT	5	743	February 20, 2023
TensorRT gives diffent results than ONNX and Pytorch TensorRT	8	1583	September 28, 2023
ONNX Model Int64 Weights TensorRT	12	13478	February 17, 2024
Tensorrt 8.6 GA : C++ Inference gives diffrence results compared to onnx \|\| pt model python inference TensorRT	3	637	September 20, 2023
TensorRT 8.2.0.6 Python parse report an error - Failed to add input to the network TensorRT	2	782	December 7, 2021

Mismatch between Tensorflow + ONNX versus TensorRT

Description

Environment

Related topics