I’m trying to convert a transformer model from ONNX to a TensorRT engine. When I convert the model in fp32 precision, everything is fine (the outputs of the ONNX model and the TRT engine match). But when I use fp16 precision, I get completely different results (not comparable). I’ve stumbled across this issue on GitHub:
fp16 onnx -> fp16 tensorrt mismatched outputs · Issue #2336 · NVIDIA/TensorRT · GitHub. The problem seems very similar to mine, as some nodes appear to saturate at different values.
So my question is fairly simple: is it possible to force the precision of certain types of ONNX nodes (in my case, run all Pow or ReduceMean nodes in fp32)? I know about the --layerPrecisions option in trtexec, but I don’t think it does exactly what I want, since it takes individual layer names rather than layer types.
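For reference, here is a sketch of what I’d like to achieve with the TensorRT Python API: iterate over the parsed network and pin whole layer *types* to fp32 inside an otherwise-fp16 build. This assumes TensorRT 8.5 Python bindings, and assumes ONNX Pow lowers to an ElementWise layer and ReduceMean to a Reduce layer (the function name `force_fp32_layers` is mine).

```python
def force_fp32_layers(network, config, fp32_types=None):
    """Pin selected TensorRT layer types to fp32 in an otherwise-fp16 build.

    network -- a trt.INetworkDefinition (e.g. filled by the ONNX parser)
    config  -- the trt.IBuilderConfig used for the build
    """
    import tensorrt as trt  # imported here so the sketch is self-contained

    if fp32_types is None:
        # Assumption: Pow maps to an ElementWise layer, ReduceMean to Reduce.
        fp32_types = {trt.LayerType.ELEMENTWISE, trt.LayerType.REDUCE}

    for i in range(network.num_layers):
        layer = network.get_layer(i)
        if layer.type in fp32_types:
            layer.precision = trt.float32              # compute in fp32
            for j in range(layer.num_outputs):
                layer.set_output_type(j, trt.float32)  # keep outputs fp32

    # Make the builder honor per-layer constraints instead of treating
    # them as hints (available since TensorRT 8.2).
    config.set_flag(trt.BuilderFlag.OBEY_PRECISION_CONSTRAINTS)
```

If I understand correctly, the trtexec equivalent would be listing every matching layer by name via --layerPrecisions together with --precisionConstraints=obey, which is why a type-based approach would be much more convenient for a large transformer.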
Thank you for your help!
TensorRT Version: 8.5.03
GPU Type: RTX A4000