Trtexec --layerPrecision and --precisionConstraints not respected when converting onnx model

MathiasDall · January 31, 2023, 9:28pm

Description

Hello,

I am trying to convert an .onnx model to a fp16 .engine using the below command:

trtexec --onnx=-beit-base-patch16-224.onnx --fp16 --saveEngine=model.engine --minShapes=\'pixel_values\':1x3x224x224 --optShapes=\'pixel_values\':8x3x224x224 --maxShapes=\'pixel_values\':8x3x224x224 --precisionConstraints=obey --layerPrecisions=/beit/embeddings/patch_embeddings/projection/Conv:fp32

The engine is build as expected when using fp32, however when the --fp16 flag is set, the below warnings appear and cause model outputs to be wrong:

Even when setting the --precisionConstraints=obey and layerPrecisions=/beit/embeddings/patch_embeddings/projection/Conv:fp32 or layerPrecisions=/beit/embeddings/patch_embeddings/projection/Conv.weight:fp32 the weights are still cast to fp16 causing a bad model conversion.

Am I doing something wrong or is there any way to fix the above issue?

ONNX model link: Download | file.io

(Using the Deepstream 6.1.1 docker image)

Thanks!

Environment

TensorRT Version:
8.4.1
GPU Type:
RTX 3050TI
Nvidia Driver Version:
522.30
CUDA Version:
11.8
CUDNN Version:

Operating System + Version:
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):

Relevant Files

Please attach or include links to any models, data, files, or scripts necessary to reproduce your issue. (Github repo, Google Drive, Dropbox, etc.)

Steps To Reproduce

Please include:

Exact steps/commands to build your repro
Exact steps/commands to run your repro
Full traceback of errors encountered

spolisetty · February 1, 2023, 5:37pm

Hi,

Please see the related post for more information.

Could you please try on the latest TensorRT version 8.5.3 and let us know if you still face this issue.

Thank you.

MathiasDall · February 2, 2023, 9:49am

8.5.3 works as expected - thanks.

As a suggestion (if at all possible), it would be convenient to have a flag to automatically avoid casting layers to fp16 if they get clamped eg. --avoid-clamp

Currently I have ~20 layers that get clamped and I have to explicitly set all of them in --layerPrecision to avoid FP16 clamping.

Thanks!

system · February 20, 2023, 6:38am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Meet some problem with --precisionConstraints=obey --layerPrecisions TensorRT	1	1468	January 17, 2023
255 weights are affected by this issue: Detected subnormal FP16 values TensorRT	1	1363	January 17, 2023
Convert the TRT model with FP16 Jetson TX2 jetpack , tensorrt , jetson-inference	7	2618	October 18, 2021
Error on converting ONNX to FP16 TensorRT with my model Deep Learning (Training & Inference)	0	428	August 17, 2020
TRTExec - Force precision on certain ONNX Op Nodes TensorRT	1	1163	February 23, 2023
Tensorrt FP16 conversion issue TensorRT tensorrt , cuda , gstreamer , onnx , deep-learning , deepstream	8	2713	March 6, 2023
Warning While DS App Converts ONNX Model To Engine DeepStream SDK tensorrt	5	949	July 11, 2022
Some layers of onnx are discarded directly, when pytorch onnx convert egine file on FP16 TensorRT	1	801	June 12, 2020
Misc Error in transformWeightsIfFP: 1 when using onnx model and run for tensorrt using FP16 DRIVE AGX Xavier General tensorrt , driveos-dl	6	531	October 12, 2021
Different inference result with tensorrt FP16 and FP32 TensorRT	1	1752	May 16, 2019

Trtexec --layerPrecision and --precisionConstraints not respected when converting onnx model

Description

Environment

Relevant Files

Steps To Reproduce

Related topics