Description
The accuracy of the converted TensorRT engine was significantly lower (roughly 50% worse) than the ONNX model when built with the --fp16 flag, while the FP32 build was fine.
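For reference, the accuracy gap shows up in a Polygraphy comparison against ONNX Runtime. This is roughly the command I used (the tolerances and the data-loader shape are values I picked, not anything prescribed):

polygraphy run rtdetr_r18vd_custom.onnx --trt --fp16 --onnxrt --atol 1e-2 --rtol 1e-2 --input-shapes images:[1,3,512,512] --trt-min-shapes images:[1,3,512,512] --trt-opt-shapes images:[5,3,512,512] --trt-max-shapes images:[16,3,512,512]

The --atol/--rtol values decide what counts as a mismatch, so they may need adjusting for this model.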
I tried to use polygraphy run to identify the problematic layers, but hit the error "Could not find any implementation for node {ForeignNode[/model/decoder/Where_output_0…/model/decoder/decoder/dec_bbox_head.0/layers.2/Add]}".
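For completeness, the per-layer debugging I was attempting follows Polygraphy's precision-bisection workflow, roughly as below (file names are my own, and I've omitted the dynamic-shape flags for brevity); the build error above is raised while this tool rebuilds the engine with different layers constrained to FP32:

# Save golden outputs from ONNX Runtime first:
polygraphy run rtdetr_r18vd_custom.onnx --onnxrt --save-outputs golden.json

# Then bisect which layers must stay in FP32 for the FP16 engine to match:
polygraphy debug precision rtdetr_r18vd_custom.onnx --fp16 --check polygraphy run polygraphy_debug.engine --trt --load-outputs golden.json --atol 1e-2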
Please kindly help me with this FP16 issue. Thanks!
Environment
TensorRT Version: 8.6.1.6
GPU Type: RTX 3050
Nvidia Driver Version: 550.90.07
CUDA Version: 12.4
CUDNN Version:
Operating System + Version:
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):
Relevant Files
Please attach or include links to any models, data, files, or scripts necessary to reproduce your issue. (Github repo, Google Drive, Dropbox, etc.)
Steps To Reproduce
./trtexec --onnx=rtdetr_r18vd_custom.onnx --fp16 --minShapes=images:1x3x512x512 --optShapes=images:5x3x512x512 --maxShapes=images:16x3x512x512 --saveEngine=rtdetr_r18vd_custom_fp16.engine --verbose
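If someone can point me at which layers overflow in FP16, my understanding of the trtexec help is that individual layers can be pinned back to FP32 with precision constraints; this is a sketch of what I would try (the layer name below is a placeholder taken from the error message, not a layer I have confirmed):

./trtexec --onnx=rtdetr_r18vd_custom.onnx --fp16 --precisionConstraints=obey --layerPrecisions=/model/decoder/Where:fp32 --minShapes=images:1x3x512x512 --optShapes=images:5x3x512x512 --maxShapes=images:16x3x512x512 --saveEngine=rtdetr_r18vd_custom_fp16_mixed.engine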
Please include:
- Exact steps/commands to build your repro
- Exact steps/commands to run your repro
- Full traceback of errors encountered