Is it normal for a TensorRT INT8 model with a dynamic input shape (batch size) to be more accurate than one with a fixed input shape?

Description

When I convert my pretrained model to a TensorRT INT8 engine, I find that if the input shape is fixed, i.e.,

profile.set_shape(inp.name, min=(32, 3, 512, 512), opt=(32, 3, 512, 512), max=(32, 3, 512, 512))

the accuracy drops dramatically compared to the engine generated with a dynamic input shape:

profile.set_shape(inp.name, min=(16, 3, 512, 512), opt=(32, 3, 512, 512), max=(48, 3, 512, 512))

Is this normal?
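
For context, a minimal sketch of the build flow, assuming an ONNX model (the model path, calibrator instance, and variable names are illustrative placeholders, not my exact code):

import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)
with open("model.onnx", "rb") as f:  # placeholder path
    if not parser.parse(f.read()):
        raise RuntimeError(parser.get_error(0))

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.INT8)
config.int8_calibrator = calibrator  # placeholder: an IInt8EntropyCalibrator2 instance

profile = builder.create_optimization_profile()
inp = network.get_input(0)
# Fixed-shape variant: min == opt == max
profile.set_shape(inp.name, min=(32, 3, 512, 512),
                  opt=(32, 3, 512, 512), max=(32, 3, 512, 512))
config.add_optimization_profile(profile)
# With dynamic input dimensions, INT8 calibration also needs a calibration profile.
config.set_calibration_profile(profile)

engine = builder.build_engine(network, config)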

Environment

TensorRT Version: 8.0.1.6
GPU Type: NVIDIA GeForce RTX 3060
Nvidia Driver Version: 11.4
CUDA Version: cuda_11.3.r11.3
CUDNN Version: 8.1.1
Operating System + Version: Ubuntu 16.04.6 LTS (GNU/Linux 4.4.0-142-generic x86_64)
Python Version (if applicable): 3.6
TensorFlow Version (if applicable):
PyTorch Version (if applicable): 1.10.2+cu113
Baremetal or Container (if container which image + tag):

Hi,

Could you please try the latest TensorRT version, 8.5.2, and let us know if you still face this issue?
Please also share a minimal issue-repro ONNX model and scripts so we can debug this better.

Thank you.

The problem was solved; it was actually a bug in my code. When I generated the fixed-shape engine, I used a broken calibrator whose batch data was smaller than the allocated GPU buffer, so the uninitialized remainder of that GPU memory corrupted the calibration process and degraded the accuracy of the engine.
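
For anyone hitting the same symptom, here is a sketch of a calibrator that guards against this bug: it allocates the device buffer for exactly one batch and asserts that every host batch fills it completely (the class name, batch shape, and cache file name are illustrative):

import numpy as np
import pycuda.driver as cuda
import pycuda.autoinit  # creates a CUDA context
import tensorrt as trt

class FixedBatchCalibrator(trt.IInt8EntropyCalibrator2):
    """Illustrative calibrator: every batch fills the device buffer exactly,
    so calibration never reads uninitialized GPU memory."""

    def __init__(self, batches, cache_file="calib.cache"):
        trt.IInt8EntropyCalibrator2.__init__(self)
        self.batches = iter(batches)  # iterable of (32, 3, 512, 512) float32 arrays
        self.cache_file = cache_file
        # Allocate once, sized to exactly one full batch.
        self.nbytes = 32 * 3 * 512 * 512 * np.dtype(np.float32).itemsize
        self.d_input = cuda.mem_alloc(self.nbytes)

    def get_batch_size(self):
        return 32

    def get_batch(self, names):
        try:
            batch = next(self.batches)
        except StopIteration:
            return None  # signals the end of the calibration data
        # Guard against the partial-fill bug described above.
        assert batch.nbytes == self.nbytes, "batch does not fill the device buffer"
        cuda.memcpy_htod(self.d_input, np.ascontiguousarray(batch))
        return [int(self.d_input)]

    def read_calibration_cache(self):
        try:
            with open(self.cache_file, "rb") as f:
                return f.read()
        except FileNotFoundError:
            return None

    def write_calibration_cache(self, cache):
        with open(self.cache_file, "wb") as f:
            f.write(cache)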

