TensorRT5.0.x INT8 for Onnx?

yinghe2000 · March 27, 2019, 9:45pm

Hi,

I am new to TensorRT, working on optimizing our detection model on it. I roughly tested a fp16 setting for a trt model converted from .onnx, we can see inf time is 7ms, in comparison to default fp32 inf time 24ms, which looks good.

However, I read the TensorRT document, it says that TensorRT 5.0.x doesn’t support INT 8 and INT 8 calibration on ONNX. Is this true? If I have a pytorch detection model, we could save/export it to .onnx format, and then we want to optimize on TensorRT 5.0.2.x by converting this .onnx model to .trt engine. Do we have a way to do INT8 and INT8 calibration?

Thanks.

Wendy

AastaLLL · March 28, 2019, 7:30am

Hi,

The INT8 support is vary from the hardware rather than the library version.
May I know which device do you use? Jetson or desktop?

Here is our support matrix and it’s recommended to check if your device have INT8 support first:
[url]https://docs.nvidia.com/deeplearning/sdk/tensorrt-support-matrix/index.html#hardware-precision-matrix[/url]

Thanks.

Topic		Replies	Views
RT-DETR conversion to int8 TensorRT	1	51	October 23, 2024
TRT Engin in INT8 is much slower than FP16 TensorRT	4	1866	November 11, 2021
INT8 Calibration in Python with TensorRT 8.6 TensorRT tensorrt	5	3166	July 12, 2023
Converting .onnx model to int8 Linux tensorrt , onnx	1	632	August 1, 2023
TensorRT INT8 inference accuracy TensorRT	2	493	May 9, 2022
Deepstream -Jetson Xavier NX - Onnx2trt DeepStream SDK	6	614	October 12, 2021
Differences in performance between onnx models in Pytorch and TensorRT TensorRT	1	2120	June 17, 2019
Do the onnx style model support int8 calibrate? TensorRT	3	1379	January 13, 2020
How to pass uint8 input to a tensorrt engine? TensorRT	6	2645	October 12, 2021
Onnx to int8trt issue Jetson Nano tensorrt , ubuntu , python	5	709	October 15, 2021

TensorRT5.0.x INT8 for Onnx?

Related topics