Acceleration with INT8 precision using TensorRT

himanipatel · February 11, 2021, 6:13am

Description

I have successfully converted ResNet-r50 to FP16 using TensorRT with Python and C++, but I am unable to do the same with INT8 precision. I can't quite understand the calibration step involved in INT8 acceleration from the official documentation.

Can anyone help me understand the calibration? A good tutorial or reference links would help.

Thanks in advance.

Environment

TensorRT Version: 7.2.2.1
GPU Type: NVIDIA RTX 3080
Nvidia Driver Version: 460.27.04
CUDA Version: 11.2
Operating System + Version: LINUX 18.04
Python Version: 3.6
TensorFlow Version: 2.3.1

NVES · February 11, 2021, 8:38pm

Hi, we recommend checking the supported features at the link below.

Thanks!

spolisetty · February 12, 2021, 5:10am

Hi @himanipatel,

Please refer to the following links.

https://github.com/NVIDIA/TensorRT/tree/master/samples/opensource/sampleINT8API
https://github.com/NVIDIA/TensorRT/tree/master/samples/opensource/sampleINT8

Thank you.
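
As background for the samples linked above, here is a minimal sketch of what a calibration step conceptually computes: a scale that maps observed floating-point values onto the INT8 range. This uses simple max calibration in plain Python as an illustration only; TensorRT's entropy calibrators choose the range differently, and the function names and sample values below are hypothetical.

```python
def calibrate_scale(samples):
    """Max calibration: map the largest observed magnitude to 127."""
    amax = max(abs(x) for x in samples)
    return amax / 127.0 if amax > 0 else 1.0


def quantize(x, scale):
    """Round to the nearest INT8 step and clip to the symmetric range."""
    q = round(x / scale)
    return max(-127, min(127, q))


def dequantize(q, scale):
    """Recover an approximate floating-point value from the INT8 code."""
    return q * scale


# Hypothetical activations collected while running calibration images.
calib_activations = [-2.54, -0.5, 0.0, 0.3, 1.27]
scale = calibrate_scale(calib_activations)

# Values outside the calibrated range saturate at +/-127, which is why
# calibration data should be representative of real inputs.
clipped = quantize(10.0, scale)
```

The trade-off is that a wider range covers outliers but leaves coarser steps for typical values, which is the reason calibration is run on representative data rather than fixed in advance.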

himanipatel · February 12, 2021, 9:42am

Thank you for the quick reply.
I have checked the compatibility, and INT8 is supported on our GPU. I have also run the MNIST samples available in the following GitHub repository: https://github.com/NVIDIA/TensorRT/tree/master/samples/opensource/sampleINT8

But I do not understand how to implement the same for custom models.

himanipatel · February 12, 2021, 9:48am

I have referred to these links, but I am still having difficulty converting my custom model. If you have any good tutorials, they would be very helpful.
I am new to the field, so sorry if these queries are basic or obvious.

jkjung13 · February 13, 2021, 3:45am

Check out my Demo #6: Using INT8 and DLA core of tensorrt_demos. I think you’d be able to reuse most of my calibrator.py code. And the code for building the INT8 TensorRT engine is here.
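
For readers who want a rough idea of what such a calibrator looks like, below is a minimal sketch built on the TensorRT Python class `IInt8EntropyCalibrator2`. It is not the code from tensorrt_demos. The helper names `batches` and `make_calibrator`, the cache file name, and the use of PyCUDA for device memory are assumptions for illustration, and each batch is assumed to be a contiguous float32 NumPy array preprocessed exactly as at inference time.

```python
import os


def batches(items, batch_size):
    """Yield consecutive full batches; a trailing partial batch is dropped."""
    for i in range(0, len(items) - batch_size + 1, batch_size):
        yield items[i:i + batch_size]


def make_calibrator(image_batches, batch_size, cache_file="calib.cache"):
    """Build a calibrator that feeds `image_batches` to TensorRT.

    `image_batches` is assumed to be an iterable of contiguous float32
    NumPy arrays shaped (batch_size, C, H, W).
    """
    # Imported lazily so the helper above works without a GPU installed.
    import tensorrt as trt
    import pycuda.driver as cuda
    import pycuda.autoinit  # noqa: F401  (creates a CUDA context)

    class EntropyCalibrator(trt.IInt8EntropyCalibrator2):
        def __init__(self):
            trt.IInt8EntropyCalibrator2.__init__(self)
            self.batch_iter = iter(image_batches)
            self.device_mem = None

        def get_batch_size(self):
            return batch_size

        def get_batch(self, names):
            batch = next(self.batch_iter, None)
            if batch is None:
                return None  # tells TensorRT the calibration data is exhausted
            if self.device_mem is None:
                self.device_mem = cuda.mem_alloc(batch.nbytes)
            cuda.memcpy_htod(self.device_mem, batch)
            return [int(self.device_mem)]  # one device pointer per input

        def read_calibration_cache(self):
            # Reusing a cache skips calibration on later builds.
            if os.path.exists(cache_file):
                with open(cache_file, "rb") as f:
                    return f.read()
            return None

        def write_calibration_cache(self, cache):
            with open(cache_file, "wb") as f:
                f.write(cache)

    return EntropyCalibrator()
```

With TensorRT 7.x, the returned object would typically be attached to a builder config by calling `config.set_flag(trt.BuilderFlag.INT8)` and assigning `config.int8_calibrator = calibrator` before building the engine. This sketch assumes a network with a single input.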

himanipatel · February 13, 2021, 4:51am

Thank you so much.
