Inferencing on AGX Xavier in INT8 mode

user17113 · November 15, 2021, 4:18pm

hi there. I am trying out on reducing the precision from FP32 to FP16, and it is quite straight forward, but I can only get limited resources on how to do inferencing using INT8 (configuring the ‘config’ and setting up the calibration) in Python language. And good reference on how to do this for TensorRT 8.0.1

AastaLLL · November 16, 2021, 3:17am

Hi,

Since INT8 changes data type from floating into integer, an extra calibration process is required.

You can find a calibration example from the TensorRT sample below:

Below is another good tutorial from the users for your reference:

Thanks.

user17113 · November 16, 2021, 3:45am

Thanks so much! I will try it out

system · December 8, 2021, 3:35am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Jetson agx xavier int8 calibration file DeepStream SDK jetson	2	577	October 12, 2021
Jetson AGX Xavier INT8 Performance Jetson AGX Xavier	4	1756	October 18, 2021
TensorRT TensorRT tensorrt , python	1	317	October 27, 2021
Int8 quantization TensorRT	1	479	December 16, 2021
Failed to use INT8 precision mode when using tf-trt on Xavier Jetson AGX Xavier	4	965	October 18, 2021
TensorRT int8 performance Jetson AGX Xavier	4	1213	October 18, 2021
Converting Caffe to TensorRT using int8 Jetson TX2	4	775	October 18, 2021
Generate calibration file Jetson Xavier NX tensorrt	8	884	September 27, 2021
RT-DETR conversion to int8 TensorRT	1	51	October 23, 2024
pre-quantized models on Jetson AGX Xavier Jetson AGX Xavier	10	940	October 18, 2021

Inferencing on AGX Xavier in INT8 mode

Related topics