Description
Hi,
I’ve asked a question here.
So far, I’ve done the calibration for INT8 inference of my U-Net model with the community’s help on GitHub.
The process of my calibration is as below:
- convert the Keras model to .onnx with explicit batch = 1 set
- read the directory path of the calibration data
- cv::imread the input data and convert the cv::Mat to datatype float
- the calibration class does the rest
I’ve tested three different cases with the same model structure, trying to figure out an SOP for calibration, but I couldn’t find a standard way to do it.
In my case, adjusting the beta value passed to cv::Mat::convertTo() can lead to very different results.
Here are the cases I’ve tested, each with the best parameters I found to get results closest to the FP16 ones:
- case A:
model input size: 1920 x 1920 x 1
pre-process of calibration:
cv::Mat img = cv::imread(imgPaths[j]);
img.convertTo(img, CV_32FC(MD_size[2]), 1 / 255.0, -0.45);
result of inference: output contains only the values 0 and 0.5
- case B:
model input size: 1664 x 288 x 1
pre-process of calibration:
cv::Mat img = cv::imread(imgPaths[j]);
img.convertTo(img, CV_32FC(MD_size[2]), 1 / 255.0, 0);
result of inference: output ranges from 0 to 1
- case C:
model input size: 512 x 960 x 3
pre-process of calibration:
cv::Mat img = cv::imread(imgPaths[j]);
img.convertTo(img, CV_32FC(MD_size[2]), 1 / 255.0, 2.5);
result of inference: output contains only the values 0 and 0.5
If I don’t adjust the beta value, the results of case A and case C produce too many overkills (OK parts flagged as defective).
However, adjusting the beta value is a trial-and-error solution for me,
since I don’t apply that beta offset at inference time, where I read the data and convert it to float.
So, I’d like to know:
- Am I doing the calibration pre-processing correctly?
- Why does the data range of the inference results differ between cases?
(One case ranges from 0 to 1; the others contain only the values 0 and 0.5.)
Environment
TensorRT Version: 7.0.0.11
GPU Type: RTX2080 Ti
Nvidia Driver Version: 451.82
CUDA Version: 10.0
CUDNN Version: 7.6.5
Operating System + Version: Windows10
Python Version (if applicable): 3.7.0
TensorFlow Version (if applicable): 1.13.1
PyTorch Version (if applicable): -
Baremetal or Container (if container which image + tag): -
Relevant Files
Steps To Reproduce
Focus on the ONNX2TRT() function, and in particular this part:
std::cout << "***USING INT8***\n";
config->setFlag(BuilderFlag::kINT8);
// provided by @le8888e at https://github.com/NVIDIA/TensorRT/issues/557
std::string calibration_imgs_list = Jconfig["cali_image_path"].get<std::string>();
std::string calibration_table_save_path = Jconfig["cali_save_path"].get<std::string>();
int8EntroyCalibrator *calibrator = nullptr;
calibrator = new int8EntroyCalibrator(1, calibration_imgs_list, calibration_table_save_path);
config->setInt8Calibrator(calibrator);
The int8EntroyCalibrator class handles the calibration itself.
Thanks in advance for any help or advice!