API for getting INT8 calibration scale factors after calibration is finished?

carlosgalvezp · March 24, 2022, 10:32am

Hi,

Is there an API for fetching the INT8 calibration scale factor of a given tensor after the INT8 calibration process has finished?

Currently this information is encoded into the INT8 calibration cache, but the existing API only gives a raw pointer to a buffer. According to the docs, the calibration cache is an “internal implementation detail”, so I take it I should not “reverse engineer it” to obtain these scale factors, since this can change “any time” at Nvidia’s discretion.

My use case is:

1. Build a DLA engine with Safety and INT8.
2. Due to Safety, the network must be reformat-free.
3. Therefore, the I/O tensors are INT8.
4. I want to end up with FP32 tensors → I need to create the INT8 → FP32 reformatting layer myself.
5. I can only accomplish that if I know the correct scale factors to apply when converting from INT8 to FP32. This information is obtained somewhere in the INT8 calibration process - how do I get it?
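
For illustration, the INT8 → FP32 conversion described above can be sketched as follows. This is a minimal sketch, assuming symmetric per-tensor quantization (one scale per tensor, no zero point); the function name `dequantize_int8` and the scale value are hypothetical, and the real scale would have to come from calibration.

```python
def dequantize_int8(values, scale):
    """Convert INT8 values to FP32 using a single per-tensor scale.

    Assumption: symmetric quantization, so real_value = int8_value * scale.
    """
    for v in values:
        if not -128 <= v <= 127:
            raise ValueError(f"{v} is outside the INT8 range")
    return [float(v) * scale for v in values]


# Hypothetical scale, used only to show the arithmetic.
int8_output = [-128, 0, 64, 127]
fp32_output = dequantize_int8(int8_output, scale=0.5)
```

Without the right scale for each output tensor, the same INT8 values map to different FP32 values, which is why the scale has to be known exactly.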

Thanks!

spolisetty · March 29, 2022, 5:47pm

Hi,

We don’t think there is a scale for the output if it is INT8. If this scale does exist, the only way we can think of to get it is to read the calibration cache file.
https://github.com/NVIDIA/TensorRT/tree/main/samples/sampleINT8#calibration-file
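
A rough sketch of reading such a file is shown below. It rests on assumptions that are not confirmed in this thread: that the cache is text with one header line followed by `tensor_name: hex` entries, and that each hex value is a big-endian IEEE 754 float32 holding the scale. The function name `parse_calibration_cache` and the sample contents are hypothetical, and since the format is described as an internal implementation detail, it may change between TensorRT versions.

```python
import struct


def parse_calibration_cache(text):
    """Return {tensor_name: scale} parsed from calibration cache text.

    Assumptions (not an official format): the first line is a header,
    every other line is "name: hex", and hex is a big-endian float32.
    """
    scales = {}
    for line in text.strip().splitlines()[1:]:  # skip the header line
        # rpartition tolerates tensor names that themselves contain ':'
        name, sep, hex_value = line.rpartition(":")
        hex_value = hex_value.strip()
        if not sep or len(hex_value) != 8:
            continue  # not a "name: hex" entry
        try:
            raw = bytes.fromhex(hex_value)
        except ValueError:
            continue  # value is not valid hex
        scales[name.strip()] = struct.unpack("!f", raw)[0]
    return scales


# Made-up cache contents, used only to exercise the parser.
sample = "TRT-8000-EntropyCalibration2\ninput: 3f000000\noutput: 40000000\n"
scales = parse_calibration_cache(sample)
```

Any value obtained this way should be checked against a known FP32 reference output before being relied on.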

Thank you.
