int8 calibration table

poirot · December 14, 2017, 6:45am

Hello, everyone. I am trying to compress my network to 8 bits, but I found that some customized layers didn’t work, so I may need to fine-tune the network in 8 bits. However, I cannot find any documentation on the calibration table. Can anyone tell me how to interpret the cache table, or where I can find documentation on the details of 8-bit calibration? Thanks!

AastaLLL · December 14, 2017, 8:50am

Hi,

INT8 can only run on platforms with GPU architecture 6.1 (sm_61).
TX2 is an sm_62 design and doesn’t support the INT8 feature.

Thanks.

poirot · December 15, 2017, 7:45am

Thanks for your reply. The platform is a P4, which supports 8-bit inference. I could not find an area in this forum for the P4/P40, so I posted my problem here.

AastaLLL · December 19, 2017, 5:55am

Hi,

Sorry for the late reply.

The key INT8 concepts are described in our user guide:
/usr/share/doc/tensorrt/TensorRT-3-User-Guide.pdf.gz

3.7. SampleINT8 - Calibration and 8-bit Inference

Please see that section for more information.
Thanks.

poirot · December 20, 2017, 4:11am

Section 3.7, SampleINT8 - Calibration and 8-bit Inference, says: “The parameters are recorded in the table. If the network or calibration set changes, it is the application’s responsibility to invalidate the cache.”
But I can’t find any details about these parameters. The cache file saved by TensorRT stores them in an encoded form rather than as readable numbers.
For example:
(Unnamed ITensor* 38): 3c3604b4
(Unnamed ITensor* 88): 3c42b64c
(Unnamed ITensor* 55): 3cb2d9e0
(Unnamed ITensor* 45): 3c22703f
(Unnamed ITensor* 9): 3c4023ca
(Unnamed ITensor* 30): 3c5e5988
(Unnamed ITensor* 123): 3c3b5bb9
(Unnamed ITensor* 41): 3ccc51b9
(Unnamed ITensor* 6): 3c6ac62b
(Unnamed ITensor* 31): 3c663928
(Unnamed ITensor* 26): 3ce3acfb
…

Because of some custom layers, I can’t use the cache table directly.
If I want to fine-tune the network, I need to know how to use these parameters in my training code.
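
For readers with the same question, here is a minimal sketch of one common way to bring a per-tensor scale into training code: simulated ("fake") quantization, where values are snapped to the INT8 grid and mapped back to floats. It assumes, without confirmation from anything quoted in this thread, that each table entry is a per-tensor scale for symmetric INT8 quantization. The function name `fake_quantize` and the sample scale of 0.5 are hypothetical.

```python
def fake_quantize(values, scale, qmax=127):
    """Snap floats to a symmetric INT8 grid and map them back to floats.

    Assumption: `scale` is a per-tensor step size, so real ~= scale * int8.
    """
    out = []
    for x in values:
        q = round(x / scale)          # nearest integer level (Python rounds ties to even)
        q = max(-qmax, min(qmax, q))  # saturate to [-127, 127]
        out.append(q * scale)         # back to float so the training code can keep using it
    return out


activations = [0.0, 0.6, 100.0, -100.0]
simulated = fake_quantize(activations, scale=0.5)
print(simulated)
```

Values far outside the range saturate at ±127 × scale, which is the effect a fine-tuning run would need to learn to tolerate.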

AastaLLL · December 21, 2017, 9:31am

Hi,

The calibration table is dumped directly from the memory buffer.

We have a native sample to demonstrate INT8 feature:
In /usr/src/tensorrt/samples/sampleINT8/sampleINT8.cpp:

...
Int8EntropyCalibrator calibrator(calibrationStream, FIRST_CAL_BATCH);
...

Please check it for information.
Thanks.

farescharfii · November 22, 2019, 3:49pm

Hello,

Can you please explain what these Hex numbers mean in the calibration table?

ncxinhanzhong · February 28, 2020, 8:25am

Hi,

The hex numbers reflect the range that the INT8 calibration process uses. For example, if a number is 114, then that tensor's values are treated as lying in [-114, 114].
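
A different reading, which fits the sample entries quoted earlier in the thread but is an assumption not confirmed by anything posted here: each 8-digit hex string is the bit pattern of a 32-bit IEEE 754 float holding a per-tensor scale, and the covered range is roughly ±127 × scale. A minimal sketch of decoding one entry under that assumption (the helper name `parse_cache_line` is hypothetical):

```python
import struct


def parse_cache_line(line):
    """Split '(name): hex' into (name, scale), reading the hex as float32 bits."""
    name, hex_bits = line.rsplit(":", 1)
    # Assumption: the 8 hex digits are a big-endian IEEE 754 float32 bit pattern.
    (scale,) = struct.unpack(">f", bytes.fromhex(hex_bits.strip()))
    return name.strip(), scale


name, scale = parse_cache_line("(Unnamed ITensor* 38): 3c3604b4")
max_abs = scale * 127  # assumed symmetric range covered by INT8 levels -127..127
print(name, scale, max_abs)
```

Under this reading the entry above decodes to a scale of about 0.011, so the tensor would be clipped to roughly ±1.41.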

ncxinhanzhong · February 28, 2020, 8:36am

Quoting poirot:

“Because of some custom layers, I can’t use the cache table directly. If I want to fine-tune the network, I need to know how to use these parameters in my training code.”

I think you can use the calibration table directly for your model with customized layers. TensorRT only considers the layers that have entries in the cache and leaves your customized layers alone.

I also tried running INT8 calibration directly on my model (which includes a bunch of customized layers), but I hit the following error:

[2020-02-28 05:23:22 ERROR] FAILED_ALLOCATION: std::exception
[2020-02-28 05:23:22 ERROR] Requested amount of memory (18446744065119617096 bytes) could not be allocated. There may not be enough free memory for allocation to succeed.
[2020-02-28 05:23:22 ERROR] /home/jenkins/workspace/TensorRT/helpers/rel-6.0/L1_Nightly/build/source/rtSafe/resources.h (57) - OutOfMemory Error in CpuMemory: 0
[2020-02-28 05:23:22 ERROR] FAILED_ALLOCATION: std::exception
[2020-02-28 05:23:22 ERROR] Requested amount of memory (18446744065119617096 bytes) could not be allocated. There may not be enough free memory for allocation to succeed.
[2020-02-28 05:23:22 ERROR] /home/jenkins/workspace/TensorRT/helpers/rel-6.0/L1_Nightly/build/source/rtSafe/resources.h (57) - OutOfMemory Error in CpuMemory: 0
[2020-02-28 05:23:22 ERROR] FAILED_ALLOCATION: std::exception
[2020-02-28 05:23:22 ERROR] Requested amount of memory (18446744065119617096 bytes) could not be allocated. There may not be enough free memory for allocation to succeed.
terminate called after throwing an instance of ‘std::out_of_range’
what(): _Map_base::at

However, calibration succeeded when I gave the program a fake cache table generated from a subset of the whole model (say, the backbone).

My computer has 64 GB of memory, so I am confused by the error log.
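
One way to read that log, offered as a guess rather than a diagnosis: the requested size is just below 2^64, which is how a negative 64-bit number looks when printed as unsigned. If so, the failure would point to a bad size computation rather than to the machine actually running out of memory. The arithmetic can be checked with a short sketch:

```python
requested = 18446744065119617096  # byte count copied from the error log

# Reinterpret the unsigned 64-bit value as a signed 64-bit integer.
signed = requested - 2**64 if requested >= 2**63 else requested

print(signed)          # the same bits read as a signed byte count
print(signed / 2**30)  # the same quantity expressed in GiB
```

Read as signed, the value is negative and close to 8 GiB in magnitude, which no allocator could satisfy regardless of installed memory.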

Can anyone help? Thanks.

kayccc · March 5, 2020, 4:37am

Hi ncxinhanzhong,

Please open a new topic for your issue. Thanks.
