What batch size to use for post-training quantization int8 calibration?

Reading over the TRT 8.6 docs, I see the following statement:

What does this mean in practice? Above it says 500 images are sufficient for calibration. In theory, setting the batch size as large as possible would mean that I use 1 batch of 500 images. That doesn’t seem right to me, so what is a practical batch size to use for a sample size of 500 images?

Hi @cyrus.behr,

The advice is to use as large a batch as possible.
How many images fit in one batch depends on the model's resource requirements versus the resources available on your machine. In other words, if your GPU has enough memory to process 500 images in a single batch, we advise that you use a single batch of 500 images.
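To make the trade-off concrete, here is a minimal sketch in plain Python (not the TensorRT API; `calibration_batches` is a hypothetical helper) of splitting a 500-image calibration set into batches. The calibrator sees every image either way, so the batch size mainly trades memory use against the number of calibration passes:

```python
def calibration_batches(images, batch_size):
    """Yield successive batches from the calibration set.

    The final batch may be smaller than batch_size when the set
    size is not an exact multiple of it.
    """
    for start in range(0, len(images), batch_size):
        yield images[start:start + batch_size]

# 500 calibration images, represented here by their indices for illustration
images = list(range(500))

# One maximal batch: the whole set is processed in a single pass...
print(len(list(calibration_batches(images, 500))))  # 1 batch

# ...while a smaller batch size uses less memory but takes more passes.
print(len(list(calibration_batches(images, 32))))   # 16 batches (the last holds 20 images)
```

Whatever batch size the GPU can accommodate, the statistics are accumulated over all batches, so the calibration result is driven by the full 500 images rather than by any single batch.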

It’s important to understand that 500 is an empirical value that may not be large enough for certain models, datasets, and tasks. If accuracy is low, try calibrating with more images: examine the dynamic ranges TensorRT computes for each tensor as you use progressively larger calibration sets, and check that they stabilize.
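The stabilization check above can be sketched in plain Python (with synthetic random data standing in for real activations; `dynamic_range` is a hypothetical helper, not a TensorRT call): estimate a tensor's range from progressively larger samples and watch the estimates converge.

```python
import random

def dynamic_range(samples):
    """Symmetric dynamic-range estimate: the largest absolute value observed."""
    return max(abs(s) for s in samples)

random.seed(0)
# Stand-in for one activation tensor's values collected over many calibration images
activations = [random.gauss(0.0, 1.0) for _ in range(4000)]

# Ranges from progressively larger calibration subsets; once the numbers stop
# growing noticeably, adding more images is unlikely to change the scale.
for n in (100, 500, 1000, 2000, 4000):
    print(f"{n:5d} samples -> estimated range {dynamic_range(activations[:n]):.3f}")
```

In a real workflow you would read the per-tensor ranges from TensorRT's calibration output at each set size instead of computing them yourself, but the convergence criterion is the same.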
During calibration, TensorRT collects statistics on the dynamic range of each intermediate activation tensor. To provide a good estimate of that range, the calibration dataset needs to be representative of the "real" dataset, i.e. the images the model will receive as inputs during deployment.