Why is the variable `nbins` in `_compute_amax_entropy` determined by the sign?

dk.hong · March 9, 2023, 12:33am

As I understand, HistogramCalibrator collects data distribution of each activation, and it is compressed into 2^bits bins for computing entropy.

In the signed 8-bit setting, integer values among -128~127 can be used, and the number of candidate integers is 256.

In the unsigned 8-bit setting, integer values among 0~255 can be used, and the number of candidate integers is also 256.

So, I think nbins in _compute_amax_entropy should always be 256 regardless of the sign.

However, the implementation in pytorch_quantization, nbins of signed 8-bit setting is half that of unsigned 8-bit setting.

github.com

NVIDIA/TensorRT/blob/8e756f163f83d54389c7ff82235e57a518f6eb03/tools/pytorch-quantization/pytorch_quantization/calib/histogram.py#L204


      
                  distr = distr / summ
          
          
bins = calib_hist[:]
          bins[0] = bins[1]
          
          
total_data = np.sum(bins)
          
          
divergences = []
          arguments = []
          
          
# we are quantizing to 128 values + sign if num_bits=8
          nbins = 1 << (num_bits - 1 + int(unsigned))
          
          
starting = start_bin
          stop = len(bins)
          
          
new_density_counts = np.zeros(nbins, dtype=np.float64)
          
          
for i in range(starting, stop + 1, stride):
              new_density_counts.fill(0)
              space = np.linspace(0, i, num=nbins + 1)

Can you inform me what I missed?

AakankshaS · March 31, 2023, 6:34am

Hi @dk.hong ,

Apologies for the delay,

Did you check the available doc?
Thanks

Topic		Replies	Views
Three questions about 8-bit quantization(Entropy Calibration - pseudocode) TensorRT	0	1818	December 20, 2018
Tensorrt - Entropy Calibration - pseudocode TensorRT	5	2691	January 9, 2020
How to form the Q-distribution in tensorrt 8bit Calibration? TensorRT	0	920	December 24, 2018
Quantization to int8 still confusing TensorRT tensorrt	1	511	March 23, 2020
tensorRT quantization CUDA Programming and Performance	0	662	January 31, 2018
tensorRT quantization problem GPU-Accelerated Libraries	0	461	January 31, 2018
What does histogram of activation mean in Caliberation for INT8 in tensorRT? General	3	1400	June 1, 2018
INT8 calibration, calculation of Kullback-Leiber Divergence TensorRT	4	2999	November 28, 2019
Questions for PTQ Entropy Calibration TensorRT	1	714	November 22, 2021
IInt8EntropyCalibrator TensorRT	2	1151	September 4, 2018

Why is the variable `nbins` in `_compute_amax_entropy` determined by the sign?

Related topics