TensorRT - INT8 Quantization - weights - activations

farescharfii · December 9, 2019, 4:07pm

Hello everyone,

Can you please tell me whether INT8 quantization in TensorRT (TRT5) quantizes activations only,
or whether it quantizes both weights and activations to INT8 precision?

SunilJB · December 10, 2019, 10:20am

Hi,

It quantizes both weights and activations to INT8 precision, but TRT 5.x doesn’t accept pre-quantized weights as input from the user.

Thanks
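
For readers following along, here is a minimal sketch of what quantizing a tensor to INT8 with a single scale factor looks like. It is a generic illustration in plain Python, not TensorRT's implementation: the helper names (`max_abs_scale`, `quantize_symmetric`, `dequantize`) are hypothetical, and deriving the scale from the maximum absolute value is an assumption made only for this example.

```python
def max_abs_scale(values):
    """Pick a scale so the largest magnitude maps to 127 (assumed scheme)."""
    largest = max(abs(v) for v in values)
    if largest == 0:
        raise ValueError("cannot derive a scale from an all-zero tensor")
    return largest / 127.0


def quantize_symmetric(values, scale):
    """Map floats to integer codes in [-127, 127] using one scale factor."""
    codes = []
    for v in values:
        q = round(v / scale)          # nearest integer code
        q = max(-127, min(127, q))    # saturate to the symmetric INT8 range
        codes.append(q)
    return codes


def dequantize(codes, scale):
    """Recover approximate float values from the integer codes."""
    return [q * scale for q in codes]


# Toy "weights" tensor, flattened to a list for simplicity.
weights = [0.5, -1.27, 0.02, 1.0]
scale = max_abs_scale(weights)
codes = quantize_symmetric(weights, scale)
approx = dequantize(codes, scale)
```

The same arithmetic applies to activations once a scale is known; the difference is that activation ranges depend on the input data, so their scale has to be estimated from sample inputs rather than read directly from a fixed tensor.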

farescharfii · January 9, 2020, 3:43pm

Hello,

Thanks for the answer.

Do you know whether the weights are quantized using the entropy calibrator, or whether they are quantized using min/max quantization?

Thanks
