Using TRT Quantization Toolkit

alex.spivakovsky · September 2, 2021, 3:37pm

Description

Hello,

I’m exploring the TRT Quantization Toolkit. I would like to use a simple example to get things clear.
I’ll have a single Conv2d layer network with pretrained weights.

As I understood, in order to calibrate the network, I need to swap my original Conv2d layer with the QuantConv2d layer which has input and weight quantizers. After doing this I paid attention that after doing this the named_modules of the network now include 3 layers, the QuantConv2d, _input_quantizer and a _weight_quantizer.
When collecting statistics should I just source my input to the QuantConv2d or do something like described here?

Environment

TensorRT Version: 8
GPU Type: 2080 TI
Nvidia Driver Version: 470.57.02
CUDA Version: 11.3
CUDNN Version: 8.0
Operating System + Version: Ubuntu 18.04
Python Version (if applicable): 3.7
PyTorch Version (if applicable): 1.9

SunilJB · September 3, 2021, 5:54am

Hi @alex.spivakovsky
Please refer to below link in case it’s helpful in your case:
https://docs.nvidia.com/deeplearning/tensorrt/pytorch-quantization-toolkit/docs/index.html
https://docs.nvidia.com/deeplearning/tensorrt/pytorch-quantization-toolkit/docs/tutorials/quant_resnet50.html

Thanks

Topic		Replies	Views
How exactly are you supposed to do explicit quantization? TensorRT	1	75	March 4, 2025
TensorRT TensorRT	1	353	August 26, 2021
How to set quantized layers index? TensorRT	1	373	November 23, 2020
TensorRT TensorRT tensorrt	5	654	January 19, 2022
TensorRT conversion issues of ONNX model trained with Quantization Aware Training + custom quantization scale TensorRT tensorrt	5	1379	April 14, 2021
How to set my own quantized weigit and bias scale(not activation)? TensorRT tensorrt	1	345	January 15, 2021
Model multiple quantification in int8 mode TensorRT	1	297	August 18, 2020
Int8 quantization TensorRT	1	504	December 16, 2021
Pytorch Quantization Toolkit TensorRT cudnn	0	37	March 7, 2025
TensorRT TensorRT	1	444	August 26, 2021

Using TRT Quantization Toolkit

Description

Environment

Related topics