Is there a way to get quantized weights after calibration?

Hello, I am studying TensorRT.
I want to make custom layers (using IPluginV2IOExt and a CUDA kernel) and do INT8 quantization on the model.
So, I implemented a convolution layer with a CUDA kernel that is equivalent to the built-in one.
However, when I added an IInt8EntropyCalibrator2 and ran INT8 quantization, I realized that there is no way to give the custom layers quantized weights; the calibrator only deals with input and output data.

To be brief:

  1. Make custom layers with a CUDA kernel and IPluginV2IOExt.
  2. Do INT8 quantization with IInt8EntropyCalibrator2.
    However, I think there is no way to pass the weight data to IPluginV2IOExt, so I cannot quantize my custom layers.
    Is it impossible?

Please help.

Thank you.

Hi @muger1031,
You have to manage the weights yourself if you are using a plugin to implement your custom layer.
Using TRT's IInt8EntropyCalibrator2 you can get the input/output scale factors, but you will have to compute the INT8 weights from a scale factor yourself.
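For illustration, here is a minimal sketch of symmetric per-tensor weight quantization; the helper names and the max(|w|)/127 scale choice are assumptions of mine, not TRT API:

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <vector>

// Choose a weight scale yourself, e.g. symmetric max-abs: scale = max(|w|) / 127.
float computeWeightScale(const std::vector<float>& weights)
{
    float maxAbs = 0.0f;
    for (float w : weights)
        maxAbs = std::max(maxAbs, std::fabs(w));
    return maxAbs > 0.0f ? maxAbs / 127.0f : 1.0f;
}

// Quantize each weight: q = clamp(round(w / scale), -127, 127).
std::vector<int8_t> quantizeWeights(const std::vector<float>& weights, float scale)
{
    std::vector<int8_t> q(weights.size());
    for (size_t i = 0; i < weights.size(); ++i)
    {
        long r = std::lround(weights[i] / scale);
        q[i] = static_cast<int8_t>(std::max(-127L, std::min(127L, r)));
    }
    return q;
}
```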
For a complete plugin that manages its own weights, you can refer to the sample below.
https://github.com/NVIDIA/TensorRT/blob/master/samples/opensource/samplePlugin/fcPlugin.h
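Note that with IPluginV2IOExt, the calibrated input/output scales are handed to your plugin through the scale field of PluginTensorDesc in configurePlugin. A sketch of reading them (written as a free function for brevity; in your plugin this logic would live in the configurePlugin override):

```cpp
#include <NvInfer.h>

using namespace nvinfer1;

// configurePlugin(const PluginTensorDesc* in, int nbInput,
//                 const PluginTensorDesc* out, int nbOutput)
// passes one PluginTensorDesc per tensor; for INT8 tensors its `scale`
// field holds the scale determined by calibration.
void captureScales(const PluginTensorDesc* in, int nbInput,
                   const PluginTensorDesc* out, int nbOutput,
                   float& inputScale, float& outputScale)
{
    if (nbInput > 0 && in[0].type == DataType::kINT8)
        inputScale = in[0].scale;
    if (nbOutput > 0 && out[0].type == DataType::kINT8)
        outputScale = out[0].scale;
}
```

In enqueue, the INT32 accumulator of the INT8 convolution is then rescaled by roughly inputScale * weightScale / outputScale before being written back as INT8.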

Thanks!
