I have been successfully using the TensorRT C++ API to load a float32 PyTorch model and convert it to an INT8 TensorRT engine on a local machine. Now I want to convert an INT8 PyTorch QAT model to an INT8 TensorRT engine. Can I do this with TensorRT 7.1.3, since upgrading to TensorRT 8 would require a lot of changes? If so, what steps should I take?
TensorRT Version: 7.1.3
GPU Type: Xavier
Nvidia Driver Version:
CUDA Version: 10.2
CUDNN Version:
Operating System + Version:
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):
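For reference, the float32-to-INT8 calibration flow described above looks roughly like the following with the TensorRT Python API (the actual workflow uses the C++ API; `build_int8_engine`, the ONNX path, and the calibrator object are assumptions for illustration, not the exact setup in question):

```python
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

def build_int8_engine(onnx_path, calibrator):
    """Build an INT8 engine from an ONNX file using a calibrator (TRT 7.x API)."""
    builder = trt.Builder(TRT_LOGGER)
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    parser = trt.OnnxParser(network, TRT_LOGGER)

    with open(onnx_path, "rb") as f:
        if not parser.parse(f.read()):
            for i in range(parser.num_errors):
                print(parser.get_error(i))
            return None

    config = builder.create_builder_config()
    config.max_workspace_size = 1 << 30
    config.set_flag(trt.BuilderFlag.INT8)
    # calibrator is a subclass of trt.IInt8EntropyCalibrator2 that feeds
    # representative input batches.
    config.int8_calibrator = calibrator

    return builder.build_engine(network, config)
```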
Thanks for your reply, but it doesn't really answer my question. The example above shows how to convert a float32 PyTorch model to an INT8 TensorRT model, which I have already implemented.
My question is how to parse an INT8 PyTorch (QAT) model as-is and convert it to an INT8 TensorRT model without calibration.
Right now it seems that I need to convert the INT8 weight parameters back to float, otherwise TensorRT won't accept them.
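To illustrate the "without calibration" part: TensorRT 7.x can also take per-tensor dynamic ranges directly instead of running a calibrator (`ITensor::setDynamicRange` in the C++ API), so the scales learned during QAT could in principle be handed over while the weights stay in float. A rough sketch with the Python API, where `qat_scales` is a hypothetical name-to-scale mapping extracted from the QAT observers:

```python
def apply_qat_ranges(network, qat_scales):
    """Set per-tensor dynamic ranges from QAT scales instead of calibrating.

    `network` is a tensorrt.INetworkDefinition built by the ONNX parser
    (see the earlier sketch). `qat_scales` is a hypothetical dict mapping
    tensor names to the float scale learned by the QAT observers;
    amax = scale * 127 for symmetric int8.
    """
    # Network inputs need a range as well as layer outputs.
    for i in range(network.num_inputs):
        tensor = network.get_input(i)
        if tensor.name in qat_scales:
            amax = qat_scales[tensor.name] * 127.0
            tensor.set_dynamic_range(-amax, amax)

    for i in range(network.num_layers):
        layer = network.get_layer(i)
        for j in range(layer.num_outputs):
            tensor = layer.get_output(j)
            if tensor.name in qat_scales:
                amax = qat_scales[tensor.name] * 127.0
                tensor.set_dynamic_range(-amax, amax)

# The builder config still needs INT8 enabled, but no calibrator is set:
#   config.set_flag(trt.BuilderFlag.INT8)
```

The point of this route is that the PyTorch side never has to hold real int8 weight tensors; TensorRT does the int8 conversion itself from the float weights plus the supplied ranges.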
It looks like you're not following the TensorRT documentation's workflow. If you convert the QAT model to "real int8" (i.e., fusing layers with torch.quantization.fuse_modules and then calling torch.quantization.convert), TensorRT does not support the result, and neither does ONNX export. That path creates fused modules such as conv+bn+relu and quantizes the weights to int8.
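To make the distinction concrete, here is a rough sketch of the eager-mode torch.quantization path (the toy model, layer names in the fusion list, and the training loop are assumptions for illustration); the key point is that export should happen on the fake-quantized model, before convert() turns the weights into real int8:

```python
import torch
import torch.nn as nn
import torch.quantization as tq

# Toy stand-in for the real network.
class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.quant = tq.QuantStub()
        self.conv1 = nn.Conv2d(3, 8, 3, padding=1)
        self.bn1 = nn.BatchNorm2d(8)
        self.relu1 = nn.ReLU()
        self.dequant = tq.DeQuantStub()

    def forward(self, x):
        x = self.quant(x)
        x = self.relu1(self.bn1(self.conv1(x)))
        return self.dequant(x)

model = TinyNet()

# Fuse conv+bn+relu into a single module (still float32 at this point).
model.eval()
model = tq.fuse_modules(model, [["conv1", "bn1", "relu1"]])

# Insert fake-quant observers for QAT; weights stay float32 and the
# observers only learn scales/zero-points during fine-tuning.
model.train()
model.qconfig = tq.get_default_qat_qconfig("fbgemm")
tq.prepare_qat(model, inplace=True)
# ... QAT fine-tuning loop would go here ...

# This is the step referred to above: convert() swaps in real int8
# modules and quantizes the weights, which ONNX export / TensorRT
# cannot consume. Keep the fake-quantized model for export instead.
int8_model = tq.convert(model.eval())
```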