How to use INT8 LSTM with TensorRT 7.0.0?

yanghao19930101 · December 24, 2019, 8:02am

I want to use LSTM with INT8 data type.

However, the original IRNNv2 layer only supports FP32 and FP16, not INT8, so I can't use the original API for an INT8 LSTM.

TensorRT 7.0.0 released a new plugin, but it also supports only FP16. The documentation says:
"The following section describes the new Persistent LSTM plugin. The Persistent LSTM plugin supports half-precision persistent LSTM."

IPluginV2 supports INT8, so I could use it to implement a custom INT8 LSTM layer. This seems to be the only way to get an INT8 LSTM.

Is it possible to implement a custom INT8 LSTM layer with IPluginV2? And is there a better way to do it?

SunilJB · December 24, 2019, 9:53am

Hi,

You can implement any custom plugin you want for your application/use-case.
For newer TRT versions, IPluginV2DynamicExt might be a better choice for a custom plugin.
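
As a rough illustration of what the kernel inside such a plugin would have to compute, here is a minimal sketch in plain Python (it does not use the TensorRT API). It quantizes the weights and input of a single LSTM gate to INT8 with symmetric per-tensor scales, accumulates the products as integers, and dequantizes the result. The helper names (`quantize`, `abs_max_scale`, `int8_gate_preactivation`) and the abs-max scaling are assumptions made for this example; in TensorRT the scales would normally come from calibration or from explicitly set dynamic ranges.

```python
import random


def quantize(values, scale):
    """Symmetric INT8 quantization: round(x / scale), clamped to [-127, 127]."""
    return [max(-127, min(127, round(v / scale))) for v in values]


def abs_max_scale(values):
    """Per-tensor scale that maps the largest magnitude to 127 (assumed scheme)."""
    m = max(abs(v) for v in values)
    return m / 127.0 if m > 0 else 1.0


def int8_gate_preactivation(W, x, bias):
    """Approximate W @ x + bias using INT8 operands and integer accumulation."""
    flat_w = [w for row in W for w in row]
    s_w, s_x = abs_max_scale(flat_w), abs_max_scale(x)
    x_q = quantize(x, s_x)
    out = []
    for row, b in zip(W, bias):
        row_q = quantize(row, s_w)
        # Integer multiply-accumulate, as an INT8 kernel would do in INT32.
        acc = sum(wq * xq for wq, xq in zip(row_q, x_q))
        # Dequantize with the product of both scales, then add the float bias.
        out.append(acc * s_w * s_x + b)
    return out


# Small random gate: 4 hidden units, 8 inputs (sizes chosen arbitrarily).
random.seed(0)
hidden, inputs = 4, 8
W = [[random.uniform(-1, 1) for _ in range(inputs)] for _ in range(hidden)]
x = [random.uniform(-1, 1) for _ in range(inputs)]
bias = [random.uniform(-0.1, 0.1) for _ in range(hidden)]

# Float reference versus the INT8 approximation.
ref = [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(W, bias)]
approx = int8_gate_preactivation(W, x, bias)
max_err = max(abs(a - r) for a, r in zip(approx, ref))
print("max abs error vs. float reference:", max_err)
```

A full LSTM cell repeats this for the four gates and for both the input and recurrent weights, then applies the sigmoid/tanh activations, which are usually kept in higher precision.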

Thanks

FusionYu · October 13, 2023, 2:20am

Is INT8 quantization of LSTM supported in the newer TensorRT 8 releases? If not, is there any plan to support INT8 LSTM? Thank you for your reply.
