QLinearConv implementation in TensorRT and onnx model conversion

neda · November 26, 2020, 5:50pm

Description

Hello, I am writing custom QLinearConv and QLinearMatMul layers in TensorRT so that I can export an already quantized model to TensorRT.

What I am not clear about is this: once I finish writing the plugins and registering them with the REGISTER_TENSORRT_PLUGIN API, as per the documentation, can I then export the quantized ONNX model to TensorRT like any model with supported layers?

Or is it more involved than that, so that I need to build the network in TensorRT myself with these additional plugins? In other words, if I have 20 QLinearConv layers in my graph, do I need to define the QLinearConv plugin 20 times, one for each layer, add each one at the correct place in the network, and then build the network?
If so, is there a more straightforward way of supporting these layers, such as modifying onnx_trt to add that support?
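
The difference between registering a plugin once and instantiating it once per layer can be pictured with a toy model in plain Python. This is only a sketch and not the TensorRT API: `Node`, `PluginInstance`, `PluginRegistry`, and `build_network` are hypothetical names. It assumes an importer that looks up a creator by op type for every node it visits. Whether the ONNX parser shipped with TensorRT 7.0 does this for QLinearConv is not confirmed in this thread.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Node:
    """Stand-in for one graph node (hypothetical, not the onnx API)."""
    name: str
    op_type: str


@dataclass
class PluginInstance:
    """Stand-in for one plugin layer placed in the network."""
    layer_name: str
    op_type: str


Creator = Callable[[str], PluginInstance]


class PluginRegistry:
    """Toy registry: one creator per op type, registered a single time."""

    def __init__(self) -> None:
        self._creators: Dict[str, Creator] = {}

    def register(self, op_type: str, creator: Creator) -> None:
        self._creators[op_type] = creator

    def get_creator(self, op_type: str) -> Optional[Creator]:
        return self._creators.get(op_type)


def build_network(nodes: List[Node], registry: PluginRegistry) -> List[PluginInstance]:
    """Walk the graph and ask the registry for a creator at each node."""
    layers = []
    for node in nodes:
        creator = registry.get_creator(node.op_type)
        if creator is None:
            raise ValueError(f"no plugin creator registered for {node.op_type}")
        # One plugin instance per node, all produced by the same creator.
        layers.append(creator(node.name))
    return layers


registry = PluginRegistry()
# Registered once, regardless of how many QLinearConv nodes the graph has.
registry.register("QLinearConv", lambda name: PluginInstance(name, "QLinearConv"))

graph = [Node(f"conv_{i}", "QLinearConv") for i in range(20)]
layers = build_network(graph, registry)
print(len(layers))  # 20 instances from a single registration
```

In this picture the creator is defined and registered once, and the 20 instances come from the loop over the graph. If the importer does not perform that lookup, the same loop has to be written by hand when building the network.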

Environment

TensorRT Version : 7.0.0.11
GPU Type : T4
Nvidia Driver Version : 440.33.01
CUDA Version : 10.2
CUDNN Version : 7605
Operating System + Version : Ubuntu 18.04.5 LTS
Python Version (if applicable) : 3.6.9
PyTorch Version (if applicable) : 1.6.0

AakankshaS · November 27, 2020, 6:51am

Hi @neda,
Please check the link below for reference.

Thanks!

