Question about converting onnx quantized model to tensorrt

neda · November 5, 2020, 8:55pm

Description

I am trying to convert an already quantized onnx model to tensorrt!
When I try to parse my quantized onnx network, I get the following error

In node 1 (parseGraph): UNSUPPORTED_NODE: No importer registered for op: QLinearConv.

In the list of Tensorrt supported onnx operators here https://github.com/onnx/onnx-tensorrt/blob/master/operators.md, I can see that QlinearConv is not supported.

Is there any guideline on how to convert quantized onnx model to trt?

Environment

TensorRT Version: 7.0.0.11
GPU Type: T4
Nvidia Driver Version: 440.33.01
CUDA Version: 10.2
CUDNN Version: 7605
Operating System + Version: Ubuntu 18.04.5 LTS
Python Version (if applicable): 3.6.9
PyTorch Version (if applicable): 1.6.0

Steps To Reproduce

    TRT_LOGGER = trt.Logger(trt.Logger.WARNING)
    EXPLICIT_BATCH = 1 << (int)(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)

    with trt.Builder(TRT_LOGGER) as builder, builder.create_network(EXPLICIT_BATCH) as network, trt.OnnxParser(network, TRT_LOGGER) as parser:
        builder.max_workspace_size = max_workspace_size
        builder.max_batch_size = 1

        if not parser.parse(model):
            for i in range(parser.num_errors):
                error = parser.get_error(i)
                print(error)

and the network is a quantized onnx model.

AakankshaS · November 8, 2020, 12:14pm

Hi @neda,
You will need to add custom plugin for the unsupported layer.
Please refer to the example below.

Thanks!

neda · November 9, 2020, 7:35pm

Thank you @AakankshaS!
I am reading through the docs and it is not clear to me whether it is possible to write/implement the costume layers all in python, or some parts of the custom layer creation need to necessarily happen in C++?

I am mainly referring to this sentence You can use the C++ API to create a custom layer, package the layer using pybind11 in Python, then load the plugin into a Python application, which is in the section 4.2 Adding Custom Layers Using The Python API. So the custom layer should be defined in c++ and we need a python wrapper to use it?
I would appreciate if you could answer!

Topic		Replies	Views
QLinearConv implementation in TensorRT and onnx model conversion TensorRT tensorrt	1	723	November 27, 2020
Convert ONNX model to TRT with custom layers TensorRT tensorrt	1	1594	December 24, 2020
Onnx model to TRT conversion error TensorRT	6	3350	April 15, 2022
Import pytorch model in TensorRT and add custom plugin layer TensorRT	7	5322	May 13, 2019
TensorRT conversion issues of ONNX model trained with Quantization Aware Training + custom quantization scale TensorRT tensorrt	5	1388	April 14, 2021
Creating a TensorRT engine from an ONNX model file TensorRT	1	6001	November 11, 2019
Importing convolution layers from onnx, with tensor inputs and tensor weights TensorRT	8	2207	October 12, 2021
onnx2trt - Depthwise Cross Correlation TensorRT	4	1778	July 12, 2020
Converting Pytorch model through ONNX: "UNSUPPORTED_NODE: No importer registered for op: Greater" TensorRT	2	936	October 12, 2021
Write converter for torch2trt for custom layer in pytorch and tensorrt TensorRT	6	1099	February 15, 2022

Question about converting onnx quantized model to tensorrt

Description

Environment

Steps To Reproduce

Related topics