I would like to get the TOP1 accuracy by quantizing an ONNX model with INT8 calibration, using JPEG validation images and the TensorRT C++ API
Environment
TensorRT Version: 8.4 EA
GPU Type: Jetson AGX Orin
Nvidia Driver Version:
CUDA Version: 11.4
CUDNN Version: 8.3.2.49
Operating System + Version: Ubuntu 20.04 LTS
Python Version (if applicable): 3.8
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag): Baremetal on Jetson AGX Orin
Hello, I would like a very general sample of how to calibrate an ONNX model in INT8 and run inference to get the TOP1 accuracy. I have already looked at the samples below, but none of them is clear and each has a different implementation:
SampleINT8
SampleINT8API
SampleOnnxMNIST
But my images are in JPEG format, not in the MNIST format, so I can’t use the MNISTBatchStream class as the samples do.
So could you please provide me with an example, or if you don’t have one, point me to something clear that I can use to solve my problem?
Hi,
Please share the ONNX model and the script, if not shared already, so that we can assist you better.
Alongside, you can try a few things:
1) Validate your model with the snippet below:
check_model.py
import onnx

filename = "yourONNXmodel.onnx"  # path to your ONNX model
model = onnx.load(filename)
onnx.checker.check_model(model)  # raises an exception if the model is invalid
2) Try running your model with the trtexec command.
In case you are still facing the issue, please share the trtexec "--verbose" log for further debugging.
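For reference, a typical invocation might look like the following (the model path is a placeholder):
trtexec --onnx=yourONNXmodel.onnx --int8 --verbose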
Thanks!
Thank you for your reply, but I have already seen these. As for the ONNX model, it is not that important because I need something general, but I will share a model with you anyway; please have a look below: D_resnet18-v1-7.onnx (44.7 MB)
As for trtexec, I already use it, but it does not report the TOP1 and TOP5 accuracy, so it is not a solution for me.
NOTE: I am using JetPack 5.0.1 DP, which ships TensorRT 8.4 EA.
No, there’s no “built-in” way to do this. You can do it yourself in multiple ways: a CPU-side top-K calculation on the outputs of the engine, modifying the parsed network and adding a TRT TopK layer through the TRT API, editing the ONNX graph itself using ONNX-GraphSurgeon to add a TopK node, etc. The first two options are sketched below.
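For the CPU-side option, here is a minimal sketch of a top-1 counter, assuming the engine outputs raw class scores that have been copied back to host memory; the function name countTop1 and its parameters are hypothetical:

#include <algorithm>
#include <vector>

// Count correct top-1 predictions in one batch.
// scores: flattened engine output (batchSize * numClasses floats).
// labels: ground-truth class index for each image in the batch.
int countTop1(const std::vector<float>& scores,
              const std::vector<int>& labels,
              int batchSize, int numClasses)
{
    int correct = 0;
    for (int i = 0; i < batchSize; ++i)
    {
        const float* first = scores.data() + i * numClasses;
        // The index of the highest score is the predicted class.
        int pred = static_cast<int>(std::max_element(first, first + numClasses) - first);
        if (pred == labels[i])
            ++correct;
    }
    return correct;
}

Accumulate the returned count over the whole validation set and divide by the number of images to get the TOP1 accuracy.

For the TRT API option, the relevant call is INetworkDefinition::addTopK. A sketch, assuming the parsed network has a single [N, numClasses] score output at index 0:

// Replace the raw score output with a top-1 index output.
nvinfer1::ITensor* scores = network->getOutput(0);
auto* topk = network->addTopK(*scores, nvinfer1::TopKOperation::kMAX,
                              1 /*k*/, 1u << 1 /*reduce over the class axis*/);
network->unmarkOutput(*scores);
network->markOutput(*topk->getOutput(1)); // output 1 holds the class indices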