ONNX Model INT8 Engine Build

Description

I’ve successfully built engines from prototxt files with INT8 calibration. I can also build engines from ONNX models with FP16. However, I’m getting an error with an ONNX model plus INT8 calibration. It sounds like a simple error, but I can’t find its source. The message is attached below:

Note: I’ve attached the files below. I’m aware that the calibration values are not correct; at this point I’m not focusing on the model’s accuracy, just on getting some INT8 runtimes.

Error Message

INFO:EngineBuilder:Using calibration cache file: tensorrt_scripts/calibrator_networks/pose_densenet121_body_calibration
[07/20/2022-17:09:52] [TRT] [E] 4: [standardEngineBuilder.cpp::initCalibrationParams::1398] Error Code 4: Internal Error (Calibration failure occurred with no scaling factors detected. This could be due to no int8 calibrator or insufficient custom scales for network layers. Please see int8 sample to setup calibration correctly.)
[07/20/2022-17:09:52] [TRT] [E] 2: [builder.cpp::buildSerializedNetwork::620] Error Code 2: Internal Error (Assertion engine != nullptr failed. )

Environment

Jetson AGX Orin with JetPack 5.0.1; specifically, TensorRT 8.4.0.

Relevant Files

I’m using the prototxt files and ONNX models from the jetson-inference repo.
engine_building.py (5.2 KB)
pose_densenet121_body_calibration (8.5 KB)
pose_densenet121_body.onnx (79.4 MB)

Steps to Reproduce

I’ve attached the code and the model in case you need to reproduce the issue. Please don’t forget to update the file paths.

Hi,
Please share the ONNX model and the script, if you haven’t already, so that we can assist you better.
Meanwhile, you can try a few things:

  1. Validate your model with the snippet below.

check_model.py

import onnx

# Path to your ONNX model -- update as needed
filename = "pose_densenet121_body.onnx"

model = onnx.load(filename)
onnx.checker.check_model(model)  # raises onnx.checker.ValidationError if the model is invalid
  2. Try running your model with the trtexec command, as shown below.
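For instance, something along these lines should work (the file names are taken from your attachments; adjust the paths and flags to your setup):

trtexec --onnx=pose_densenet121_body.onnx --int8 --calib=pose_densenet121_body_calibration --saveEngine=pose_densenet121_body.engine --verbose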

If you are still facing the issue, please share the trtexec --verbose log for further debugging.
Thanks!

I believe I shared everything you requested at the top. Was this an auto-message, or is there anything unclear in my post?

Thanks!

Hi,

Could you please give us more details on how you generated the calibration cache?

This error can occur if the calibration cache was generated incorrectly.
Please make sure the calibration algorithm runs before layer fusions occur, so that every tensor has a scale attached to it. Calibrating an FP32/FP16 model on some data generates a scaling factor for each layer of the network.

Please refer to the following sample.

https://github.com/NVIDIA/TensorRT/tree/main/samples/sampleINT8#calibration-file
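
In case a Python reference is useful, below is a minimal sketch of wiring an entropy calibrator into the builder config with the TensorRT Python API. The class name, batch source, and sizes are placeholders, not the exact code from your script:

import numpy as np
import pycuda.autoinit  # creates a CUDA context
import pycuda.driver as cuda
import tensorrt as trt

CACHE_FILE = "pose_densenet121_body_calibration"  # your cache path

class EntropyCalibrator(trt.IInt8EntropyCalibrator2):
    def __init__(self, batches, batch_size, input_nbytes):
        super().__init__()
        self.batches = iter(batches)  # iterable of contiguous float32 arrays
        self.batch_size = batch_size
        self.device_input = cuda.mem_alloc(input_nbytes)

    def get_batch_size(self):
        return self.batch_size

    def get_batch(self, names):
        try:
            data = next(self.batches)
        except StopIteration:
            return None  # tells TensorRT the calibration data is exhausted
        cuda.memcpy_htod(self.device_input, np.ascontiguousarray(data))
        return [int(self.device_input)]

    def read_calibration_cache(self):
        # Reuse an existing cache so the data set is only needed once
        try:
            with open(CACHE_FILE, "rb") as f:
                return f.read()
        except FileNotFoundError:
            return None

    def write_calibration_cache(self, cache):
        with open(CACHE_FILE, "wb") as f:
            f.write(cache)

# Both of these must be set on the builder config; if the calibrator is
# missing and no per-layer scales are provided, the build fails with
# "no scaling factors detected":
# config.set_flag(trt.BuilderFlag.INT8)
# config.int8_calibrator = EntropyCalibrator(batches, batch_size, input_nbytes)

Also note that a cache recorded for one network (e.g., the prototxt variant) may not match the tensor names in the ONNX graph, in which case no scales are found; regenerating the cache against the ONNX network is the safer path.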

Thank you.