Enable INT8 mode for a YOLO/ONNX model in DeepStream

🖥️ Environment

  • DeepStream version: 7.0.0

  • TensorRT version (C++/DeepStream): 8.6.1.6

  • TensorRT version (Python): 10.13.2.6 (note: mismatch, but DS links against 8.6.1.6)

  • CUDA version: 11.5 (from nvcc)

  • GPU driver: 535.230.02

  • GPU: NVIDIA RTX A4000 (16GB)

  • CUDA reported by driver: 12.2

  • OS: Ubuntu 22.04.5 LTS, kernel 6.8.0-65-generic

Implementation

  1. Model: Custom-trained YOLOv8 (Ultralytics export)

  2. Export:

    yolo export model=custom_yolov8.pt format=onnx opset=12
    
    
  3. ONNX opset: 12

  4. Precision: Trying INT8 (with calibration table)

[property]
net-scale-factor=0.0039215697906911373
model-color-format=0
int8-calib-file=/home/proglint-ai10/interns/Optimization/quantisation/Quantization-YOLOv8/calib.table
onnx-file=checkout.onnx
labelfile-path=cash_checkout.txt
network-mode=1                 # INT8
num-detected-classes=18
interval=0
gie-unique-id=3
process-mode=1
network-type=0
network-input-order=0          # NCHW
cluster-mode=2
maintain-aspect-ratio=1
scaling-filter=1
symmetric-padding=1
offsets=114;114;114
parse-bbox-func-name=NvDsInferParseYolo
custom-lib-path=../nvdsinfer_custom_impl/libnvdsinfer_custom_impl_Yolo.so

[class-attrs-all]
nms-iou-threshold=0.50
topk=300
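
Before the build, a quick sanity check that every file the config references actually resolves (a minimal sketch using the paths above; relative paths in an nvinfer config resolve against the config file's directory):

    # Verify the files nvinfer will try to open (paths copied from the config)
    ls -l checkout.onnx cash_checkout.txt \
       /home/proglint-ai10/interns/Optimization/quantisation/Quantization-YOLOv8/calib.table \
       ../nvdsinfer_custom_impl/libnvdsinfer_custom_impl_Yolo.so
    # Confirm the custom parser links against the same TensorRT that DeepStream uses
    ldd ../nvdsinfer_custom_impl/libnvdsinfer_custom_impl_Yolo.so | grep nvinfer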

ERROR: ../nvdsinfer/nvdsinfer_model_builder.cpp:1129 Build engine failed from config file
ERROR: ../nvdsinfer/nvdsinfer_model_builder.cpp:821 failed to build trt engine.
ERROR: NvDsInferContextImpl::buildModel() build engine file failed
ERROR: NvDsInferContextImpl::generateBackendContext() build backend context failed
ERROR: NvDsInferContextImpl::initialize() generate backend failed, check config file settings
WARN  : error: Failed to create NvDsInferContext instance
NvDsInfer Error: NVDSINFER_CONFIG_FAILED

Can anyone help me debug this? I’ve followed everything available on GitHub and in the forums, but nothing seems to work and I don’t know what I’m missing.

Please use “trtexec” to check your ONNX model and the calibration file first.
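
For example, something like this (a sketch; it assumes the trtexec binary from the same TensorRT 8.6.1.6 installation that DeepStream links against, and reuses the file names from the config above):

    # Rebuild the same INT8 engine outside DeepStream; --verbose exposes the
    # layer-level reason when the build fails
    trtexec --onnx=checkout.onnx \
            --int8 \
            --calib=/home/proglint-ai10/interns/Optimization/quantisation/Quantization-YOLOv8/calib.table \
            --saveEngine=checkout_int8.engine \
            --verbose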

DeepStream 7.0 GA is based on CUDA 12.2. Please make sure you have followed the DeepStream compatibility requirements (Installation — DeepStream documentation); note that your nvcc reports CUDA 11.5, which does not match.
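
To compare the toolkit and driver versions on the machine (standard CUDA utilities, nothing DeepStream-specific):

    # CUDA toolkit seen by nvcc (the environment above reports 11.5 here)
    nvcc --version
    # Highest CUDA version the installed driver supports (12.2 here)
    nvidia-smi
    # CUDA toolkit packages installed via apt, if any
    dpkg -l | grep cuda-toolkit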

Does exporting TensorRT INT8 engine directly from Ultralytics work?
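
For reference, the direct export meant here would look roughly like this (a sketch based on my reading of the Ultralytics export CLI; the int8 and data arguments are assumptions to verify against their documentation):

    # Hypothetical direct TensorRT INT8 export via Ultralytics; the int8/data
    # arguments are assumed from the Ultralytics docs, not verified here
    yolo export model=custom_yolov8.pt format=engine int8=True data=calib_dataset.yaml

Note that an engine exported this way would be built by the Python-side TensorRT (10.13.2.6 above), and TensorRT engines are in general only loadable by the version that built them, so DeepStream’s 8.6.1.6 would not accept it anyway.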

Please consult the authors of Ultralytics about their own export workflow.

You can verify the ONNX model and your calibration file with “trtexec” to pinpoint whether the issue is in the model itself.
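
One concrete thing to look at: the first line of a TensorRT calibration cache records the TensorRT version and calibration algorithm that produced it, and as far as I know a cache written by a different major version (such as the Python-side TensorRT 10.13.2.6 listed above) may be rejected by the 8.6 builder. A minimal check:

    # The header looks like "TRT-8601-EntropyCalibration2"; the embedded version
    # should correspond to the TensorRT that DeepStream uses (8.6.1.6)
    head -n 1 /home/proglint-ai10/interns/Optimization/quantisation/Quantization-YOLOv8/calib.table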

There has been no update from you for a while, so we assume this is no longer an issue and are closing this topic. If you need further support, please open a new one. Thanks.
