Converting a custom yolo_model.onnx to int8 engine

Hardware Platform (Jetson / GPU): Orin Nano
DeepStream Version: 6.3
JetPack Version (valid for Jetson only): 5.1.2-b104
TensorRT Version: 8.5.2-1+cuda11.4
Issue Type: Question

I have a working yolo_v4_tiny ONNX model. When DeepStream runs, it converts the model to an FP16 engine, but this pushes the 6 GB RAM limit of the Jetson Orin Nano and the pipeline slows down or crashes.

I would like to create an INT8 engine from model.onnx. What are the steps for the easiest way to do this? Best regards

You can use the command below to convert the ONNX model to an engine file. You need the calibration cache file generated during the calibration process:

```
trtexec --onnx=<model.onnx> --saveEngine=<model_int8.engine> --int8 --calib=<calibration.cache>
```

Thank you. I will run it on nvcr.io/nvidia/tensorrt:24.01-py3-igpu. How can I create the calibration.cache? Many thanks :)

DeepStream is only an inference framework. There is no calibration function in DeepStream.

There is an INT8 calibration API in TensorRT. Please see the Developer Guide :: NVIDIA Deep Learning TensorRT Documentation and scroll to chapter 7.3.
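To give an idea of what that chapter covers, below is a minimal Python sketch of an IInt8EntropyCalibrator2 that produces calibration.cache while building an INT8 engine. The calib_images folder, the 3x416x416 input shape, the batch size of 1, and the simple resize/normalize preprocessing are all assumptions; use your own representative images and the same preprocessing your model expects.

```
import os

import numpy as np
import pycuda.driver as cuda
import pycuda.autoinit  # noqa: F401 -- creates a CUDA context
import tensorrt as trt


class YoloCalibrator(trt.IInt8EntropyCalibrator2):
    """Feeds preprocessed images to TensorRT and writes calibration.cache."""

    def __init__(self, image_dir, cache_file="calibration.cache", input_shape=(3, 416, 416)):
        trt.IInt8EntropyCalibrator2.__init__(self)
        self.cache_file = cache_file
        self.input_shape = input_shape
        self.files = sorted(os.path.join(image_dir, f) for f in os.listdir(image_dir))
        self.index = 0
        # One device buffer for a single image (batch size 1 assumed)
        self.device_input = cuda.mem_alloc(int(np.prod(input_shape)) * np.dtype(np.float32).itemsize)

    def get_batch_size(self):
        return 1

    def get_batch(self, names):
        if self.index >= len(self.files):
            return None  # no more data -> calibration finishes
        img = self._load(self.files[self.index])
        self.index += 1
        cuda.memcpy_htod(self.device_input, np.ascontiguousarray(img, dtype=np.float32))
        return [int(self.device_input)]

    def _load(self, path):
        # Placeholder preprocessing: replace with the same resize/normalization
        # that your model was trained with and that DeepStream is configured for.
        import cv2
        c, h, w = self.input_shape
        img = cv2.resize(cv2.imread(path), (w, h)).astype(np.float32) / 255.0
        return img.transpose(2, 0, 1)

    def read_calibration_cache(self):
        if os.path.exists(self.cache_file):
            with open(self.cache_file, "rb") as f:
                return f.read()
        return None

    def write_calibration_cache(self, cache):
        with open(self.cache_file, "wb") as f:
            f.write(cache)


# Build once with INT8 enabled; TensorRT calls the calibrator and writes the cache.
logger = trt.Logger(trt.Logger.INFO)
builder = trt.Builder(logger)
network = builder.create_network(1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)
with open("model.onnx", "rb") as f:
    parser.parse(f.read())
config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.INT8)
config.int8_calibrator = YoloCalibrator("calib_images")
serialized = builder.build_serialized_network(network, config)
with open("model_int8.engine", "wb") as f:
    f.write(serialized)
```

Once calibration.cache exists, you can pass it to the trtexec command above. Note that a TensorRT engine is tied to the GPU and TensorRT version it was built with, so the final engine for DeepStream 6.3 (TensorRT 8.5.2) should be built on the Orin Nano itself; the calibration cache can generally be reused across builds.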

For more on TensorRT calibration API usage, please refer to the TensorRT forum: Latest Deep Learning (Training & Inference)/TensorRT topics - NVIDIA Developer Forums