Complete setup information:
• Hardware Platform (Jetson / GPU): GPU
• DeepStream Version: 8.0
• TensorRT Version: 10.9.0.34-1+cuda12.8
• NVIDIA GPU Driver Version (valid for GPU only): 570
• Issue Type( questions, new requirements, bugs): question
The ONNX model file used is yolov11s_qat_int8_672_dynamic.onnx, taken from the yolo_deepstream directory of the NVIDIA-AI-IOT/deepstream_tools repository on GitHub (main branch).
In brief:
Building the engine file with the TensorRT trtexec binary → an INT8 engine file is created.
When DeepStream builds the engine file itself (because no engine file is present) → it falls back to FP16, reporting that no calibration file is specified. (This is an explicitly quantized model; see the detailed explanation below.)
In detail:
Building the engine file with TensorRT → the INT8 engine file is created correctly. The command used is: /usr/src/tensorrt/bin/trtexec --onnx=yolov11s_qat_dynamic.onnx --int8 --fp16 --saveEngine=yolov11s_qat_dynamic.onnx_b1_gpu0_int8.engine
However, when DeepStream builds the engine file (because the engine file is not present), it reads network-mode=1 (INT8) from the config file, reports that no calibration file is specified, and falls back to FP16. This ONNX model does not come with a calibration file because it is explicitly quantized.
The config file is taken from deepstream_tools/yolo_deepstream/deepstream_yolo/config_infer_primary_yoloV11.txt (main branch of NVIDIA-AI-IOT/deepstream_tools on GitHub) and edited. It is uploaded here: config.txt (3.8 KB). I used the deepstream-test1 sample app to have DeepStream create the engine file automatically; the config file for the app is uploaded here: dstest1_config.yml.txt (1.1 KB) (renamed to .txt because .yml is not a supported file type in forum posts).
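To make the relevant settings concrete, here is a minimal Python sketch that reads the precision-related keys from an nvinfer-style config. The keys `network-mode`, `model-engine-file`, `onnx-file`, and `int8-calib-file` are standard nvinfer properties, but the sample text below is a hypothetical stand-in built from the file names in this post, not the contents of the attached config.txt. The helper name `summarize_precision` is also just for illustration.

```python
import configparser

# nvinfer network-mode values: 0 = FP32, 1 = INT8, 2 = FP16
NETWORK_MODES = {0: "fp32", 1: "int8", 2: "fp16"}


def summarize_precision(config_text: str) -> dict:
    """Extract the precision-related keys from the [property] group."""
    parser = configparser.ConfigParser(interpolation=None, strict=False)
    parser.read_string(config_text)
    prop = parser["property"]
    mode = prop.getint("network-mode", fallback=0)  # nvinfer defaults to FP32
    return {
        "precision": NETWORK_MODES.get(mode, "unknown"),
        "engine_file": prop.get("model-engine-file", fallback=None),
        "calib_file": prop.get("int8-calib-file", fallback=None),
    }


# Hypothetical sample, NOT the attached config.txt
sample = """
[property]
onnx-file=yolov11s_qat_dynamic.onnx
model-engine-file=yolov11s_qat_dynamic.onnx_b1_gpu0_int8.engine
network-mode=1
"""

summary = summarize_precision(sample)
print(summary)
```

With a config like this, INT8 is requested and no calibration file is set, which is the combination that triggers the fallback described above.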
Question: How can DeepStream be configured to generate an INT8 engine file in this case?
Reason for this question: If the engine file was accidentally not created before launching the DeepStream app, DeepStream falls back to FP16 when it builds the engine. It was also observed in the past that the engine is then rebuilt on every launch: the app first looks for the engine file path given in the config file, which is ...int8.engine, and because the saved file is ...fp16.engine, that path never exists, so the same engine is built again.
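As a stopgap for the "engine accidentally not created" case, a pre-launch guard could look roughly like the Python sketch below. It does not change how DeepStream builds engines; it only checks for the engine file and, if it is missing, runs the same trtexec command quoted in this post. The function names and the `dry_run` switch are hypothetical, and the flags are exactly the ones used above (no shape or batch options are added, so adjust them if your model needs them).

```python
import subprocess
from pathlib import Path

TRTEXEC = "/usr/src/tensorrt/bin/trtexec"  # path used in this post


def trtexec_command(onnx_path: str, engine_path: str) -> list:
    """Build the trtexec argument list with the flags from this post."""
    return [
        TRTEXEC,
        f"--onnx={onnx_path}",
        "--int8",
        "--fp16",
        f"--saveEngine={engine_path}",
    ]


def ensure_engine(onnx_path: str, engine_path: str, dry_run: bool = False) -> bool:
    """Return True if the engine was missing and a build was needed."""
    if Path(engine_path).is_file():
        return False  # engine already present, nothing to do
    cmd = trtexec_command(onnx_path, engine_path)
    print("Engine missing, build command:", " ".join(cmd))
    if not dry_run:
        subprocess.run(cmd, check=True)  # raises if trtexec fails
    return True


# Dry run: only reports what would be executed
needed = ensure_engine(
    "yolov11s_qat_dynamic.onnx",
    "yolov11s_qat_dynamic.onnx_b1_gpu0_int8.engine",
    dry_run=True,
)
print("build needed:", needed)
```

Calling `ensure_engine(...)` without `dry_run=True` before starting the app would make sure the ...int8.engine path from the config exists, so DeepStream should load it instead of building an FP16 one.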
Please help.