I would like to ask how to get an INT8 model.

I trained a YOLOv4 model with a 1536×1536 input, but FP16 only runs at 15 FPS.
Is there a simple way to switch to INT8 to make it faster?
Please give me some instructions on how to do that.

Thank you.

Hi,

Which framework do you use for inference?
If it is TensorRT, you can create an INT8 engine with the following command directly:

$ /usr/src/tensorrt/bin/trtexec --int8 ...
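For reference, here is a minimal sketch of the same INT8 build through the TensorRT (8.x) Python API; the ONNX file name and the helper function are placeholders rather than anything from this thread. As far as I understand, an INT8 engine built without a calibrator (or without a cache passed to trtexec via --calib) runs with placeholder scales and is mainly useful for measuring performance, not accuracy.

import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.INFO)

def build_int8_engine(onnx_path, calibrator=None):
    # Parse the ONNX model into a TensorRT network definition.
    builder = trt.Builder(TRT_LOGGER)
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    parser = trt.OnnxParser(network, TRT_LOGGER)
    with open(onnx_path, "rb") as f:
        if not parser.parse(f.read()):
            for i in range(parser.num_errors):
                print(parser.get_error(i))
            return None

    # Enable INT8 and keep FP16 as a fallback for layers without INT8 support.
    config = builder.create_builder_config()
    config.set_flag(trt.BuilderFlag.INT8)
    config.set_flag(trt.BuilderFlag.FP16)
    if calibrator is not None:
        config.int8_calibrator = calibrator  # needed for real INT8 scales

    return builder.build_serialized_network(network, config)  # serialized engine bytes

Usage would be something like build_int8_engine("yolov4_1536.onnx", calibrator), then writing the returned bytes to an .engine file.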

Thanks.


I convert YOLOv4 to ONNX and then to TensorRT, and it works well with FP16,
and I can create an INT8 engine successfully, but it cannot recognize any objects in the image.
I am not sure whether it is an input-size problem (608×608?).
My model should be 1536×1536…

Thanks.

I am sure it is not an input problem, and the INT8 engine cannot find any objects after it is created from ONNX.
My model is YOLOv4.
Does it need a cal_trt.bin file?

Thanks.

Could anybody answer my questions about YOLOv4 INT8?

Thanks.

Hi,

Sorry for the late update.
If your app works well with the FP16 model, then the issue likely comes from the calibration file.

Do you use the official YOLOv4 model?
If yes, you can try the calibration cache from our user:
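If you already have a cache file, a common pattern is a calibrator that only serves that cache, so no calibration images are needed at build time. Below is a sketch under that assumption; the class name and cache path are made up. trtexec can also load an existing cache via --calib=<file>.

import os
import tensorrt as trt

class CacheOnlyCalibrator(trt.IInt8EntropyCalibrator2):
    # Serves a pre-generated calibration cache instead of real batches.
    def __init__(self, cache_file):
        trt.IInt8EntropyCalibrator2.__init__(self)
        self.cache_file = cache_file

    def get_batch_size(self):
        return 1

    def get_batch(self, names):
        return None  # no calibration data; TensorRT falls back to the cache below

    def read_calibration_cache(self):
        if os.path.exists(self.cache_file):
            with open(self.cache_file, "rb") as f:
                return f.read()
        return None

    def write_calibration_cache(self, cache):
        with open(self.cache_file, "wb") as f:
            f.write(cache)

Such a calibrator can then be passed as the calibrator argument of the build sketch above (config.int8_calibrator).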

Thanks.

Yes, I use the official YOLOv4-tiny, but it is a model trained with a 1536×1536 input.
Can I use yolov4-tiny-406-calib_cache directly for INT8?

Thanks!

Hi,

Since the input size is different, it’s recommended to re-generate a calibration cache.
You can find an example on GitHub as well:
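Below is a rough sketch of re-generating a calibration cache for the 1536×1536 input with an IInt8EntropyCalibrator2; the paths, class name, and preprocessing are assumptions and must be changed to match your inference pipeline exactly.

import os
import glob
import cv2
import numpy as np
import pycuda.driver as cuda
import pycuda.autoinit  # creates a CUDA context
import tensorrt as trt

class Yolov4Calibrator(trt.IInt8EntropyCalibrator2):
    # Feeds real preprocessed images to TensorRT and writes the resulting cache.
    def __init__(self, image_dir, cache_file, input_shape=(3, 1536, 1536), batch_size=1):
        trt.IInt8EntropyCalibrator2.__init__(self)
        self.cache_file = cache_file
        self.batch_size = batch_size
        self.input_shape = input_shape
        self.images = sorted(glob.glob(os.path.join(image_dir, "*.jpg")))
        self.index = 0
        # Device buffer large enough for one calibration batch (float32 = 4 bytes).
        self.device_input = cuda.mem_alloc(
            int(batch_size * np.prod(input_shape)) * 4)

    def preprocess(self, path):
        # Placeholder preprocessing: must match what your inference code does
        # (resize to 1536x1536, BGR->RGB, scale to [0, 1], CHW layout).
        img = cv2.imread(path)
        img = cv2.resize(img, (self.input_shape[2], self.input_shape[1]))
        img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB).astype(np.float32) / 255.0
        return img.transpose(2, 0, 1)

    def get_batch_size(self):
        return self.batch_size

    def get_batch(self, names):
        if self.index + self.batch_size > len(self.images):
            return None  # no more images: calibration is done
        batch = np.stack([
            self.preprocess(p)
            for p in self.images[self.index:self.index + self.batch_size]
        ]).astype(np.float32)
        cuda.memcpy_htod(self.device_input, np.ascontiguousarray(batch))
        self.index += self.batch_size
        return [int(self.device_input)]

    def read_calibration_cache(self):
        if os.path.exists(self.cache_file):
            with open(self.cache_file, "rb") as f:
                return f.read()
        return None

    def write_calibration_cache(self, cache):
        with open(self.cache_file, "wb") as f:
            f.write(cache)

Building the INT8 engine with this calibrator over a set of representative images (a few hundred is usually enough) writes the new cache, which can then be reused later, for example through trtexec --calib.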

Thanks.

Thank you, I will study that.
