What's the difference between a model's .trt and .engine files?

Hello,
I used tensorrt_demos to convert my custom yolov4.weights to ONNX and then to an INT8 engine.
I got a model named yolov4-608-int8.trt, but it cannot be run by the test5 app.
If I convert it with ./trtexec instead, I get an engine named xxx.engine, and it works well, but only in FP16.
Could you tell me what the difference between .trt and .engine files is?
What should I do?

thanks,

And when I run:
./trtexec --onnx=yolov4_-1_3_608_608_dynamic.onnx --explicitBatch --saveEngine=yolov4_608.engine --best --verbose --calib=calib_yolov4-int8-608.bin

it shows an error like the one below:

Hi,

.trt and .engine are both TensorRT engine files.
Since the engine is just serialized data, the file extension is simply whatever name the user chooses.
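
For example, here is a minimal sketch of deserializing either file with the TensorRT Python API (the file names are the ones from your post; both extensions load the same way):

import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
runtime = trt.Runtime(logger)

# The extension does not matter; the content is a serialized engine either way.
with open("yolov4-608-int8.trt", "rb") as f:   # or "yolov4_608.engine"
    engine = runtime.deserialize_cuda_engine(f.read())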

The error you met comes from the INT8 calibrator.
Did you generate the calibration cache with YOLOv4 on Xavier directly?
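
If you build the engine with your own script on Xavier, the cache file is written by the INT8 calibrator. Below is a rough sketch of an IInt8EntropyCalibrator2; the class name, batch handling, and sizes are only illustrative placeholders, not the exact tensorrt_demos code:

import numpy as np
import pycuda.driver as cuda
import pycuda.autoinit  # initializes the CUDA context
import tensorrt as trt

class YoloEntropyCalibrator(trt.IInt8EntropyCalibrator2):
    def __init__(self, batches, cache_file="calib_yolov4-int8-608.bin"):
        super().__init__()
        self.batches = iter(batches)            # iterable of (1, 3, 608, 608) float32 arrays
        self.cache_file = cache_file
        self.d_input = cuda.mem_alloc(1 * 3 * 608 * 608 * 4)

    def get_batch_size(self):
        return 1

    def get_batch(self, names):
        try:
            batch = next(self.batches)
        except StopIteration:
            return None                         # no more calibration data
        cuda.memcpy_htod(self.d_input, np.ascontiguousarray(batch))
        return [int(self.d_input)]

    def read_calibration_cache(self):
        try:
            with open(self.cache_file, "rb") as f:
                return f.read()                 # reuse an existing cache if present
        except FileNotFoundError:
            return None

    def write_calibration_cache(self, cache):
        with open(self.cache_file, "wb") as f:
            f.write(cache)                      # this is the .bin passed to --calib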

Thanks.

Hi,
I used tensorrt_demos to generate my INT8 .trt file with around 10,000 calibration images.
I changed the class names and number of classes for my own model.
The file works well when I test it with python3, but when I put the model into test5, the pipeline cannot open it.
I tried to use the new INT8 cache file (.bin) with the command above in my training directory (./trtexec xxxxxxxxx --int8 --calib), and it shows the error.
What do you mean by generating the cache with YOLOv4 on Xavier directly?
Please give me some advice so I can continue my INT8 work…

Thank you very much!!

Hi,

It looks like you are facing an issue similar to the one in the following comment:

That issue can be fixed by updating the input layer name from 000_net to data.
Could you check whether the same solution also works for you?
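
If it helps, here is a minimal sketch of renaming the input with the onnx Python package (the output file name is only an example):

import onnx

model = onnx.load("yolov4_-1_3_608_608_dynamic.onnx")
old_name, new_name = "000_net", "data"

# Rename the graph input itself
for graph_input in model.graph.input:
    if graph_input.name == old_name:
        graph_input.name = new_name

# Update every node that consumes the old input name
for node in model.graph.node:
    node.input[:] = [new_name if name == old_name else name for name in node.input]

onnx.save(model, "yolov4_-1_3_608_608_dynamic_data.onnx")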

Thanks.

Yes, it was that problem; INT8 works now.
But a new problem: why is the INT8 inference time slower than FP16?


Hi,

Have you maximized the device performance?
If not, could you do so and try it again?

$ sudo nvpmodel -m 0
$ sudo jetson_clocks
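
You can check the currently active power mode with:

$ sudo nvpmodel -q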

Thanks.

Yes, I already use those.
But my commands were:
sudo nvpmodel -m 2
sudo jetson_clocks

And when I try the command:
sudo nvpmodel -m 0
it prints this:
NVPM WARN: patching tpc_pg_mask: (0x1:0x4)
NVPM WARN: patched tpc_pg_mask: 0x4

Is this correct?

Thank you

Hi,
I rebuilt the TensorRT engine after setting nvpmodel to mode 0,
and INT8 is now faster than before, so I can use it normally. I had thought mode 2 was more powerful… :((((
Thank you very much!

I would also like to ask a new question.
My custom model detects multiple object classes and I have to test and modify it over a long time,
so sometimes I use YOLO Tiny for quick experiments.
If I use 608x608 on both the pgie and the sgie, DeepStream works well, but with 1536x1536 on the pgie and sgie, DeepStream hangs after 20-30 minutes and reports a scaling problem on the sgie!
Could you advise a way to get it working?

Thanks!

Hi,

Mode 2 enables more CPU cores, but mode 0 has a higher GPU frequency.
You can find more details in the document below:

https://docs.nvidia.com/jetson/l4t/index.html#page/Tegra%20Linux%20Driver%20Package%20Development%20Guide/power_management_jetson_xavier.html#wwpID0E0PT0HA

For the new question, would you mind opening a new topic for it?
Thanks.
