Why is the model engine file so much larger with TensorRT 7.1 compared to 5.1 or 6.0?

SB_97 · July 6, 2020, 1:28pm

Hi

I’m using Jetpack on the Jetson Nano to run a TensorRT version of YOLO. I’m finding that the serialised model engine is much larger with TensorRT 7.1 (Jetpack 4.4 DP) compared to TensorRT 6.0 (Jetpack 4.3) and TensorRT 5.1 (Jetpack 4.2.1), even though the model is the same in each case.

The sizes are as follows:
TensorRT 5.1: 184 MB
TensorRT 6.0: 184 MB
TensorRT 7.1: 302 MB.

I initially thought that perhaps a kFLOAT model was being created instead of the kHALF model that I wanted. However, when I generated a kFLOAT model using TensorRT 7.1 it was 600 MB, roughly double the 302 MB file, so that doesn’t seem to be the cause of what I’m seeing. The performance of the three models is similar in the few tests that I have run.
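As a quick sanity check on that reasoning, the sizes quoted above can be compared as ratios. This is only an illustrative sketch: the helper names `sizeRatio` and `printEngineSizeRatios` are made up for this example, and the numbers are the ones reported in this post rather than anything measured by the code.

```cpp
#include <cassert>
#include <cstdio>

// Hypothetical helper: ratio of two engine sizes given in MB.
// The denominator must be positive.
double sizeRatio(double numeratorMB, double denominatorMB) {
    assert(denominatorMB > 0.0);
    return numeratorMB / denominatorMB;
}

// Prints the ratios between the engine sizes reported in this post.
void printEngineSizeRatios() {
    const double trt6HalfMB  = 184.0;  // TensorRT 5.1 / 6.0, kHALF
    const double trt7HalfMB  = 302.0;  // TensorRT 7.1, kHALF
    const double trt7FloatMB = 600.0;  // TensorRT 7.1, kFLOAT

    // If the 302 MB engine were really kFLOAT, this ratio would be close to 1.
    std::printf("7.1 kFLOAT / 7.1 kHALF: %.2f\n", sizeRatio(trt7FloatMB, trt7HalfMB));

    // Growth of the kHALF engine between TensorRT 6.0 and 7.1.
    std::printf("7.1 kHALF  / 6.0 kHALF: %.2f\n", sizeRatio(trt7HalfMB, trt6HalfMB));
}
```

The first ratio comes out at about 2, which fits the 302 MB engine holding half-precision weights. The second is about 1.6, so the growth between versions is not explained by a silent fallback to full precision.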

I’d appreciate it if anyone could shed light on this.
Thanks!

foobar.warren · July 6, 2020, 4:29pm

I’m no expert, but if it’s not the precision, then it must be that the weights have changed.

AastaLLL · July 7, 2020, 3:13am

Hi,

The contents of a TensorRT engine depend on the inference algorithms chosen when it is built.
Since we introduced many new accelerations in this release, the file size will be different.

Would you mind sharing the model and the command to reproduce this issue?
We would like to report this issue to our internal team first.

Thanks.

SB_97 · July 7, 2020, 6:47am

Thanks @AastaLLL. The command used to build the model is IBuilder::buildEngineWithConfig().

Is it the serialised plan file that you want or the code that was used to generate the model? The code is based on Nvidia sample code from a trt-yolo sample app that came with the deepstream 3 repository. I don’t think it’s available online any more. It builds up the yolo model from scratch based on a yolo config file. I have made some changes from the original sample though.

AastaLLL · July 8, 2020, 2:38am

Thanks for the information.
We will try it and share more information with you later.

Thanks.

SB_97 · July 8, 2020, 6:53am

OK - thank you. If it would be helpful I could probably condense my code into a sample app that would build the model under the different TensorRT versions for comparison.
