Model file size on Jetson Nano with 6.0-full-dims is larger than on a desktop PC with 6.0

I find that the detection model (49 MB) converted using ONNX-TensorRT 6.0 on a desktop PC is slightly larger than the source ONNX model (44 MB). However, after converting the same ONNX model (44 MB) on the Jetson Nano using ONNX-TensorRT 6.0-full-dims, the output TensorRT model file grows to 170 MB. What causes this significant increase in model file size on the Jetson Nano?

The command I am using is:

onnx2trt -o detection_model.trt -b 1 -d 16 -l model.onnx

Here, I have set the max batch size to 1 and the model data type to float16.
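
For reference, below is a rough sketch of what I understand the onnx2trt command to be doing under the hood with the TensorRT 6 Python API (my reading of it, not the actual onnx2trt source; the file names match the command above):

import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)

# The full-dims branch builds an explicit-batch network.
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)

with open("model.onnx", "rb") as f:
    if not parser.parse(f.read()):
        raise RuntimeError(str(parser.get_error(0)))

builder.max_batch_size = 1   # -b 1
builder.fp16_mode = True     # -d 16: allow FP16 kernels

engine = builder.build_cuda_engine(network)
with open("detection_model.trt", "wb") as f:
    f.write(engine.serialize())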

The onnx-tensorrt code I am using is the 6.0-full-dims branch of the onnx-tensorrt repository.

The Jetson Nano is running JetPack 4.3 with TensorRT 6.

Hi,

The TensorRT versions for Jetson and for desktop usually differ in their minor version,
so the output file sizes will differ as well.
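
As a quick check, you can print the installed TensorRT version on each machine:

import tensorrt as trt
print(trt.__version__)  # e.g. 6.x.x on JetPack 4.3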

Thanks.

Thanks for the prompt reply. I want to understand a bit more about how the model is represented under the hood on the Jetson Nano. Is there any documentation you could point me to? If possible, could you also suggest any NVIDIA toolkit that could improve the model's file size and speed?

Hi,

It’s recommended to use our latest software first.
You should see some improvement after upgrading TensorRT to v7.1.
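
Also note that a serialized engine is tied to the TensorRT version (and GPU) it was built with, so the engine needs to be rebuilt after upgrading. A minimal sketch for checking that a rebuilt engine deserializes correctly (reusing the detection_model.trt name from your command):

import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
runtime = trt.Runtime(logger)

# Deserialization fails if the engine was built with a different
# TensorRT version, so a successful load confirms the rebuild.
with open("detection_model.trt", "rb") as f:
    engine = runtime.deserialize_cuda_engine(f.read())

print("Engine loaded; number of bindings:", engine.num_bindings)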

Most of our TensorRT implementation is not open-sourced,
but you can find the serializers for plugins here:

Thanks.