Trt file from onnx is too large

jolly.ming2005 · March 10, 2021, 9:33am

Description

hi,
I have an onnx model(the file size is 282M).
After converting to tensorrt model, the final trt file is 739M .
Why is the trt file so much larger than the onnx file?
Any suggestions?
Thanks!

Environment

TensorRT Version: v7.1.3.4
GPU Type: 1080Ti
Nvidia Driver Version: 455.45
CUDA Version: 11.0
CUDNN Version: 8.5
Operating System + Version: ubuntu 18.04
Python Version (if applicable): python3.6
PyTorch Version (if applicable): torch1.7

python code

import tensorrt as trt

import os
TRT_LOGGER = trt.Logger()

EXPLICIT_BATCH = 1 << (int)(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)

def get_engine(onnx_file_path, engine_file_path=""):
    """Attempts to load a serialized engine if available, otherwise builds a new TensorRT engine and saves it."""
    def build_engine():
        """Takes an ONNX file and creates a TensorRT engine to run inference with"""
        with trt.Builder(TRT_LOGGER) as builder, builder.create_network(EXPLICIT_BATCH) as network, trt.OnnxParser(network, TRT_LOGGER) as parser:
        #with trt.Builder(TRT_LOGGER) as builder, builder.create_network() as network, trt.OnnxParser(network, TRT_LOGGER) as parser:
            builder.max_workspace_size = 1 << 30 # 256MiB
            builder.max_batch_size = 1
            # Parse model file
            if not os.path.exists(onnx_file_path):
                print('ONNX file {} not found, please run yolov3_to_onnx.py first to generate it.'.format(onnx_file_path))
                exit(0)
            print('Loading ONNX file from path {}...'.format(onnx_file_path))
            with open(onnx_file_path, 'rb') as model:
                print('Beginning ONNX file parsing')
                parser.parse(model.read())
            network.get_input(0).shape = [1, 3, 1024, 1224]

            print('Completed parsing of ONNX file')
            print('Building an engine from file {}; this may take a while...'.format(onnx_file_path))
            engine = builder.build_cuda_engine(network)
            print("Completed creating Engine")
            with open(engine_file_path, "wb") as f:
                f.write(engine.serialize())
            return engine
            
def main():
    onnx_file = "/data3/deeplearning/output/iter_160000.pth.onnx"
    trt_file = "/data3/deeplearning/output/iter_160000.pth.onnx.trt"
    get_engine(onnx_file, trt_file)
    print("FINISH")

if __name__ == '__main__':
    main(

NVES · March 10, 2021, 1:05pm

Hi,
Request you to share the ONNX model and the script if not shared already so that we can assist you better.
Alongside you can try few things:

validating your model with the below snippet

check_model.py

import sys
import onnx
filename = yourONNXmodel
model = onnx.load(filename)
onnx.checker.check_model(model).
2) Try running your model with trtexec command.
https://github.com/NVIDIA/TensorRT/tree/master/samples/opensource/trtexec
In case you are still facing issue, request you to share the trtexec “”–verbose"" log for further debugging
Thanks!

Topic		Replies	Views
Serialized trt engine file is much larger than the original model file? TensorRT tensorrt	2	872	October 12, 2021
The same onnx file produces engine files with different size under same configurations TensorRT	4	686	May 12, 2022
Load ONNX model with batch size TensorRT	3	1779	October 12, 2021
When converting BERT onnx to TensorRT engine, get different num_layers TensorRT	12	1865	February 21, 2021
Fail to onnx2trt! TensorRT tensorrt	4	380	November 26, 2020
How to convert .onnx to .trtmodel tensorrt using trtexec with size output optional TensorRT	2	1389	September 11, 2024
A weird bug: Two similar onnx but one engine with bug TensorRT	3	486	February 27, 2023
TensorRT runtime batch processing in C++ TensorRT tensorrt	5	1578	September 8, 2021
Input length mismatch (onnx conversion to .trt) TensorRT tensorrt , onnx	4	1274	July 13, 2022
This version of TensorRT does not support dynamic ReverseSequence length TensorRT tensorflow	3	3233	February 17, 2022

Trt file from onnx is too large

Description

Environment

python code

check_model.py

Related topics