TensorRT model build time and deployment

Hello again,
this question is not related to model inference but rather to the build time of a TensorRT engine.

For example, if we consider a YOLOv5 model with TensorRT, buildEngineWithConfig takes a long time to compile the engine. It doesn’t matter whether the ONNX or Caffe parser is used; the build time will be similar.
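To make it concrete, here is a rough sketch of the step I mean, assuming the standard explicit-batch ONNX workflow (yolov5s.onnx is just a placeholder path); the buildEngineWithConfig call is where almost all of the time goes:

```cpp
#include <chrono>
#include <iostream>
#include <NvInfer.h>
#include <NvOnnxParser.h>

// Minimal logger required by the TensorRT builder and parser.
class Logger : public nvinfer1::ILogger {
    void log(Severity severity, const char* msg) noexcept override {
        if (severity <= Severity::kWARNING) std::cout << msg << std::endl;
    }
} gLogger;

int main() {
    auto builder = nvinfer1::createInferBuilder(gLogger);
    const auto flags = 1U << static_cast<uint32_t>(
        nvinfer1::NetworkDefinitionCreationFlag::kEXPLICIT_BATCH);
    auto network = builder->createNetworkV2(flags);
    auto parser  = nvonnxparser::createParser(*network, gLogger);
    parser->parseFromFile("yolov5s.onnx",   // placeholder model path
        static_cast<int>(nvinfer1::ILogger::Severity::kWARNING));

    auto config = builder->createBuilderConfig();
    config->setMaxWorkspaceSize(1ULL << 30);  // 1 GiB workspace

    auto t0 = std::chrono::steady_clock::now();
    auto engine = builder->buildEngineWithConfig(*network, *config);  // the slow step
    auto t1 = std::chrono::steady_clock::now();
    std::cout << "Engine build took "
              << std::chrono::duration_cast<std::chrono::seconds>(t1 - t0).count()
              << " s" << std::endl;
    // ... serialize or use the engine; cleanup (destroy()/delete) omitted for brevity ...
    return 0;
}
```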
Honestly, the topic of long build times has been discussed many times, for example in this post or here.

Therefore, this question is oriented towards finding a solution, or at least some options, to avoid (or reduce) the model build time. As I previously stated, some applications may require multiple deep learning solutions. Consider an application that employs 10 completely different deep learning models, all powered by TensorRT. The build time may vary from roughly 1 to 5 minutes per model depending on the architecture. If customers install the application with those models, it may take a very long time to build all of them on their machine, and they might avoid using the application because they see this as an obstacle.
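For completeness: serializing each engine after the first build and deserializing it on later launches avoids rebuilding on every start, but since plan files are specific to the GPU and TensorRT version, this does not remove the initial build on the customer’s machine, which is the part I would like to avoid. A rough sketch of such a cache (cacheEngine/loadCachedEngine are just illustrative helper names):

```cpp
#include <fstream>
#include <iterator>
#include <string>
#include <vector>
#include <NvInfer.h>

// Write the serialized plan to disk after the first (slow) build on this machine.
void cacheEngine(nvinfer1::ICudaEngine& engine, const std::string& path) {
    nvinfer1::IHostMemory* plan = engine.serialize();
    std::ofstream out(path, std::ios::binary);
    out.write(static_cast<const char*>(plan->data()), plan->size());
    plan->destroy();
}

// Reload a previously cached plan; returns nullptr if no cache exists yet,
// in which case the caller falls back to buildEngineWithConfig.
// The runtime comes from nvinfer1::createInferRuntime(logger).
nvinfer1::ICudaEngine* loadCachedEngine(nvinfer1::IRuntime& runtime,
                                        const std::string& path) {
    std::ifstream in(path, std::ios::binary);
    if (!in) return nullptr;
    std::vector<char> plan((std::istreambuf_iterator<char>(in)),
                            std::istreambuf_iterator<char>());
    return runtime.deserializeCudaEngine(plan.data(), plan.size());
}
```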

So what would be the optimal solution to this issue?

Currently, I don’t see a solution or any other options; rather, I expect the build time to increase with the new TensorRT version 8.0.1, where it’s stated:

Engine build times for TensorRT 8.0 may be slower than TensorRT 7.2 due to the engine optimizer being more aggressive.

Best regards,
Andrej