I am working on a TensorFlow 2.0 project that uses multiple models for inference.
Some of those models were optimized using TF-TRT.
I tried both regular offline conversion and offline conversion with engine serialization. With regular conversion, the TensorRT engine is rebuilt every time the model's execution context changes. With serialized engines, I am not able to load more than one TensorRT-optimized model.
My application uses a single Session at runtime.
I am using the nvcr.io/nvidia/tensorflow:19.12-tf2-py3 Docker container to optimize the models and run the application.
More about the issue in:
What is the correct approach to running multiple TensorRT-optimized models with pre-built engines simultaneously in TensorFlow?
Is it a valid solution to use a separate Session for each of those models?
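To illustrate what I mean by a separate Session per model, here is a minimal sketch using the `tf.compat.v1` Graph/Session API. The `ModelRunner` class and the two toy graphs are hypothetical stand-ins for the real TF-TRT-optimized SavedModels; the point is only that each model owns its own Graph and Session, so per-session resources (such as serialized TensorRT engines) would not collide:

```python
import tensorflow as tf

# Graph/Session mode is required for this sketch under TF 2.x.
tf.compat.v1.disable_eager_execution()


class ModelRunner:
    """Hypothetical wrapper: one Graph and one Session per model."""

    def __init__(self, build_fn):
        self.graph = tf.Graph()
        with self.graph.as_default():
            # Toy scalar input; a real model would be loaded here instead,
            # e.g. via tf.compat.v1.saved_model.load from the TF-TRT
            # conversion output directory.
            self.inp = tf.compat.v1.placeholder(tf.float32, shape=(), name="x")
            self.out = build_fn(self.inp)
        # Each runner gets its own Session bound to its own Graph.
        self.sess = tf.compat.v1.Session(graph=self.graph)

    def run(self, x):
        return self.sess.run(self.out, feed_dict={self.inp: x})


# Two stand-in "models" executing side by side in separate sessions.
model_a = ModelRunner(lambda x: x * 2.0)
model_b = ModelRunner(lambda x: x + 1.0)

print(model_a.run(3.0))
print(model_b.run(3.0))
```

This only demonstrates the session-isolation pattern itself; whether it actually avoids the serialized-engine conflict with TF-TRT models is exactly what I am asking about.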