If I have a TensorFlow model, I have two options to convert it into a TensorRT-optimized model: (i) via TF-TRT, which is relatively easy and simple, and (ii) using the TensorRT C++ API. Starting from the same model, on the same GPU, will both methods, (i) and (ii), give the same performance, i.e., the same FPS? Or will there be a difference in performance? Can you provide a benchmark result comparing them?
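For context, the TF-TRT path (option (i)) is typically only a few lines wrapped around an existing SavedModel. Below is a minimal sketch, assuming TensorFlow 2.x with TensorRT installed; the directory names are placeholders:

```python
# Minimal TF-TRT conversion sketch (TensorFlow 2.x).
# "saved_model_dir" and "trt_saved_model_dir" are placeholder paths.
from tensorflow.python.compiler.tensorrt import trt_convert as trt

params = trt.DEFAULT_TRT_CONVERSION_PARAMS._replace(
    precision_mode=trt.TrtPrecisionMode.FP16)  # FP32/FP16/INT8

converter = trt.TrtGraphConverterV2(
    input_saved_model_dir="saved_model_dir",
    conversion_params=params)
converter.convert()                    # replaces supported subgraphs with TRT ops
converter.save("trt_saved_model_dir")  # writes a SavedModel containing TRT engines
```

Option (ii), building an engine directly with the TensorRT C++ API, requires exporting and parsing the network yourself, which is one reason the two paths can behave differently in practice.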
After starting to try TensorRT optimization, I personally ran into difficulties here and there, so I decided to make a video tutorial on how to optimize deep learning models built with Keras and TensorFlow. I also demonstrate optimizing YOLOv3. I hope it helps those who are just beginning with TensorRT, so they don't run into the same difficulties I experienced.
I didn't find any official benchmark, but in "Deep Learning Inference on PowerEdge R7425" by Dell there is a comparison of the TensorRT API and TF-TRT.
In my research I got similar results, so I can confirm that section of the whitepaper.