Why is TensorRT faster than TensorFlow?

lolcocks1231 · April 22, 2022, 2:20am

Hello,

What is the exact technical reason why TensorRT is faster than TensorFlow or others?

Everywhere I see, I see reasons like “40% faster” etc. No technical reasons.

Does anyone have a link to a technical reason?

NVES · April 22, 2022, 2:37am

Hi,

Request you to share the model, script, profiler, and performance output if not shared already so that we can help you better.

Alternatively, you can try running your model with trtexec command.

While measuring the model performance, make sure you consider the latency and throughput of the network inference, excluding the data pre and post-processing overhead.
Please refer to the below links for more details:
https://docs.nvidia.com/deeplearning/tensorrt/archives/tensorrt-803/best-practices/index.html#measure-performance

https://docs.nvidia.com/deeplearning/tensorrt/archives/tensorrt-803/best-practices/index.html#model-accuracy

Thanks!

lolcocks1231 · April 22, 2022, 2:31pm

Thank you, NVES!

But this is a more general question.

I am asking, why does TensorRT in general perform better than TensorFlow on the GPU? Like how exactly is TensorRT taking advantage of the hardware to perform better than other machine learning libraries.

spolisetty · April 26, 2022, 5:16am

Hi,

We document some of what we do here, Please refer following.
https://developer.nvidia.com/tensorrt#features

Thank you.

Topic		Replies	Views
Tensorrt is slower than pytorch TensorRT	2	2207	September 15, 2021
TensorRT vs TensorFlow-TRT Jetson TX2 tensorrt	2	629	October 18, 2021
Tensorrt inference slower than tensorflow TensorRT	3	483	November 27, 2020
Slow first inference and very slow two models inference TensorRT	3	1216	August 2, 2022
Slow inference UNet Industrial TF-TRT TensorRT tensorrt , tensorflow	1	455	July 2, 2023
Inference time of tensorrt 6.3 is slower than tensorrt 6.0 TensorRT tensorrt , driveos	7	912	October 12, 2021
INT8 (8-bit inference, post-training quantization) on Windows 10 is much slower than Ubuntu 20.04 TensorRT	5	722	September 23, 2022
Get the Best Performance for Your Neural Networks with TensorRT Technical Blog	0	252	August 21, 2022
I found that using tensorrt for inference takes more time than using tensorflow directly on GPU TensorRT	1	745	April 9, 2019
Why TensorRT model is slower? TensorRT tensorrt	3	1305	June 20, 2022

Why is TensorRT faster than TensorFlow?

Related topics