I converted a TensorFlow model to UFF. However, the inference time is slower than the original TensorFlow model. My GPU is a GTX 1070.
Could you share more information about your use-case?
- Which network do you use?
- What is the environment of your use-case?
Is it TensorRT on Jetson and TensorFlow on GTX-1070?
My network is MobileNet; both TensorRT and TensorFlow run on the GTX 1070.
My environment is Ubuntu 16.04 with CUDA 8.0.
We found a public mobilenet repository.
Is this the model you are using? We want to check it further.
It is exactly what I use.
By the way, I found another confusing problem.
I followed your lenet.py example. When I convert the whole graph to UFF, TensorRT is faster than TensorFlow. However, when the graph is only a single convolution layer, TensorRT is much slower than TensorFlow, and I don't know why.
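One possible explanation (an assumption, not something confirmed in this thread) is that for a tiny graph, fixed per-inference overhead such as host-device copies and API call cost dominates, so an optimized kernel barely changes the total. A minimal sketch with purely hypothetical timings, not measured TensorRT numbers:

```python
# Illustrative cost model only: all numbers below are made up to show
# how fixed per-call overhead can dominate a tiny workload.

def total_time_ms(fixed_overhead_ms, compute_ms, n_calls):
    """Total wall time if every inference pays a fixed overhead plus compute."""
    return n_calls * (fixed_overhead_ms + compute_ms)

# Whole LeNet graph (hypothetical): compute dominates, so a faster kernel wins.
full_graph = total_time_ms(fixed_overhead_ms=0.5, compute_ms=5.0, n_calls=1000)

# Single conv layer (hypothetical): overhead dominates, so kernel speed barely matters.
single_conv = total_time_ms(fixed_overhead_ms=0.5, compute_ms=0.05, n_calls=1000)

print(full_graph)   # overhead is ~9% of the total
print(single_conv)  # overhead is ~91% of the total
```

Under this model, speeding up the conv kernel itself would cut the single-layer total by at most a few percent, which is consistent with small graphs not benefiting from TensorRT.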
Hey, I also have some trouble with the latency of TensorRT.
My UFF model (converted from TensorFlow) running in TensorRT (with Docker) takes 300 s for 1000 inferences, while native TensorFlow 1.8 only takes about 30 s.
I'm not familiar with CUDA. So, is the main advantage of TensorRT throughput, or is there likely a bug in my code?
P.S. both were tested on a P100.
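300 s for 1000 inferences is 300 ms each, which seems high for a P100, so one thing worth checking (an assumption about the measurement, not a diagnosis) is whether the timing includes one-time setup such as engine/context creation or first-call allocations. A hedged sketch of a timing harness that excludes warm-up iterations; `infer` is a placeholder for the actual TensorRT execution call:

```python
import time

def benchmark(infer, n_warmup=10, n_runs=100):
    """Mean seconds per call, excluding warm-up iterations.

    `infer` is a stand-in for your real inference call; the first few
    calls often include lazy initialization (context setup, memory
    allocation) and should not be counted toward steady-state latency.
    """
    for _ in range(n_warmup):
        infer()
    start = time.perf_counter()
    for _ in range(n_runs):
        infer()
    elapsed = time.perf_counter() - start
    return elapsed / n_runs

# Example with a dummy workload standing in for the real model:
mean_s = benchmark(lambda: sum(range(10000)))
print(f"{mean_s * 1000:.3f} ms per call")
```

If the warm-up-excluded number is much lower than 300 ms, the original figure was mostly setup cost rather than per-inference latency.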
Do you use TensorRT API or TensorFlow-TensorRT interface?
I encountered the same problem as cosin7877. I use the TensorRT API and converted the TensorFlow model to a UFF file. What are the usual causes of these problems?
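One common factor behind "TensorRT looks slow" reports (again an assumption, since the thread doesn't show the benchmark code) is measuring with batch size 1 while paying a fixed per-call cost (launch overhead, host-device copies) on every call; larger batches amortize that cost, which is where throughput gains show up. A toy model with hypothetical numbers, where `per_batch_overhead_ms` and `per_image_compute_ms` are made-up parameters:

```python
def throughput_imgs_per_s(batch, per_batch_overhead_ms, per_image_compute_ms):
    """Hypothetical model: each batched call pays one fixed overhead,
    plus a per-image compute cost; returns images per second."""
    latency_ms = per_batch_overhead_ms + batch * per_image_compute_ms
    return batch / (latency_ms / 1000.0)

# With a made-up 2 ms fixed cost and 0.5 ms/image of compute,
# throughput rises sharply with batch size:
for b in (1, 8, 32):
    print(b, round(throughput_imgs_per_s(b, 2.0, 0.5), 1))
```

So if both frameworks were benchmarked at batch size 1 with synchronous copies, the comparison may say more about per-call overhead than about the optimized kernels.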