Slow inference on Jetson TX2 with TensorFlow

I am getting started with deploying deep learning models to a Jetson TX2 flashed with JetPack 3.3.

I have been using the following resources:

So far, I have successfully trained models with DIGITS and deployed them using the tutorial in the first repo. The performance matches the benchmarks (<100 ms for classification, <200 ms for detection).

However, most of our DL research has been done in Keras and TensorFlow so far, and it would be easier to use these frameworks and Python directly.

The tf_trt repo looked promising, but I am nowhere near the performance announced. My device was flashed with JetPack 3.3, but otherwise I followed the instructions in the repo (TensorFlow 1.8.0, etc.).

The following code gives an inference time of over 1 second (I literally run the classification notebook with inception_v1), so more than two orders of magnitude slower than the announced 7 ms.

from time import time
start = time()
output = tf_sess.run(tf_output, feed_dict={tf_input: image[None, ...]})
end = time()
print("inference time: {}s".format(end-start))
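For what it's worth, here is a more careful timing loop — a minimal sketch, assuming tf_sess, tf_input, tf_output and image are already defined as in the classification notebook. The first run after loading the graph typically includes one-time setup (memory allocation, engine initialization), so it helps to warm up before timing and to average over many runs:

from time import time

# Minimal timing sketch (assumes tf_sess, tf_input, tf_output and image
# are already defined as in the tf_trt_models classification notebook).
num_warmup = 10   # first runs include one-time setup, exclude them
num_runs = 50     # average over many runs for a stable number

for _ in range(num_warmup):
    tf_sess.run(tf_output, feed_dict={tf_input: image[None, ...]})

start = time()
for _ in range(num_runs):
    tf_sess.run(tf_output, feed_dict={tf_input: image[None, ...]})
end = time()

print("average inference time: {:.1f} ms".format((end - start) / num_runs * 1000.0))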

Hi,

Not all TensorFlow operations are supported by TensorRT.
For those unsupported ops, TensorFlow-TRT falls back to the native TensorFlow implementation.

To find where the bottleneck comes from, it's recommended to check which implementation (TF or TRT) is used for each layer.
Could you generate this information first?
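For example, with the TF 1.x tensorflow.contrib.tensorrt integration you can count how many nodes were converted into TRTEngineOp segments versus left as native TensorFlow ops — a rough sketch, assuming frozen_graph and output_names are the frozen GraphDef and output node names from the notebook:

from collections import Counter
import tensorflow.contrib.tensorrt as trt  # TF 1.x contrib integration

# Assumes frozen_graph (GraphDef) and output_names come from the notebook.
trt_graph = trt.create_inference_graph(
    input_graph_def=frozen_graph,
    outputs=output_names,
    max_batch_size=1,
    max_workspace_size_bytes=1 << 25,
    precision_mode='FP16',
    minimum_segment_size=50)

# Nodes converted by TensorRT show up as 'TRTEngineOp'; everything else
# still runs on the native TensorFlow implementation.
op_counts = Counter(node.op for node in trt_graph.node)
print('TRTEngineOp nodes :', op_counts.get('TRTEngineOp', 0))
print('native TF nodes   :', sum(c for op, c in op_counts.items() if op != 'TRTEngineOp'))
print(op_counts.most_common(10))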

Thanks.