No speedup on TX2 when using TensorRT to accelerate my trained model

Hello,

Linux distro and version: Ubuntu 16.04
GPU type: NVIDIA TX2
NVIDIA driver version: JetPack 3.2
CUDA version: 9.0
cuDNN version: 7.0.5
Python version [if using python]: 2.7
TensorFlow version: 1.8
TensorRT version: TensorRT 3.0 GA

Describe the problem
I trained a TensorFlow model on a server and deployed it to the GPU of a TX2, where it takes about 0.14 s to process one frame. I then converted the model with TensorRT following this guide: https://github.com/NVIDIA-AI-IOT/tf_trt_models, but there is no speedup: it still takes 0.14 s per frame. Before measuring, I set the board to maximum performance:
sudo nvpmodel -m 0
sudo ~/jetson_clocks.sh
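One thing worth ruling out (an assumption on my part, not confirmed from the post): TF-TRT builds its TensorRT engines lazily on the first `sess.run`, so a measurement that includes the first few runs absorbs that one-time cost. A minimal timing sketch with warm-up iterations (`time_inference` and its parameters are illustrative names, not part of any API):

```python
import time

def time_inference(run_once, warmup=10, iters=50):
    # The first runs include one-time costs (graph optimization, and with
    # TF-TRT the TensorRT engine build happens on the first sess.run), so
    # discard them before measuring steady-state latency.
    for _ in range(warmup):
        run_once()
    start = time.time()
    for _ in range(iters):
        run_once()
    return (time.time() - start) / iters

# With the real model this would be something like:
#   avg = time_inference(lambda: sess.run(logits, feed_dict={placeholder: frame}))
```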
This is the relevant part of my code:
import tensorflow as tf
import tensorflow.contrib.tensorrt as trt  # TF-TRT integration in TF 1.x

config = tf.ConfigProto()
config.gpu_options.allow_growth = True

# Freeze the trained checkpoint into a constant GraphDef
with tf.Graph().as_default() as tf_graph:
    with tf.Session(config=config) as sess:
        saver = tf.train.import_meta_graph('model/model.meta')
        saver.restore(sess, tf.train.latest_checkpoint('model/'))

        frozen_graph = tf.graph_util.convert_variables_to_constants(
            sess,
            sess.graph_def,
            output_node_names=['network/Squeeze']
        )

# Convert the frozen graph with TensorRT
trt_graph = trt.create_inference_graph(
    input_graph_def=frozen_graph,
    outputs=['network/Squeeze'],
    max_batch_size=1,
    max_workspace_size_bytes=1 << 25,
    precision_mode='FP16',
    minimum_segment_size=50
)

with open('trt_graph/trt_graph.pb', 'wb') as writer:
    writer.write(trt_graph.SerializeToString())

# Load the converted graph for inference
self.sess = tf.Session(config=config)
with tf.gfile.GFile('trt_graph/trt_graph.pb', 'rb') as f:
    self.graph_def = tf.GraphDef()
    self.graph_def.ParseFromString(f.read())
with self.sess.graph.as_default():
    tf.import_graph_def(self.graph_def)
self.placeholder = self.sess.graph.get_tensor_by_name('import/input/input:0')
self.logits = self.sess.graph.get_tensor_by_name('import/network/Squeeze:0')
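One likely cause of the unchanged latency: with `minimum_segment_size=50`, TF-TRT only converts subgraphs of at least 50 nodes, and a small network may contain no such segment, in which case nothing is converted at all. A quick sanity check is to count the `TRTEngineOp` nodes in the returned `trt_graph`. A sketch (the stand-in `namedtuple` graph below is only for illustration; a real `tf.GraphDef` exposes the same `.node`/`.op` shape):

```python
from collections import namedtuple

def count_trt_engines(graph_def):
    # TF-TRT replaces each converted subgraph with a single node whose op
    # is 'TRTEngineOp'. If this count is 0, nothing was converted and the
    # model still runs entirely on plain TensorFlow, so identical timings
    # are expected.
    return sum(1 for node in graph_def.node if node.op == 'TRTEngineOp')

# Stand-in for a real tf.GraphDef, for illustration only:
Node = namedtuple('Node', ['op'])
Graph = namedtuple('Graph', ['node'])
demo = Graph(node=[Node('Conv2D'), Node('TRTEngineOp'), Node('Relu')])
print(count_trt_engines(demo))  # -> 1
```

If the count on the real `trt_graph` is 0, lowering `minimum_segment_size` (e.g. to 3-5) should let TF-TRT convert smaller segments.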