TensorRT does not speed up my TensorFlow model

I trained a pornographic-image-recognition model with TensorFlow and exported it as a SavedModel. I then used TF-TRT from the container nvidia/tensorflow:19.02-py3 to optimize the model, and served both the original SavedModel and the TF-TRT-optimized SavedModel with the container tensorflow/serving:1.14.0-devel-gpu. When I benchmark them with a TF Serving client, both models take the same time per request.
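
The timing client follows the standard TF Serving gRPC pattern, roughly like this (a minimal sketch; host, port, model name, and input shape are placeholders, and predict_image is the signature exported below):

import time
import grpc
import numpy as np
import tensorflow as tf
from tensorflow_serving.apis import predict_pb2, prediction_service_pb2_grpc

channel = grpc.insecure_channel("localhost:8500")            # placeholder host:port
stub = prediction_service_pb2_grpc.PredictionServiceStub(channel)

request = predict_pb2.PredictRequest()
request.model_spec.name = "nsfw_model"                       # placeholder model name
request.model_spec.signature_name = "predict_image"
image = np.random.rand(1, 224, 224, 3).astype(np.float32)    # placeholder input shape
request.inputs["image"].CopyFrom(tf.make_tensor_proto(image))

start = time.time()
for _ in range(100):
    stub.Predict(request, 10.0)                              # 10 s timeout per call
print("avg latency: %.2f ms" % ((time.time() - start) * 1000 / 100))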

Here is my TF-TRT optimization code:

import os
import tensorflow as tf
import tensorflow.contrib.tensorrt as trt

flags = tf.app.flags
FLAGS = flags.FLAGS

# Inference with TF-TRT `SavedModel` workflow:
batch_size=8
max_workspace_size=(1<<32)

flags.DEFINE_string("export_dir", '/tftrt_serving_model/', "export_dir")

graph = tf.Graph()
with graph.as_default():
    with tf.Session() as sess:
        # Create a TensorRT inference graph from a SavedModel:
        trt_graph = trt.create_inference_graph(
            input_graph_def=None,
            outputs=None,
            input_saved_model_dir='/tf_serving_model',
            input_saved_model_tags=['serve'],
            max_batch_size=batch_size,
            max_workspace_size_bytes=max_workspace_size,
            precision_mode='FP32')

        tf.import_graph_def(trt_graph, name='')
        for node in trt_graph.node:
            print("Name of the node - [%s]" % node.name)

        x = sess.graph.get_tensor_by_name("data:0")
        prob = sess.graph.get_tensor_by_name("out/softmax:0")
        #         label = sess.graph.get_tensor_by_name("out/output:0")
        print(x)
        print(prob)
        #         print(label)
        values, indices = tf.nn.top_k(prob, 15)
        print(values, indices)

        # Create the SavedModelBuilder that writes the optimized model
        builder = tf.saved_model.builder.SavedModelBuilder(FLAGS.export_dir)
        tensor_info_x = tf.saved_model.utils.build_tensor_info(x)
        tensor_info_pro = tf.saved_model.utils.build_tensor_info(tf.reshape(values, [15]))
        tensor_info_classify = tf.saved_model.utils.build_tensor_info(tf.reshape(indices, [15]))
        signature_def_map = {
            "predict_image": tf.saved_model.signature_def_utils.build_signature_def(
                inputs={"image": tensor_info_x},
                outputs={"pro": tensor_info_pro,
                         "classify": tensor_info_classify
                         },
                method_name=tf.saved_model.signature_constants.PREDICT_METHOD_NAME
            )}
        builder.add_meta_graph_and_variables(sess,
                                             [tf.saved_model.tag_constants.SERVING],
                                             signature_def_map=signature_def_map)
        builder.save()
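
One quick sanity check is whether the conversion actually produced TensorRT segments: if the converted graph contains no TRTEngineOp nodes, TF Serving just runs plain TensorFlow ops and identical timings are expected. A sketch of the check, dropped in right after create_inference_graph (it reuses the trt_graph variable from the script above):

# Count the TensorRT engine nodes that the converter created.
# If this prints 0, no part of the graph was converted to TensorRT.
trt_engine_ops = [n for n in trt_graph.node if n.op == "TRTEngineOp"]
print("TRTEngineOp nodes: %d" % len(trt_engine_ops))
print("Total nodes in converted graph: %d" % len(trt_graph.node))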

Here is a comparison of the model files:

Original TF SavedModel, 23M:
tf_serving_model/
├── saved_model.pb
└── variables
    ├── variables.data-00000-of-00001
    └── variables.index

1 directory, 3 files

TF-TRT-optimized SavedModel, 65M:
tftrt_serving_model
├── saved_model.pb
└── variables

1 directory, 1 file

Can anyone help me figure out why the optimized model is not faster?

“nvidia/tensorflow:19.02-py3” comes with TF 1.13. It would be great if you could try TF 1.14, which is available in “nvidia/tensorflow:19.07-py3”. In TF 1.14, many more operators are converted to TensorRT: Accelerating Inference In TF-TRT User Guide :: NVIDIA Deep Learning Frameworks Documentation
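
Roughly, the TF 1.14 conversion API is the TrtGraphConverter class rather than create_inference_graph. A sketch of the workflow (please check the linked guide for the full set of arguments; the paths and values below are simply the ones from your script):

from tensorflow.python.compiler.tensorrt import trt_convert as trt

converter = trt.TrtGraphConverter(
    input_saved_model_dir='/tf_serving_model',
    input_saved_model_tags=['serve'],
    max_batch_size=8,
    max_workspace_size_bytes=1 << 32,
    precision_mode='FP32')
converter.convert()                       # returns the converted GraphDef
converter.save('/tftrt_serving_model/')   # writes a servable SavedModel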

Due to a bug, the TF 1.14 pip package released by Google doesn’t support TensorRT (will be fixed in 1.14.1). I am not sure if tensorflow/serving:1.14.0-devel-gpu has the same problem.

Could you also post the log that you get from TF?
You can follow this: Accelerating Inference In TF-TRT User Guide :: NVIDIA Deep Learning Frameworks Documentation
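
To make the conversion print which segments were converted, the log verbosity has to be raised before TensorFlow is imported. One way to do it (a sketch; the guide above lists the exact variables it recommends):

import os
# Generic TF verbose-log switch; must be set before TensorFlow is imported.
os.environ["TF_CPP_MIN_VLOG_LEVEL"] = "2"

import tensorflow as tf
tf.logging.set_verbosity(tf.logging.INFO)   # Python-side logging (TF 1.x API)

# Then re-run the conversion script and capture stdout/stderr, e.g.:
#   python convert.py 2>&1 | tee tftrt_conversion.log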