the first inference speed is so slow

andylina · December 3, 2019, 1:48am

when i translate my pb model to tensorrt model, the first time it run sess.run ,is almost 3 three hour,then it get down to 0.17s:
I use tensorfow-gpu 2.0
cuda 10.0
cudnn:7.6.0
nvidia:GForce1080Ti
the code is below:
import os
import tensorflow.compat.v1 as tf
import time
import numpy as np

os.environ[“CUDA_DEVICE_ORDER”] = “PCI_BUS_ID”
os.environ[“CUDA_VISIBLE_DEVICES”] = “6”

location = False

if location:
# trtfilepath = “/home/andy/models_trt/densenet/densenet_rt8.pb”
trtfilepath = “/home/andy/models_trt/densenet/densenet_rt8.pb”
else:
trtfilepath = “/home/dongdong/qian/models_trt/densenet/densenet_rt8.pb”
# trtfilepath = “/home/dongdong/qian/models/densenet/loss_min_threeN_GPU16.pb”

input_x = “0:0”
outimg1 = “Sigmoid:0”
outimg2 = “Sigmoid_1:0”
outimg3 = “Sigmoid_2:0”

shape = [1, 1, 256, 256]
features = np.random.random(shape).astype(np.float32)

with tf.Session() as sess:
with tf.gfile.GFile(trtfilepath, ‘rb’) as f:
frozen_graph = tf.GraphDef()
frozen_graph.ParseFromString(f.read())
sess.graph.as_default()
tf.import_graph_def(frozen_graph, name=‘’)
tf_input = sess.graph.get_tensor_by_name(input_x)
tf_output1 = sess.graph.get_tensor_by_name(outimg1)
tf_output2 = sess.graph.get_tensor_by_name(outimg2)
tf_output3 = sess.graph.get_tensor_by_name(outimg3)
t1 = time.time()

    while True:
        t1 = time.time()
        output1, output2, output3 = sess.run([tf_output1, tf_output2, tf_output3], feed_dict={
            tf_input: features
        })
        t2 = time.time()
        print(t2-t1)

what’s wrong with it?

TsarevnaSvetlana · October 14, 2020, 6:40pm

Any updates on this? I’m having a similar problem.

hoangtm.fami · April 3, 2021, 3:50am

Any update on this? Seems like the whole tech for edge device is still at early stage. So many bugs and inefficiencies around

Topic		Replies	Views
The first inference using tensorRT model takes far longer time than that using tensorflow model TensorRT	0	658	November 13, 2020
inference time of tensorrt is slower than tensorflow !!! TensorRT	2	1433	September 27, 2019
TensorRT results in reduced accuracy and performance TensorRT tensorrt	1	1489	July 30, 2020
Slow first inference and very slow two models inference TensorRT	3	1239	August 2, 2022
Tensorflow Deeplab to TensorRT conversion TensorRT	3	1798	January 30, 2019
TensorRT inference Time TensorRT	1	758	September 20, 2018
Tensorrt inference slower than tensorflow TensorRT	3	484	November 27, 2020
TensorRT inference is slower than tensorflow model TensorRT	1	954	June 28, 2019
Tlt-infer is slow TAO Toolkit	13	830	October 12, 2021
When using tensorrt's c++ API for inference under 3060 graphics card, the speed of loading the first picture is very slow TensorRT tensorrt , cuda	0	449	May 30, 2022

the first inference speed is so slow

Related topics