TensorRT Inference Time

Hi,
I understand that my TensorFlow model should run faster on Jetson TX2 using TensorRT.
But after converting my TF model to TensorRT, I found that inference is actually slower with the TensorRT engine: 80 ms instead of 20 ms.

My net:
1 input (Input1) of shape 1x448x576
1 input (Input2) of shape 1x448x576
1 output of shape 5x233x297

After converting to UFF, I run this function once:

import numpy as np
import pycuda.autoinit  # creates and manages a CUDA context
import pycuda.driver as cuda

def prepare_inference(self, channel_size, height, width, batch_size):
    # Allocate pagelocked (pinned) host memory for the output
    self.output = cuda.pagelocked_empty(5 * 233 * 297, dtype=np.float32)
    # Allocate device memory (element count * 4 bytes for float32)
    self.d_input1 = cuda.mem_alloc(1 * 448 * 576 * 4)
    self.d_input2 = cuda.mem_alloc(1 * 448 * 576 * 4)
    self.d_output = cuda.mem_alloc(5 * 233 * 297 * 4)

    self.stream = cuda.Stream()
    self.bindings = [int(self.d_input1), int(self.d_input2), int(self.d_output)]
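For reference, the hard-coded byte counts above are just element count times 4 bytes (float32). A small sketch that derives them from the binding shapes; the shapes come from the net description above, while the helper itself is purely illustrative:

```python
import numpy as np

SHAPES = {
    "input1": (1, 448, 576),
    "input2": (1, 448, 576),
    "output": (5, 233, 297),
}

def nbytes(shape, dtype=np.float32):
    # Byte size of one binding: element count times dtype item size
    return int(np.prod(shape)) * np.dtype(dtype).itemsize

# nbytes(SHAPES["input1"]) == 1 * 448 * 576 * 4
```

Computing sizes from one shape table avoids the allocation and the reshape in do_infer silently drifting apart.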

and then run inference with the following code:

def do_infer(self, input1, input2):
    # Ensure contiguous float32 host arrays before copying to the device
    input1 = np.ascontiguousarray(input1, dtype=np.float32)
    input2 = np.ascontiguousarray(input2, dtype=np.float32)
    cuda.memcpy_htod_async(self.d_input1, input1, self.stream)
    cuda.memcpy_htod_async(self.d_input2, input2, self.stream)

    # Execute the model (batch size 1) on the stream
    self.context.enqueue(1, self.bindings, self.stream.handle, None)
    # Transfer predictions back on the same stream, then wait for it to finish
    cuda.memcpy_dtoh_async(self.output, self.d_output, self.stream)
    self.stream.synchronize()

    return np.reshape(self.output, (5, 233, 297))
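One common source of misleading numbers is timing without warm-up: the first few calls pay one-time costs (context creation, lazy allocation, kernel selection), so averaging over many calls after a warm-up phase gives a fairer figure. A minimal timing sketch; the helper name and iteration counts are illustrative, not from the original post:

```python
import time

def benchmark_ms(fn, warmup=10, iters=100):
    # Warm-up calls absorb one-time initialization costs
    for _ in range(warmup):
        fn()
    start = time.perf_counter()
    for _ in range(iters):
        fn()
    # Average wall-clock milliseconds per call
    return (time.perf_counter() - start) * 1000.0 / iters
```

Usage would be something like benchmark_ms(lambda: self.do_infer(x1, x2)); wall-clock timing is only meaningful here if do_infer blocks until results are ready (e.g. via a stream synchronize or a blocking device-to-host copy).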

Can you please help me understand how this is possible?
Thanks

Hello,

Can you share the UFF file with us? Which versions of TensorFlow and TensorRT are you using?

thanks