TensorRT model inference is slower than normal model

saikrishnadas666 · July 29, 2020, 5:36am

Description

I converted my yolov4 model to trt model to run in jetson Xavier nx. But after inferencing the trt model, i found out that it took around 2-3 secs to detect a image , which is considered slower than the normal model which took only a sec.

I came across a WARNING while running the trt inference.

[TensorRT] WARNING: TensorRT was linked against CuDNN 7.6.5 but loaded against CuDNN 7.5.1

Environment

TensorRT Version: 7.0.0.11
GPU Type: P8 (Aws Deep learning 30.0 ubuntu 18.04 instance)
Nvidia Driver Version: 440.33.01
CUDA Version: 10.2
CUDNN Version: 7.5.1
Operating System + Version: ubuntu 18.04
Python Version (if applicable): 3.6
TensorFlow Version (if applicable):
PyTorch Version (if applicable): NO pytorch used
Baremetal or Container (if container which image + tag):

NOTE:
The CUDA,CuDNN is pre-installed in the Aws Deep learning 30.0 ubuntu 18.04 instance.

Hoping to hear soon,
Thanks,
SAI

AakankshaS · July 29, 2020, 9:50am

Hi @saikrishnadas666,
Can you please help me with your model and script so that i can check it at my end.
Thanks!

saikrishnadas666 · July 29, 2020, 11:01am

I have used yolov4 coco model.
The script and method i followed can be taken from GitHub - jkjung-avt/tensorrt_demos: TensorRT MODNet, YOLOv4, YOLOv3, SSD, MTCNN, and GoogLeNet ( Converstion of YOLOv4 to trt model )

SunilJB · August 18, 2020, 5:52am

Hi,
Sorry for late response. Are you using AWS P2 instance for this script testing or running on Jetson Xavier NX?
In case you are using P2 instance, it has Tesla K80 with compute capability of 3.7. Will recommend to try G4 or P3 instances instead.
Please refer below link

In case of Jetson Xavier, could you please share the Jetpack version.

Thanks

saikrishnadas666 · August 18, 2020, 9:58am

I use jetson nx to run the inference script.
Jetpack 4.4

SunilJB · August 18, 2020, 10:50am

Request you to raise issue in Issues · jkjung-avt/tensorrt_demos · GitHub

Thanks

Topic		Replies	Views
TensorRT Inference is Slower Than Other Frameworks TensorRT	7	3695	December 9, 2019
Performance drop when performing inference with a .trt engine on a python script TensorRT tensorrt	3	478	February 18, 2022
Extremely slow inference with MMDetection on Jetson Xavier NX Jetson Xavier NX jetson-inference	7	1894	June 27, 2022
Detectron2 model converted into Tensorrt taking large time TensorRT	1	545	September 20, 2022
Inference is so slow with torch1.6 Jetson Xavier NX nvbugs , pytorch	12	3529	October 23, 2020
Nvidia Jetson NX extremely slow even with TensorRT inference for yolov3 TensorRT	3	1194	August 23, 2021
Inference time of tensorrt 6.3 is slower than tensorrt 6.0 TensorRT tensorrt , driveos	7	912	October 12, 2021
Tensorrt is slower than pytorch TensorRT	2	2195	September 15, 2021
converting a frozen graph to tensorRT Jetson Nano	5	1786	October 14, 2021
Optimize Inference Time of yolov2 model on Jetson Nano NX Jetson Xavier NX tensorrt , tensorflow	2	727	June 15, 2022

TensorRT model inference is slower than normal model

Description

Environment

Related topics