TensorRT inference time issues with different driver versions

Description

Hi! I’m experiencing some issues with TensorRT inference.

With driver version 473.81, inference times are stable and low. After upgrading to the latest drivers (e.g. 537.13), while keeping the CUDA Toolkit and cuDNN versions fixed, inference times are roughly 4x higher and not stable at all. I run inference with a YOLOv5 model exported to a TensorRT engine, loaded via torch.hub.load (see the timing sketch under Steps To Reproduce).

Could this be a driver-related issue?
Thanks!

Environment

TensorRT Version: 8.4.3.1
GPU Type: Quadro T1000 4GB
Nvidia Driver Version: 473.81 and 537.13
CUDA Version: 11.6
CUDNN Version: 8.5
Operating System + Version: Windows 10
Python Version (if applicable): 3.10.10
TensorFlow Version (if applicable):
PyTorch Version (if applicable): 1.13.1 + cu116
Baremetal or Container (if container which image + tag):
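
For completeness, a quick way to confirm these versions from Python (a minimal sanity check; the expected outputs for this environment are shown as comments, while the driver version itself is reported by nvidia-smi):

import torch

print(torch.__version__)               # 1.13.1+cu116
print(torch.version.cuda)              # 11.6
print(torch.backends.cudnn.version())  # 8500 (cuDNN 8.5)
print(torch.cuda.get_device_name(0))   # Quadro T1000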

Relevant Files


Steps To Reproduce

import torch

session = torch.hub.load('ultralytics/yolov5', 'custom', model_path)  # model_path points to the exported TensorRT engine
result_inference = session(images, size=(256, 256))
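
To quantify the slowdown, below is a minimal sketch of the timing loop used to compare the two drivers. model_path is a placeholder for the exported engine, the input is a dummy image, and torch.cuda.synchronize() is called so asynchronous GPU work is included in each measurement:

import time
import numpy as np
import torch

model_path = 'yolov5s.engine'  # hypothetical path to the exported engine
session = torch.hub.load('ultralytics/yolov5', 'custom', model_path)
image = np.zeros((256, 256, 3), dtype=np.uint8)  # dummy input

for _ in range(10):  # warm-up so one-time CUDA initialization is not timed
    session(image, size=(256, 256))

torch.cuda.synchronize()
times = []
for _ in range(100):
    start = time.perf_counter()
    session(image, size=(256, 256))
    torch.cuda.synchronize()  # wait for the GPU before stopping the clock
    times.append(time.perf_counter() - start)

print(f'mean {1e3 * sum(times) / len(times):.1f} ms, '
      f'min {1e3 * min(times):.1f} ms, max {1e3 * max(times):.1f} ms')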

Hi,

We recommend you use the latest TensorRT version, 8.6.1.
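
Note that serialized engines are not portable across TensorRT versions, so after upgrading you will need to re-export the .engine file. A quick sketch to verify that an engine deserializes under the installed TensorRT (the engine path is a placeholder):

import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
runtime = trt.Runtime(logger)
with open('yolov5s.engine', 'rb') as f:  # hypothetical path
    engine = runtime.deserialize_cuda_engine(f.read())
print('TensorRT', trt.__version__, '-',
      'engine OK' if engine else 'engine incompatible, re-export it')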

Thank you.