YOLOv4 on TensorFlow: first inference is very slow (takes minutes) on NVIDIA A30 GPU

Samir-Och · May 31, 2022, 6:19pm

Description

I’m running YOLOv4 on an NVIDIA A30, but the first inference is very slow: it takes 15-20 minutes.

Environment

GPU Type: NVIDIA A30
Nvidia Driver Version: 512.78
CUDA Version: 10.1
CUDNN Version: 7.6
Operating System + Version: Windows Server 2016
Python Version (if applicable): Python 3.7
TensorFlow Version (if applicable): 2.3.0

Relevant Files

I’m using this code for testing: GitHub - theAIGuysCode/yolov4-deepsort: Object tracking implemented with YOLOv4, DeepSort, and TensorFlow.

Can anyone please give me a suggestion to speed up the first inference?

spolisetty · June 1, 2022, 5:48am

Hi,

This looks more within the scope of TensorFlow. We recommend that you reach out on Issues · theAIGuysCode/yolov4-deepsort · GitHub or https://discuss.tensorflow.org/ to get better help.

Thank you.

Samir-Och · June 1, 2022, 6:30pm

Hi Spolisetty,

Thanks for your response.

The problem was the CUDA version for the Ampere architecture. I changed CUDA 10.1 to CUDA 11.0 and cuDNN 7.6 to cuDNN 8.2. Now the first inference is fast, taking less than 30 seconds, and subsequent inferences run at approximately 20 FPS (I thought the A30 would be faster).
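A minimal sketch of how the first-inference time and the steady-state frame rate could be measured separately, so the two numbers above can be compared before and after a CUDA change. The helper name `measure_first_and_steady` and the stand-in `fake_infer` are hypothetical; in practice `infer` would wrap one call to the actual model.

```python
import time


def measure_first_and_steady(infer, warm_runs=20):
    """Time the first call on its own, then average the calls that follow.

    `infer` is any zero-argument callable that runs one inference.
    Returns (first_call_seconds, steady_state_fps).
    """
    start = time.perf_counter()
    infer()  # first call: includes any one-time setup cost
    first_seconds = time.perf_counter() - start

    start = time.perf_counter()
    for _ in range(warm_runs):
        infer()
    steady_seconds = time.perf_counter() - start

    fps = warm_runs / steady_seconds if steady_seconds > 0 else float("inf")
    return first_seconds, fps


if __name__ == "__main__":
    # Stand-in for a real model call: slow once, fast afterwards.
    state = {"initialized": False}

    def fake_infer():
        if not state["initialized"]:
            time.sleep(0.2)  # simulated one-time startup cost
            state["initialized"] = True
        time.sleep(0.005)  # simulated per-frame cost

    first, fps = measure_first_and_steady(fake_infer)
    print(f"first inference: {first:.3f} s, steady state: {fps:.1f} FPS")
```

Timing the first call separately keeps the one-time startup cost from distorting the FPS figure.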

For reference, I read somewhere a recommendation to use CUDA 10 for the Turing architecture and CUDA 11 for the Ampere architecture. It worked for me.
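The rule of thumb above can be sketched as a small compatibility check. This is an illustration, not an official tool: the table below lists only the minimum CUDA toolkit release that natively targets a few compute capabilities (7.5 for Turing, 8.0 for Ampere parts such as the A30, 8.6 for later Ampere parts), and the names `MIN_CUDA_FOR_CC`, `parse_version`, `has_native_support`, and `report_tensorflow_build` are hypothetical. A likely explanation for the long first run, stated here as an assumption, is that a build made with an older toolkit has no native code for the GPU and the driver has to compile kernels at first use.

```python
# Minimum CUDA toolkit release with native support for a compute capability.
# Only a few entries are listed; extend as needed.
MIN_CUDA_FOR_CC = {
    (7, 5): (10, 0),  # Turing, e.g. T4
    (8, 0): (11, 0),  # Ampere, e.g. A100, A30
    (8, 6): (11, 1),  # Ampere, e.g. RTX 30 series
}


def parse_version(text):
    """Turn a version string such as '11.0' or '11' into (major, minor)."""
    parts = text.strip().split(".")
    major = int(parts[0])
    minor = int(parts[1]) if len(parts) > 1 else 0
    return major, minor


def has_native_support(cuda_version, compute_capability):
    """Return True if this CUDA release natively targets the given GPU."""
    required = MIN_CUDA_FOR_CC.get(tuple(compute_capability))
    if required is None:
        raise KeyError(f"compute capability {compute_capability} not in table")
    return parse_version(cuda_version) >= required


def report_tensorflow_build():
    """Print what the installed TensorFlow reports (requires TensorFlow 2.3+).

    The exact format of the build-info values can differ between platforms,
    so they are only printed here, not parsed.
    """
    import tensorflow as tf

    print("build info:", dict(tf.sysconfig.get_build_info()))
    print("visible GPUs:", tf.config.list_physical_devices("GPU"))


if __name__ == "__main__":
    for cuda in ("10.1", "11.0"):
        ok = has_native_support(cuda, (8, 0))
        print(f"CUDA {cuda} with compute capability 8.0: native support = {ok}")
```

With the two configurations from this thread, the check reports no native support for CUDA 10.1 on compute capability 8.0 and native support for CUDA 11.0.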

Environment

GPU Type: NVIDIA A30
Nvidia Driver Version: 512.78
CUDA Version: 11.0
CUDNN Version: 8.2
Operating System + Version: Windows Server 2016
Python Version (if applicable): Python 3.7
TensorFlow Version (if applicable): 2.4.0 (specifically tensorflow-gpu)

Reference

To choose the CUDA version for TensorFlow, I followed the GPU table on the TensorFlow page Build from source | TensorFlow.

system · June 15, 2022, 6:31pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.
