run yolov3-tiny with tensorRT model

ibondokji · September 24, 2019, 2:13pm

hi,

what is the way to run yolov3-tiny optimized with tesnorRT? i have translated the model to onnx then to tensorRT with help from this repo: [url]https://github.com/zombie0117/yolov3-tiny-onnx-TensorRT[/url]

now what is the correct framework to run this model for video inference?

i know that currently deepstream support yolov3-tiny, but i want to be able to run tensortRT model without deepstream.

thanks

AastaLLL · September 25, 2019, 6:37am

Hi,

Have you tried our deepstream SDK?
It contains the samples for YOLOv2, YOLOv2_tiny, YOLOv3 and YOLOv3_tiny model.

https://developer.nvidia.com/deepstream-sdk
/opt/nvidia/deepstream/deepstream-4.0/sources/objectDetector_Yolo

Thanks.

simone.rinaldi · September 25, 2019, 7:14am

I tested yolov3-tiny with deepstream and without it and there is no difference. Number of frames/sec are the same.
I suggest you to use:

Inside you can edit darknet_video.py to your purpouse. I also suggest to use this model:

I’m able to reach 17-18 fps.

ibondokji · September 25, 2019, 7:38am

hi,

yes i have tried deepstream. but i am looking for a way to run dakrnet yolo-tiny model accelerated with tensorrt in python.

hi,

what is the way to run yolov3-tiny optimized with tesnorRT? i have translated the model to onnx then to tensorRT with help from this repo: https://github.com/zombie0117/yolov3-tiny-onnx-TensorRT

now what is the correct framework to run this model for video inference?

i know that currently deepstream support yolov3-tiny, but i want to be able to run tensortRT model without deepstream.

thanks

I tested yolov3-tiny with deepstream and without it and there is no difference. Number of frames/sec are the same.
I suggest you to use:
GitHub - AlexeyAB/darknet: YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

Inside you can edit darknet_video.py to your purpouse. I also suggest to use this model:
GitHub - WongKinYiu/PartialResidualNetworks: partial residual networks

I’m able to reach 17-18 fps.

hi simone.rinaldi,

i was not able to reach this much fps using darknet on a 416416 yolo-tiny model, i had to lower the resolutions to 256256. how did it work for you?

actually i have tested deepstream with a yolo-tiny 416*416 model and it ran on 29 fps. but i dont want to use because i am facing some implications in running deepstream using rtsp url to my cameras. also i need to use my tracking algorithm which is written in python.

simone.rinaldi · September 25, 2019, 7:53am

Me too… also my application is written in python and detect objects from IP cameras using RTSP.
About FPS please take a look to my post:
https://devtalk.nvidia.com/default/topic/1057006/jetson-nano/hello-ai-world-now-supports-python-and-onboard-training-with-pytorch-/post/5384517/#5384517

FPS shown on Nvidia application are related only to network time and not to complete application.

In my application if I get an RTSP stream (1080p 25fps) and detect it I’m able to reach a maximum of 11-13 fps, but consider that a big part of resources are used by opencv in order to draw boxes and to show image in a windows so my suggestion is to create a headless system.

ibondokji · September 25, 2019, 8:41am

hi simone.rinaldi,

i was not able to reach this much fps using darknet on a 416416 yolo-tiny model, i had to lower the resolutions to 256256. how did it work for you?

actually i have tested deepstream with a yolo-tiny 416*416 model and it ran on 29 fps. but i dont want to use because i am facing some implications in running deepstream using rtsp url to my cameras. also i need to use my tracking algorithm which is written in python.

Me too… also my application is written in python and detect objects from IP cameras using RTSP.
About FPS please take a look to my post:
https://devtalk.nvidia.com/default/topic/1057006/jetson-nano/hello-ai-world-now-supports-python-and-onboard-training-with-pytorch-/post/5384517/#5384517

FPS shown on Nvidia application are related only to network time and not to complete application.

In my application if I get an RTSP stream (1080p 25fps) and detect it I’m able to reach a maximum of 11-13 fps, but consider that a big part of resources are used by opencv in order to draw boxes and to show image in a windows so my suggestion is to create a headless system.

thanks for your reference!

what do you mean by network speed? i have tested deepstream with a test video of 5 mins length and 25 fps and it finished in 4 mins and 5 seconds which i thought confirms the fps i see in the terminal. anyway i am currently looking at a tensorflow implementation with tensorRT optimization. you can check the article here: https://medium.com/ardianumam/optimizing-yolov3-using-tensorrt-in-jetson-tx-or-dekstop-2db47865a50b

i have also tested with headless mode. i got for my yolo-tiny 256*256 model including all overhead 17 fps. i want to test with 416 model, but i think i will get sth around 12, where i need a minimum of 15 fps.

simone.rinaldi · September 25, 2019, 12:34pm

hi simone.rinaldi,

i was not able to reach this much fps using darknet on a 416416 yolo-tiny model, i had to lower the resolutions to 256256. how did it work for you?

actually i have tested deepstream with a yolo-tiny 416*416 model and it ran on 29 fps. but i dont want to use because i am facing some implications in running deepstream using rtsp url to my cameras. also i need to use my tracking algorithm which is written in python.

Me too… also my application is written in python and detect objects from IP cameras using RTSP.
About FPS please take a look to my post:
https://devtalk.nvidia.com/default/topic/1057006/jetson-nano/hello-ai-world-now-supports-python-and-onboard-training-with-pytorch-/post/5384517/#5384517

FPS shown on Nvidia application are related only to network time and not to complete application.

In my application if I get an RTSP stream (1080p 25fps) and detect it I’m able to reach a maximum of 11-13 fps, but consider that a big part of resources are used by opencv in order to draw boxes and to show image in a windows so my suggestion is to create a headless system.

thanks for your reference!

what do you mean by network speed? i have tested deepstream with a test video of 5 mins length and 25 fps and it finished in 4 mins and 5 seconds which i thought confirms the fps i see in the terminal. anyway i am currently looking at a tensorflow implementation with tensorRT optimization. you can check the article here: https://medium.com/ardianumam/optimizing-yolov3-using-tensorrt-in-jetson-tx-or-dekstop-2db47865a50b

i have also tested with headless mode. i got for my yolo-tiny 256*256 model including all overhead 17 fps. i want to test with 416 model, but i think i will get sth around 12, where i need a minimum of 15 fps.

Oh! Very interesting, I will try it!
About minimum FPS required, I have a suggestion for you: if you are not able to reach 15 Fps consider to skip frames that you are not able to manage.
In my python application I calculate (each second) how many frames I’m able to manage per second and application drops exceeding frames in order to be always synchronized with real time events.
In this way my application automatically increases or decreases number of frames managed adapting itself in relation to how I configure yolo.

So also if my video is 25 Fps but my application is able to manage only 12-13 fps, my object-detection application has no delay compared to realtime.

jkjung13 · January 4, 2020, 3:53pm

I modified TensorRT ‘yolov3_onnx’ sample and was getting ~14.2 FPS (yolov3-tiny-416) on Jetson Nano. (The FPS measurement included image acquisition and all of preprocessing/postprocessing.) Source code and a corresponding blog post have been shared online. I welcome feedbacks.

https://jkjung-avt.github.io/tensorrt-yolov3/
https://github.com/jkjung-avt/tensorrt_demos#yolov3

Topic		Replies	Views
Yolov3 is very slow Jetson Nano	21	20305	October 14, 2021
Jetson nano crashed when using tiny yolo v3 model Jetson Nano	24	12659	October 18, 2021
Full Yolov3 on the nano using TensorRT or Deepstream 4.0.1 Jetson Nano	7	2504	October 14, 2021
Low fps when doing object detection on jetson nano Jetson Nano jetson-inference	19	9083	March 1, 2022
Recommendation of an existing application for object detection and tracking with jetson Nano, YOLO V3 Tiny and Tensorflow Jetson Nano tensorflow	17	1859	October 12, 2021
0.3fps when using yolov3_onnx in TensorRT examples provided by Nvidia in Jetson Nano Jetson Nano	8	1789	October 14, 2021
Python wrapper for tensorrt implementation of Yolo (currently v2) Jetson Nano	32	8051	July 2, 2020
How to run yolov3-tiny.engine on tensorrt converted by run deepstream-app TensorRT	3	588	September 29, 2022
deepstream-yolo-app performance vs Tensor-Core optimized yolo-darknet DeepStream SDK	9	3652	October 12, 2021
Running YOloV4 on jetson Nano at Higher FPS? Jetson TX2 yolo	8	10515	October 18, 2021

run yolov3-tiny with tensorRT model

Related topics