Speed up yolov3 inference on nano (deepstream 4.0.1) using Coral USB accelerator?

imbatraman · October 7, 2019, 6:16am

Hey guys,

I’ve got full yolov3 running on the Jetson nano using deepstream 4.0.1, the inference fps is very slow ~2. I am trying to speed this up and I am wondering if we can do something using the Google Coral USB accelerator. (Get started with the USB Accelerator | Coral)

It’s meant to work on Ubuntu 10.0+.

The coral usually takes tensorflow models and speeds them up, but how would this work in the case of the native yolov3 model on deepstream 4.0.1?

CJR · October 10, 2019, 6:44pm

Hi,

Deepstream uses TensorRT SDK for inference which is not supported on coral. The engine files from TensorRT are already optimized. The dafault configs for yolo samples in Deepstream SDK perform inference on every frame of the video. You can make use of the tracker and add intervals between successive inferences and then obtain a higher throughput. See related discussion here - https://devtalk.nvidia.com/default/topic/1058668

Since yolov3 is a compute heavy model you can also try the following -

switch to yolov3-tiny
Use fp16 mode for inference

You can also see this config file on how various plugin properties are set for jetson nano hardware specifically. - source8_1080p_dec_infer-resnet_tracker_tiled_display_fp16_nano.txt

pranaynhbpn · November 26, 2019, 7:08am

Hi,
@NvCJR thanks for your reply.
But I realised that on Nano the FPS reduces drastically when multiple models are loaded also even for a single model the load times are too long .
None the less the FPS offered by Coral USB accelerator can be leveraged with jetson Nano’s GPU specially when lower cost PCI-e accelerators can be added.

So wanted to know community’s opinion on the following method to pursue the integration of Coral USB accelerator with Deepstream:

Step 1. Use AppSrc and AppSink similar to https://github.com/google-coral/examples-camera/blob/master/gstreamer/gstreamer.py to do inferencing on an image pipeline via Coral USB accelerator → Questions in this step are

What do you think about this approach ? Any pitfalls or this idea is not feasible at all?
How to handle situations when Batch>1

Step 2. After step 1 is complete and we have the bouning boxes. We feed the bounding boxes detected into nvTracker by injecting NvDsObjectMeta into NvDsFrameMeta. Questions in this step:

But not sure if this step is feasible-->Need feedback here and some pointers/examples if possible

Any feedback will really help.
Thanks

pranaynhbpn · November 26, 2019, 12:03pm

Hi started a new thread :https://devtalk.nvidia.com/default/topic/1067239/deepstream-sdk/using-coral-usb-accelerator-with-jetson-nano/ for the above-mentioned query as I thought its better that way as the original poster had accepted the answer to the question

Hi,
@NvCJR thanks for your reply.
But I realised that on Nano the FPS reduces drastically when multiple models are loaded also even for a single model the load times are too long .
None the less the FPS offered by Coral USB accelerator can be leveraged with jetson Nano’s GPU specially when lower cost PCI-e accelerators can be added.

So wanted to know community’s opinion on the following method to pursue the integration of Coral USB accelerator with Deepstream:

Step 1. Use AppSrc and AppSink similar to https://github.com/google-coral/examples-camera/blob/master/gstreamer/gstreamer.py to do inferencing on an image pipeline via Coral USB accelerator → Questions in this step are

What do you think about this approach ? Any pitfalls or this idea is not feasible at all?

How to handle situations when Batch>1

Step 2. After step 1 is complete and we have the bouning boxes. We feed the bounding boxes detected into nvTracker by injecting NvDsObjectMeta into NvDsFrameMeta. Questions in this step:

But not sure if this step is feasible-->Need feedback here and some pointers/examples if possible

Any feedback will really help.
Thanks

Topic		Replies	Views
Low fps when doing object detection on jetson nano Jetson Nano jetson-inference	19	8769	March 1, 2022
Full Yolov3 on the nano using TensorRT or Deepstream 4.0.1 Jetson Nano	7	2496	October 14, 2021
Python wrapper for tensorrt implementation of Yolo (currently v2) Jetson Nano	32	7988	July 2, 2020
Improve inference performances yolov5 Jetson Nano yolo , nano2gb	4	1538	June 2, 2022
Yolov3 in nanojetson Jetson Nano tensorrt	12	1074	October 18, 2021
Using deepstream with yolo models - performance on jetson nano? DeepStream SDK	3	906	October 12, 2021
Deepstream 4 + yolov3 multi source slow DeepStream SDK	9	1813	October 12, 2021
deepstream-yolo-app performance vs Tensor-Core optimized yolo-darknet DeepStream SDK	9	3614	October 12, 2021
Running YOloV4 on jetson Nano at Higher FPS? Jetson TX2 yolo	8	10341	October 18, 2021
Detection + classification + Tracking code with Deepstream sdk DeepStream SDK	8	439	August 1, 2023

Speed up yolov3 inference on nano (deepstream 4.0.1) using Coral USB accelerator?

Related topics