YOLOv3 on Jetson Nano with GPU=1 really slow and freezes

Hi everyone!

I just received my Jetson Nano and wanted to get YOLOv3 running! But I can’t get it to work yet and I’d appreciate some help. Below I detail what I did, and there is more detail on my setup at the end.

{Installation instructions}

After following the {Setup Details} (see them at the end of this post), I followed the setup instructions at
https://pjreddie.com/darknet/

I compiled with the original Makefile and ran YOLOv3 on the test image:

./darknet detect cfg/yolov3.cfg yolov3.weights data/dog.jpg

It runs, but slowly (~90 seconds).

After that, I changed the Makefile (GPU=1, CUDNN=1, OPENCV=1) and recompiled to use the GPU. I ran the same test:

./darknet detect cfg/yolov3.cfg yolov3.weights data/dog.jpg

but this time it is extremely slow, and it freezes and restarts (or the process gets killed) at layer 9.
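That freeze-then-kill pattern while the network is loading often points to the kernel’s out-of-memory (OOM) killer. One quick way to check after darknet dies (a sketch; reading the kernel log may require sudo depending on kernel settings):

```shell
# Look for OOM-killer activity in the kernel log after darknet dies.
# (dmesg may require sudo on some systems.)
dmesg 2>/dev/null | grep -iE "out of memory|oom-kill|killed process" \
  || echo "no OOM events found in the kernel log"
```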

I read a lot about possible causes:

  • I changed the ARCH setting in the Makefile, to fix a possible incompatibility between CUDA and the GPU:
ARCH= -gencode arch=compute_53,code=[sm_53,compute_53] \
      -gencode arch=compute_62,code=[sm_62,compute_62]

  • I applied the fix for a minor known darknet issue:
https://github.com/pjreddie/darknet/issues/1141
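For context on that ARCH change: the Nano’s integrated GPU is a 128-core Maxwell part with compute capability 5.3, so for the Nano alone the first gencode line should be sufficient (the compute_62 entry targets the TX2’s Pascal GPU; keeping it only lengthens the build). A minimal Makefile fragment, assuming a Nano-only build:

```makefile
# Minimal ARCH for the Jetson Nano's Maxwell GPU (compute capability 5.3)
ARCH= -gencode arch=compute_53,code=[sm_53,compute_53]
```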

{Setup Details}
  • nv-jetson-nano-sd-card-image-r32.2.1
  • I’m using a 5V/4A power supply

  • I ran install_basics.sh to add the CUDA directories to the PATH and LD_LIBRARY_PATH variables:

https://jkjung-avt.github.io/setting-up-nano/

  • I added 8 GB of swap memory:

$ sudo fallocate -l 8G /mnt/8GB.swap
$ sudo mkswap /mnt/8GB.swap
$ sudo swapon /mnt/8GB.swap
To make the swap persistent across reboots, I added this line to /etc/fstab:

/mnt/8GB.swap none swap sw 0 0

  • I switched to high-power mode:

$ sudo nvpmodel -m 0
$ sudo jetson_clocks
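As a sanity check that the swap and power-mode steps took effect (a quick sketch; nvpmodel only exists on Jetson boards):

```shell
# Confirm the swap file is active and check overall memory headroom
free -h
swapon --show
# Query the current power mode (Jetson-only tool; mode 0 is MAXN)
command -v nvpmodel >/dev/null && sudo nvpmodel -q \
  || echo "nvpmodel not found (not running on a Jetson)"
```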

hello eimarinb.telefonica,

please also check Topic 1050377 for the deep learning inference benchmark instructions.
thanks

Thanks, @JerryChang!
A couple of updates:

SSD-Mobilenet-V2
I couldn’t follow step 2: I tried to apply the patch from the terminal, but it wouldn’t apply, so I tried the next steps anyway and got an error. Can you walk me through how to apply the patch?

Image classification

ResNet-50
Average over 10 runs is 28.1001 ms

Inception V4
Average over 10 runs is 96.5391 ms

VGG-19
Average over 10 runs is 101.042 ms

U-Net Segmentation
error

Pose Estimation
Average over 10 runs is 71.6836 ms

Tiny YOLOv3
Inference time per image: 31.2 ms (for 5 test images provided)

Hi,

Sorry for the late update.
AFAIK, Darknet uses OpenCV as the camera interface, which is slow due to its CPU-memory implementation.

We have several YOLO samples for Jetson systems, and they are worth trying first:
Pure TensorRT: /usr/src/tensorrt/samples/python/yolov3_onnx/
Integrated with Deepstream: /opt/nvidia/deepstream/deepstream-4.0/sources/objectDetector_Yolo
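For the pure-TensorRT sample, the typical flow is roughly the following (a sketch; the script names are from the JetPack 4.x TensorRT sample directory, so please verify them locally):

```shell
# Run the pure-TensorRT YOLOv3 sample if it is installed
# (script names as shipped in the JetPack 4.x TensorRT samples; verify locally)
SAMPLE=/usr/src/tensorrt/samples/python/yolov3_onnx
if [ -d "$SAMPLE" ]; then
  cd "$SAMPLE"
  python3 yolov3_to_onnx.py      # convert Darknet cfg/weights to ONNX
  python3 onnx_to_tensorrt.py    # build a TensorRT engine and run inference
else
  echo "TensorRT YOLOv3 sample not found at $SAMPLE"
fi
```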

For your reference, here is a discussion about reaching 20 FPS with the YOLOv3 model on the Nano:
https://devtalk.nvidia.com/default/topic/1064871/deepstream-sdk/deepstream-gst-nvstreammux-change-width-and-height-doesn-t-affect-fps/post/5392823/#5392823

Thanks.

Thanks! I’ll check it out now.