Tracking with Dlib in Jetson TX2 slower than Nano

elbouziadyabderrahim · October 9, 2019, 3:31pm

Hi all,

I have a program doing detection and tracking of moving vehicles with some statistics visualization. The program is well optimized to work in real-time on Jetson Nano (using TensorRT engine for detection). I moved the same code + models from Nano to TX2 with generating new engine files. The program runs 5-6 times slower than Nano. Normally TX2 has 2 additional Denver CPUs and is supposed to be faster. I measure the processing time for detection and tracking. Detection Time is similar, but the tracking with Dlib is slower. I m using the correlation tracker in Dlib.

Environment (for both TX2 and Nano):

Jetpack L4T 32.2.1
GPU ARCH (6.2 TX2) / (5.3 Nano)
OpenCV 4.1.1 compiled CUDA YES (Compiled from source in a 128GB SD card (TX2) / internal storage (Nano))
TensorRT 5.1.6.1
Dlib 19.17 (Compiled from source in internal storage)
Matplotlib 2.1.1 (installed with pip3)
CUDA 10.0.326
cuDNN 7.5.0.56
VisionWorks 1.6.0.500n
Python 3.6.8

Things I tried and failed :

all the NVP models on TX2
compiled OpenCV with TBB support
use the last version of Dlib with CUDA/LAPACK/BLAS (19.18 released 2/3 weeks ago): https://github.com/davisking/dlib/releases

Are there any ideas, i can try to fix this issue?

Thanks in advance

AastaLLL · October 14, 2019, 5:51am

Hi,

Do you setup the Nano and TX2 from the same JetPack installer?
If yes, you should get the similar performance since the software version is identical.

It looks like you try to setup the nvpmodel but no jetson_clocks?
If yes, would you mind to give it a try.
This script maximize the CPU/GPU clocks and improve the performance.

sudo jetson_clocks

Thanks.

elbouziadyabderrahim · October 14, 2019, 8:02am

Hi AastaLLL,

Thanks a lot for your reply.
No, we setup the Nano with SD card image, and TX2 via SDK Manager installed on an Ubuntu18 VMware machine. Can flashing with VMware create issues? even if it was done successfully?

We tried all nvpmodel modes, with jetson_clock, but no significant improvement so far. Also in The Nano, we are not using the jetson_clocks.

BR

kayccc · October 24, 2019, 1:45am

Hi elbouziadyabderrahim,

" Can flashing with VMware create issues?" → probably, due to some unknown reason, and the VM is not suggested for developing.

elbouziadyabderrahim · October 26, 2019, 9:11pm

Yep, you were right, it creates issues. we flashed and installed everything from scratch using a laptop with native Ubuntu. Now it is working. Thanks

Topic		Replies	Views
TX2: caffe model that runs slower on nvcaffe GPU than on OpenCV CPU Jetson TX2	3	685	October 18, 2021
Running with GPU is slower than CPU on TX2 Jetson TX2	2	779	October 18, 2021
Performance of Tensorflow (1.5) on Jetson TX2 slower than expected Jetson TX2	3	2837	October 18, 2021
Object Detection Performance Jetson Tx2 slower than expected Jetson TX2	22	14894	October 18, 2021
First time run of Tensorflow on Jetson Tx2 is slow. Jetson TX2	2	1289	October 18, 2021
Python Tracking on Jetson Jetson AGX Xavier	5	606	October 18, 2021
Jetson TX2 framerate with face detection and 5 point facial landmark Jetson TX2	2	2492	October 18, 2021
DNN Face detection performance on TX2 Jetson TX2	6	2679	October 18, 2021
Dlib testing on Jetson Nano, TX2 and Xavier Jetson TX2	6	2078	October 18, 2021
100% CPU usage while tracking objects using dlib Jetson TX2	5	1410	October 18, 2021

Tracking with Dlib in Jetson TX2 slower than Nano

Environment (for both TX2 and Nano):

Things I tried and failed :

Related topics