Python wrapper for tensorrt implementation of Yolo (currently v2)

moshe.livne · May 22, 2019, 10:29am

I have made a wrapper for the deepstream trt-yolo program. It was not easy, but it’s done.
Inference speed on the Nano in 10W mode (not MAXN) is 85 ms/image (including pre-processing and NMS, unlike the NVIDIA benchmarks :) ), which is FAR faster than anything else I have tried.
Also, load time is very fast after the first engine compilation.

The code is a bit rough and still needs a lot of attention, but I would be grateful if anyone could try following the installation instructions because, sadly, I ran out of memory cards and don’t want to erase this one.

The GitHub repo is at https://github.com/mosheliv/tensortrt-yolo-python-api . Feel free to contribute to it. I still need to implement a wrapper for Yolov3 (shouldn’t be hard), add support for better image resizing logic (currently 416 is hardcoded), and add error handling.

Any comments are welcome. It has been a while since I used C++ (we are talking decades here) and this wrapper was tricky.

AastaLLL · May 23, 2019, 7:04am

Hi,

Thanks for sharing.

romilly.cocking · May 23, 2019, 5:26pm

@moshe I needed to run
sudo apt-get install libgflags-dev

I will let you know if I hit more issues.

moshe.livne · May 23, 2019, 11:51pm

Added that, and also added support for Yolov3; both tiny and regular work now. Yolov2 tiny is the fastest…

rick_kan · May 25, 2019, 6:14pm

@moshe Thanks for this initiative. I’m working on a video analytics project and would like to know whether I can use your API to analyse a video stream on the Jetson Nano. Thanks in advance!

moshe.livne · May 25, 2019, 10:31pm

Should work fine. Also, please check this slightly more formal and better written version of SSD models by NVIDIA: https://devtalk.nvidia.com/default/topic/1051699/jetson-nano/what-almost-everyone-with-a-nano-is-looking-for/post/5342736/#5342736

Note that, apart from the inference speed, you need to handle the image acquisition/processing pipeline properly to get high throughput. In plain English: you need at least two threads, one fetching images from the camera and the other processing them. Better still is a third thread doing the image pre-processing before inference.
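
A minimal sketch of the two-thread arrangement described above. The names `run_pipeline`, `read_frame` and `detect` are placeholders chosen for illustration and are not part of the wrapper's API: `read_frame` is assumed to return the next frame (or None when the source ends) and `detect` is assumed to run inference on one frame.

```python
import queue
import threading

_DONE = object()  # sentinel marking the end of the stream


def run_pipeline(read_frame, detect, max_pending=2):
    """Fetch frames on a background thread and run inference on the caller's.

    read_frame: callable returning the next frame, or None when finished.
    detect:     callable taking one frame and returning its detections.
    Yields the result of detect() for every frame, in capture order.
    """
    # A small bounded queue lets the fetcher run ahead by a few frames
    # without letting memory grow if inference is the slower side.
    frames = queue.Queue(maxsize=max_pending)

    def fetch():
        try:
            while True:
                frame = read_frame()
                if frame is None:
                    break
                frames.put(frame)  # blocks while the queue is full
        finally:
            frames.put(_DONE)      # always wake the consumer, even on error

    threading.Thread(target=fetch, daemon=True).start()

    while True:
        frame = frames.get()
        if frame is _DONE:
            return
        yield detect(frame)
```

This version blocks the fetcher when inference falls behind, so no frame is skipped. For a live camera you may prefer to discard stale frames instead; that is a design choice the thread does not settle.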

MtHiker · May 30, 2019, 8:34am

Hi moshe,

I tried to run your work and got some errors.
At step 6, the file path seems to be missing ‘build’ after trt-yolo.
And when I run the trt-yolo-app application, I get this:

trt-yolo-app: /home/nano/Downloads/deepstream_reference_apps/yolo/lib/yolo_config_parser.cpp:122: bool verifyRequiredFlags(): Assertion `(FLAGS_config_file_path.find(".cfg") != std::string::npos) && "config file not recognised. File needs to be of '.cfg' format"' failed.
Aborted (core dumped)

Finally, could you explain what the XXXXXXX in step 10 means?

Thanks

moshe.livne · May 30, 2019, 11:27am

Yes, sorry, my bad… I didn’t run cmake and make from the build directory.
You need to add build, as you already found. The other error is, I think, from missing one “…” because of not being in the build directory. It is better to give it an absolute path.

I have fixed and checked the readme. Hopefully following the instructions now will result in a working project… I recommend removing the deepstream directory and cloning it again.

MtHiker · May 30, 2019, 11:40pm

Thanks Moshe,

I will test it again and update soon

MtHiker · May 31, 2019, 8:51am

Hi moshe,

I tested your updated resources; here are some comments.

1. Step 5.ii: to build, I had to remove 'set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -fPIC")'.
2. Step 9.iii: it returns an error:

File does not exist : data/labels.txt
trt-yolo-app: /home/nano/Downloads/deepstream_reference_apps/yolo/lib/trt_utils.cpp:124: std::vector<std::__cxx11::basic_string<char> > loadListFromTextFile(std::__cxx11::string): Assertion `fileExists(filename)' failed.
Aborted (core dumped)

To run it, I had to move to the $YOLO_ROOT directory.
3. Step 15: as in 2 above, it succeeds when run from $YOLO_ROOT.

Good work! Way better!
I also hope your work will support camera input.

Thanks,

moshe.livne · May 31, 2019, 9:25am

Thank you for helping me in making this stable! Much appreciated.

As for (1), this is really strange… Without this line I got an error while linking. I’ll retry from scratch and see. Did you get an error?
I’ll also recheck the paths. It basically looks like you didn’t do step 14… the paths in the yolo configs are relative.
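
Because the config paths are relative, anything that loads them has to run with $YOLO_ROOT as the current directory. A minimal sketch of one way to do that from Python follows; `working_directory` is a helper name invented here, not something the repository provides.

```python
import contextlib
import os


@contextlib.contextmanager
def working_directory(path):
    """Temporarily switch the current directory, restoring it afterwards."""
    previous = os.getcwd()
    os.chdir(path)
    try:
        yield
    finally:
        # Restore the original directory even if the body raised.
        os.chdir(previous)
```

Usage would look like `with working_directory(os.environ["YOLO_ROOT"]): ...`, with the code that reads the yolo configs placed inside the block. This assumes YOLO_ROOT is exported in your environment.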

It basically supports anything you throw at it… If you want to use a camera, use OpenCV’s VideoCapture to get the frames. I am currently writing a home security system with threaded fetching from the camera, motion detection, object detection and snippet recording. If that is also what you are aiming for, you can wait for it… It’s not yet at a stage where I can release it, but give me a week or two…
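
A minimal sketch of the VideoCapture approach mentioned above. The helper names `iter_frames` and `open_camera` are made up for illustration; only `cv2.VideoCapture`, `isOpened()` and `read()` are actual OpenCV calls. How each frame is then handed to the wrapper depends on its API, which is not shown in this thread.

```python
def iter_frames(capture):
    """Yield frames from any object with OpenCV's read() -> (ok, frame) API."""
    while True:
        ok, frame = capture.read()
        if not ok:  # camera unplugged or end of the video file
            break
        yield frame


def open_camera(index=0):
    """Open a local camera with OpenCV (requires the cv2 package)."""
    import cv2  # imported lazily so iter_frames works without OpenCV

    capture = cv2.VideoCapture(index)
    if not capture.isOpened():
        raise RuntimeError(f"could not open camera {index}")
    return capture
```

A loop such as `for frame in iter_frames(open_camera()):` would then pass each frame to whatever detection call you use.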

MtHiker · May 31, 2019, 11:35am

Good!

When you update the repo, I will be happy to test it!

moshe.livne · June 1, 2019, 7:24am

Can you please tell me why you removed the NVCC line? What error did you get?

MtHiker · June 2, 2019, 11:51pm

Hi Moshe,

Here is an update on what I did:

The default Jetson Nano SD image is used.

If I add the NVCC flag line to CMakeLists.txt, it returns the following:

nano@nano-devel:~/Downloads/deepstream_reference_apps/yolo/apps/trt-yolo/build$ sudo make install
[  7%] Building NVCC (Device) object lib/CMakeFiles/cuda_compile_1.dir/cuda_compile_1_generated_kernels.cu.o
nvcc fatal   : Unknown option 'fPIC'
CMake Error at cuda_compile_1_generated_kernels.cu.o.Release.cmake:219 (message):
  Error generating
  /home/nano/Downloads/deepstream_reference_apps/yolo/apps/trt-yolo/build/lib/CMakeFiles/cuda_compile_1.dir//./cuda_compile_1_generated_kernels.cu.o


lib/CMakeFiles/yolo-lib.dir/build.make:225: recipe for target 'lib/CMakeFiles/cuda_compile_1.dir/cuda_compile_1_generated_kernels.cu.o' failed
make[2]: *** [lib/CMakeFiles/cuda_compile_1.dir/cuda_compile_1_generated_kernels.cu.o] Error 1
CMakeFiles/Makefile2:122: recipe for target 'lib/CMakeFiles/yolo-lib.dir/all' failed
make[1]: *** [lib/CMakeFiles/yolo-lib.dir/all] Error 2
Makefile:129: recipe for target 'all' failed
make: *** [all] Error 2

Step 14 (modifying $YOLO_ROOT/config/yolov2-tiny.txt) should be done before 9.iii.
In addition to step 14, --test_images should be set properly as well.

Thanks,

moshe.livne · June 3, 2019, 12:16am

Thank you for your patience on this and on helping get it right!

Did you cut and paste the line or did you type it? This is really perplexing… On my Nano it works very well; actually, it does not work without it.

After a bit of research, it might have to do with the version of cmake. Can you please try "--compiler-options -fPIC" instead of "-fPIC"?

MtHiker · June 3, 2019, 1:18am

Hi moshe,

Yeah!
Indeed.

Either the --compiler-options or the -Xcompiler option fixes this issue.

Thanks,

fcrisafulli · June 11, 2019, 8:23pm

Hi Moshe,
Great job, thanks a lot.
Any plan to extend it to video streams?
Cheers
Fabrizio

moshe.livne · June 11, 2019, 8:49pm

Hi,
The problem seems to be that you are trying to use 8 bit, which is not supported by the Nano and needs calibration anyway.

Just Google the weights; they should be very easy to find on the darknet site. I am on the phone now, so it’s awkward, but if you can’t find them, let me know and I will look them up for you later.

razorK · June 29, 2019, 5:31pm

Moshe, good job with your implementation. It’s hard to come across a port of YOLO on TensorRT, not to mention a Python wrapper.
I’ll keep an eye on your project.

ibondokji · July 15, 2019, 2:24pm

Hi,

What version of OpenCV are you using to run this? I have tried OpenCV 3.4.6 and 4.0, and both gave similar errors about an undefined symbol inside libyolo.so.
