Deepstream 5.0 Python app with 3rd party OCR

kutsenkoilya · June 8, 2020, 8:08pm

Good day,

I’m working on a DeepStream 5.0 Python application that will count vehicle traffic flow and OCR license plates in real time on a Xavier device.
I’m looking for guidance and best practices on implementing these features.

As I understand it, the features break down as follows:

1. Vehicle detection can be done using TrafficCamNet
2. Tracking using the DeepStream built-in trackers
3. License plate detection using the back-to-back detector from the DeepStream 4.0 reference apps (deepstream_reference_apps/back-to-back-detectors in NVIDIA-AI-IOT/deepstream_reference_apps on GitHub)
4. For OCR, I’m choosing between Tesseract and OpenALPR

So my question is: what is the best (or fastest to implement) way to add feature #4 (OCR) to a Python application?
The options I know of are:

1. Import tesseract-ocr in the Python code and process license plate cutouts in a separate function
2. Extract the pretrained models from Tesseract/OpenALPR and run them as a TensorRT engine in the DeepStream pipeline
3. Write a GStreamer plugin (this is more like C++ coding)

I’d be glad to hear any feedback. Thanks!

zhliunycm2 · June 18, 2020, 1:23am

Hi! Both Tesseract OCR and OpenALPR have Python bindings:

You can use them from a probe function in your Python app. The sample code on those sites loads images from a file, which would be inefficient. If those bindings can work directly with NumPy arrays (which is what OpenCV images are in Python), that would be much better. Still, it’s possible to save images to a file from the probe function and then reload them.
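
To give a rough idea of the in-memory route, here is a minimal sketch, not a tested DeepStream integration. It assumes the frame is already available as an H x W x C NumPy array and that the plate box comes as (left, top, width, height) in pixels. The helper names `clamp_box` and `ocr_plate` are made up for this example. The default OCR call uses pytesseract's `image_to_string`, which accepts a NumPy array; `--psm 7` (treat the image as a single text line) is just a guess at a reasonable setting for plates.

```python
def clamp_box(left, top, width, height, frame_w, frame_h):
    """Turn a float (left, top, width, height) box into integer slice
    bounds that stay inside the frame. Returns None if nothing is left."""
    x1 = max(0, int(left))
    y1 = max(0, int(top))
    x2 = min(frame_w, int(left + width))
    y2 = min(frame_h, int(top + height))
    if x2 <= x1 or y2 <= y1:
        return None
    return x1, y1, x2, y2


def ocr_plate(frame, box, ocr_fn=None):
    """Crop the plate region out of the frame and run OCR on it.

    frame  : H x W x C NumPy array (assumed; e.g. an RGBA frame)
    box    : (left, top, width, height) of the detected plate
    ocr_fn : callable taking the crop and returning text; defaults to
             pytesseract (hypothetical choice, swap in OpenALPR if preferred)
    """
    frame_h, frame_w = frame.shape[:2]
    bounds = clamp_box(*box, frame_w, frame_h)
    if bounds is None:
        return ""
    x1, y1, x2, y2 = bounds
    crop = frame[y1:y2, x1:x2]  # NumPy slicing, no file round trip

    if ocr_fn is None:
        # Imported lazily so the helpers load without Tesseract installed.
        import pytesseract

        def ocr_fn(img):
            return pytesseract.image_to_string(img, config="--psm 7")

    return ocr_fn(crop).strip()
```

Inside a probe, the box would presumably come from the object's `rect_params` (left, top, width, height) and the frame from whatever image access the Python bindings give you. OCR on every frame is likely too slow for real time, so running it once per tracked object ID is probably the better trade-off.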

Please refer to the image data access app for retrieving the images and processing/saving them with OpenCV: