I am working on a system which does the following:
- Capture video frame
- Split frame into multiple images, with each representing an ROI
- Perform image classification on each image
- Aggregate the classifications of all images belonging to a frame; that aggregation is the output for the frame
I went through the nvgstiva app, but since the above is an unconventional model, I am finding it difficult to come up with a design to port it to DeepStream. Can you please provide some suggestions for the DeepStream pipeline?
<<How does the OpenCV preprocessing separate the frame into multiple images? Do you have a detection model, or just a classification model?>>
I have the ROI coordinates in hand. For each ROI:
- Create a mask image of all zeros (except the ROI) using cv2.fillPoly
- masked_image = frame & mask
- Perform inference on masked_image
I am using an image classification model, AlexNet.
<<Which platform will you use?>>
<<Multiple sources, or just one channel?>>
One channel (at least for now).
I think it’s OK to port it to DeepStream. You need to implement your own application, but don’t base it on the nvgstiva app.
The pipeline is like this
opencv capture source + preprocess (masked_image) ->
appsrc plugin ->
gst-nvinfer (sgie) plugin ->
gst-nvosd … -> get your metadata in the application.
I guess the video source is raw data, so no decoding is needed.
For gst-nvinfer, set your own network/model properties.
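As a sketch, the pipeline above could be assembled as a launch string and handed to Gst.parse_launch. The config file name, caps, and element names below are assumptions for illustration (check the exact plugin names and properties your DeepStream version ships, e.g. nvdsosd vs. nvosd for the gst-nvosd stage):

```python
def build_pipeline_desc(infer_config="alexnet_classifier_config.txt",
                        width=1280, height=720):
    """Build a gst-launch-style description for the appsrc -> nvinfer pipeline."""
    # Caps the OpenCV side pushes masked frames with (assumption: RGBA raw video).
    caps = f"video/x-raw,format=RGBA,width={width},height={height},framerate=30/1"
    elements = [
        f'appsrc name=roi_src caps="{caps}"',        # masked_image buffers pushed from the app
        "nvvideoconvert",                            # convert/copy into NVMM memory for nvinfer
        f"nvinfer config-file-path={infer_config}",  # your AlexNet network/model properties
        "nvdsosd",                                   # on-screen display stage; optional if headless
        "fakesink name=sink sync=false",             # metadata is read via a pad probe instead
    ]
    return " ! ".join(elements)

desc = build_pipeline_desc()
# e.g. pipeline = Gst.parse_launch(desc) after Gst.init(None)
```

Classification metadata would then be pulled in a pad probe on one of the downstream elements rather than from the sink itself.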
If you don’t have much GStreamer experience, I suggest you use the low-level TensorRT API directly. It’s easier for your case.
Thanks, I appreciate you taking the time to respond. A couple more questions, please:
- What will be the sink for the GStreamer pipeline?
- Also, will there be any performance difference between using TensorRT directly and the mentioned GStreamer pipeline?