Workflow containing using multiple models for inference

silentjcr · August 7, 2023, 8:59am

I trained an ActionRecgonitionNet model by using the provided Colab sample code and exported it to get a .etlt file.

I provided it to my colleague who’s been working on DeepStream. He tried the model and it worked correctly, but he tried it on a video clip containing multiple persons.

The result of the action recognition kept changing as people in the said video clip were doing different actions.

I wonder if it’s possible to do the following thing on DeepStream:

First, detect all the persons appearing in the image using models like YOLO-v4 or PeopleNet.
Suppose that N persons are detected as described above and we have their positions. For each person, do action recognition respectively, which means that their appearance on the image is used as the input for the action recognition net. Therefore, if there’re N persons detected, the inference of the action recognition net has to be done N times.

Fiona.Chen · August 7, 2023, 10:50am

It is supported to use nvpreprocess + nvinfer as SGIE since DeepStream 6.2. Gst-nvdspreprocess (Alpha) — DeepStream 6.2 Release documentation. Please use “process-on-frame” of nvdspreprocess to control the nvpreprocess to work as PGIE or SGIE. For such action recognization model, since it needs continuous images of the same person, it is recommend to add nvtracker to track the person so that you can identify the bbox for the same person in nvdspreprocess library with track-id.

We will publish a similar sample to show how to use nvdspreprocess as SGIE and how to collect succeeded bboxes for the same person with track-id in nvdspreprocess library. Please wait for the new sample.

Fiona.Chen · August 15, 2023, 10:23am

The nvpreprocess+nvinfer(nvinferserver) as SGIE sample is published: deepstream_tao_apps/apps/tao_others/deepstream-pose-classification at master · NVIDIA-AI-IOT/deepstream_tao_apps (github.com)

system · September 5, 2023, 2:06am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Multiple person action recognition pipeline on deepstream with PeopleNet as pgie DeepStream SDK	4	566	March 16, 2023
Human action recognition in deepstream DeepStream SDK	5	309	September 13, 2022
ActionRecognitionNet as SGIE for multiple person action recognition DeepStream SDK deepstream	7	24	February 13, 2025
Could ActionRecoginitonNet be segmented by person? DeepStream SDK gstreamer	3	296	July 26, 2022
Detecting pose of a specific person using PoseClassificationNet DeepStream SDK	4	314	February 6, 2024
Deepstream Multi Stream detection DeepStream SDK	8	272	February 12, 2024
What is an efficient way to detect people with faces? DeepStream SDK	20	1945	June 20, 2022
How detect action in post-process in the app in human pose estimation? DeepStream SDK deep-learning	11	1022	March 9, 2023
How to process every third frame from the input stream DeepStream SDK	7	185	February 27, 2024
Multiple objects are detected using peoplenet DeepStream SDK	4	19	August 23, 2024

Workflow containing using multiple models for inference

Related topics