Parallel branching in DeepStream 6.4

Please provide complete information as applicable to your setup.

• Hardware Platform (Jetson / GPU) : NVIDIA GeForce RTX 3090
• DeepStream Version : 6.4
• TensorRT Version : 12.2
• NVIDIA GPU Driver Version (valid for GPU only) : 535.104.05

I am trying to build a branched pipeline like the one in the image below.

After running this pipeline with two branches, I attached a probe function to branch1, but I still see results from branch2.

If I comment out branch2, I cannot find any detections.

I also tried putting nvstreamdemux and nvstreammux after the tee plugin on branch1, with branch2 still commented out, but the pipeline does not work.

So I am seeing abnormal behavior and I don't know where the problem is. Can you tell me how to run this pipeline with branches, as shown in the image, successfully?

Appreciate your help

“tee" only clone the buffers to branches but not copy the buffers, the buffers in branch 1 are exactly the buffers in branch 2.

What is the purpose of your branch1 and branch2?

You may refer to GitHub - NVIDIA-AI-IOT/deepstream_parallel_inference_app: A project demonstrating how to use nvmetamux to run multiple models in parallel.

I checked the deepstream_parallel_inference_app you mentioned before asking, and it didn't work, as described above.

What is the difference between clone and copy?

The purpose of branch1 and branch2 is to parallelize inference and minimize time: since sgie2 depends only on pgie, running the SGIEs in parallel should minimize total inference time.

What do you mean? The deepstream_parallel_inference_app just shows you how to handle the batched data with tee.

To keep the explanation simple:
“clone” means the “buffer” is the same “buffer”. tee (subprojects/gstreamer/plugins/elements/gsttee.c · main · GStreamer / gstreamer · GitLab) just creates a new “pointer” that points to the same “buffer”. When you change the “buffer” content through one “pointer” in one branch, you see the same change in the other branch through the other “pointer”, since they point to the same “buffer”.
“copy” creates a whole new “buffer”.
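The aliasing behavior can be sketched in plain Python, with two variables acting as the two branch “pointers” to one buffer (a `bytearray` stands in for the GstBuffer here; this is an illustration of the concept, not DeepStream API):

```python
import copy

# "clone": two names (pointers) referencing the same underlying buffer,
# like tee's two branches.
buffer = bytearray(b"frame")
branch1 = buffer          # new reference, same buffer
branch2 = buffer          # another reference, still the same buffer

branch1[0:1] = b"F"       # modify the buffer through branch1 ...
assert branch2 == bytearray(b"Frame")  # ... and branch2 sees the change

# "copy": a whole new buffer; the branches become independent.
branch3 = copy.deepcopy(buffer)
branch3[0:1] = b"x"       # modifying the copy ...
assert buffer == bytearray(b"Frame")   # ... leaves the original untouched
```

This is why a probe on branch1 can show metadata written by elements on branch2: both branches operate on the same buffer and its attached metadata.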

Is your purpose to minimize latency or to minimize processing time?

Thanks for the details about clone and copy.

Yes, I want to use multiple branches to make the models work in parallel, which will reduce processing time and also improve whole-pipeline latency.

So can you help me with this case, please?

For your PGIE + multiple SGIEs case, multiple branches are not necessary. The parallel pipeline may not be faster than a normal pipeline such as /opt/nvidia/deepstream/deepstream/sources/apps/sample_apps/deepstream-test2.

You can try this parallel pipeline under the /opt/nvidia/deepstream/deepstream/sources/apps/sample_apps/deepstream-test2 directory.

gst-launch-1.0 \
  nvstreammux batch-size=2 width=1920 height=1080 name=mux ! \
  nvinfer config-file-path=./dstest2_pgie_config.txt batch-size=2 ! \
  nvtracker ll-config-file=/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/config_tracker_NvDCF_perf.yml ll-lib-file=/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so display-tracking-id=0 ! \
  tee name=t \
  t.src_0 ! nvvideoconvert ! 'video/x-raw(memory:NVMM),width=1920,height=1088' ! \
  nvinfer config-file-path=./dstest2_sgie2_config.txt ! \
  nvmultistreamtiler width=1920 height=2160 rows=2 columns=1 ! queue ! \
  nvdsosd display-text=1 display-bbox=1 ! nveglglessink \
  uridecodebin uri=file:///opt/nvidia/deepstream/deepstream/samples/streams/sample_1080p_h264.mp4 ! mux.sink_0 \
  uridecodebin uri=file:///opt/nvidia/deepstream/deepstream/samples/streams/sample_1080p_h265.mp4 ! mux.sink_1 \
  t.src_2 ! queue ! nvvideoconvert ! 'video/x-raw(memory:NVMM),width=1920,height=1088' ! \
  nvinfer config-file-path=./dstest2_sgie1_config.txt ! \
  nvmultistreamtiler rows=1 columns=2 width=3840 height=1080 ! nvdsosd ! nveglglessink

I used your pipeline and now it is working, but the FPS is lower, as shown in the screenshot below.

You mentioned that multiple branches are not necessary for my case. Can you explain why the multiple branches didn't reduce latency or processing time, and why performance actually got worse?

You can compare the pipeline in /opt/nvidia/deepstream/deepstream/sources/apps/sample_apps/deepstream-test2 with the so-called parallel pipeline I gave you. Extra conversion and processing are needed to separate the “buffers” for the branches, and that will not make the pipeline faster.
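A rough back-of-the-envelope illustration of this trade-off (all stage timings below are hypothetical numbers chosen for illustration, not measurements from this pipeline): even if the two SGIEs overlap perfectly, the per-branch conversion overhead can cancel out the saving.

```python
# Hypothetical per-frame stage costs in milliseconds; purely illustrative.
pgie = 10.0
sgie1 = 4.0
sgie2 = 4.0
branch_overhead = 6.0  # assumed cost of the extra nvvideoconvert/separation per branch

# Serial pipeline (deepstream-test2 style): stages run back to back.
serial_ms = pgie + sgie1 + sgie2                      # 18.0

# Branched pipeline: the SGIEs overlap, but the tee branches add overhead.
parallel_ms = pgie + max(sgie1, sgie2) + branch_overhead  # 20.0

print(serial_ms, parallel_ms)  # the "parallel" version can end up slower
```

With these assumed numbers the branched pipeline is slower (20.0 ms vs 18.0 ms per frame); parallel branches only pay off when the overlapped inference time saved exceeds the added branching overhead.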

Is the extra conversion and processing already implemented in DeepStream, or should I implement it myself?

I'd appreciate more clarification.

DeepStream is an SDK. The pipeline I provided is one implementation.