DeepStream 6 Python app performance degradation

I have developed a custom Python app using DeepStream: a primary detector plus a classifier with custom post-processing (output-tensor-meta=1).
With the DS 5.1 NGC container my app runs at ~250 FPS; with DS 6.0 it runs at ~140 FPS.

For the DS 6.0 container I installed pyds from https://github.com/NVIDIA-AI-IOT/deepstream_python_apps/releases/download/v1.1.0/pyds-1.1.0-py3-none-linux_x86_64.whl

I tried two sample apps from deepstream_python_apps, deepstream-test3 and deepstream-imagedata-multistream, with one file source (sample_720p.mp4) and one small patch: nveglglessink replaced with fakesink sync=0.

  • deepstream-test3.py runs at 350 FPS in DS 5.1 and 340 FPS in DS 6.0.
  • deepstream-imagedata-multistream (without frame saving) runs at 225 FPS in DS 5.1 and 216 FPS in DS 6.0.
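For reference, a minimal average-FPS counter of the kind typically ticked from a pad-probe callback in these apps (a sketch only; the thread doesn't say exactly how FPS was measured, and the injectable `clock` parameter is my addition for testability, not a pyds or sample-app API):

```python
import time

class FPSCounter:
    """Average-FPS counter, meant to be ticked once per frame
    (e.g. from a GStreamer buffer-pad probe callback)."""

    def __init__(self, clock=time.perf_counter):
        self._clock = clock   # injectable time source, for testing
        self._start = None
        self._frames = 0

    def tick(self):
        # Call once per processed frame.
        if self._start is None:
            self._start = self._clock()
        self._frames += 1

    def fps(self):
        if self._start is None:
            return 0.0
        elapsed = self._clock() - self._start
        return self._frames / elapsed if elapsed > 0 else 0.0
```

In a real pipeline you would call `counter.tick()` inside the probe attached to the sink pad and print `counter.fps()` periodically or at EOS.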

I checked resnet10.caffemodel_b1_gpu0_int8.engine (the primary detector for the sample apps) with trtexec. With TRT 7.2.2 (DS 5.1) I got an end-to-end host latency of 0.815338 ms at the 99th percentile and a throughput of 2153.88 qps; with TRT 8.0.1 (DS 6.0) the 99th-percentile end-to-end host latency was 0.802185 ms and the throughput was 2155.33 qps. The engine is therefore even slightly faster in DS 6.0.
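A quick back-of-the-envelope check, using only the figures reported in this thread, makes the point numerically: the engine-level throughput change is negligible, while the app-level drop is large, so the regression cannot come from TensorRT itself.

```python
# Figures taken from the measurements reported above.
trt_722_qps = 2153.88    # trtexec throughput under TRT 7.2.2 (DS 5.1)
trt_801_qps = 2155.33    # trtexec throughput under TRT 8.0.1 (DS 6.0)
app_ds51_fps = 250.0     # custom app FPS in the DS 5.1 container
app_ds60_fps = 140.0     # custom app FPS in the DS 6.0 container

# Relative change in raw engine throughput (positive = faster in DS 6.0).
engine_change = (trt_801_qps - trt_722_qps) / trt_722_qps * 100

# Relative drop in end-to-end app throughput.
app_drop = (app_ds51_fps - app_ds60_fps) / app_ds51_fps * 100

print(f"engine throughput change: {engine_change:+.2f}%")  # about +0.07%
print(f"app FPS drop:             {app_drop:.1f}%")        # 44.0%
```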

I tried downgrading pyds in the DS 6.0 container (1.1.0 → 1.0.2):

  • deepstream-test3 - no effect
  • deepstream-imagedata-multistream - 225 FPS (= DS 5.1)
  • my app - 250 FPS (= DS 5.1)

Can you explain the reason for the performance degradation and help fix it?
Thanks


Sorry for the late response; is this still an issue that needs support? Thanks

Yes, pyds 1.1.0 is noticeably slower than 1.0.2 under heavy use.

I observed the same effect for the DeepStream primary detector and various custom models.

Hi @metarefl,
Of the three cases (your app, deepstream-test3, deepstream-imagedata-multistream), it seems only your app is seriously affected by the pyds version (250 vs 140 FPS), while the other two show a much smaller performance drop, right?

Is your detection running with the TensorRT backend?
Is it possible for you to narrow down which components cause the perf drop, referring to DeepStream SDK FAQ - #12 by bcao?

Yes, my app is affected more seriously. The Python sample apps are very simple and make few binding calls.

“Is your detection running with TensorRT backend?” - yes, it is a TRT engine with tensor-output post-processing that ends by adding object meta to the frame.

“Is it possible for you to narrow down which components cause the perf drop referring to DeepStream SDK FAQ - #12 by bcao?” - no; it’s a Python app, and pyds doesn’t provide bindings for the latency measurement API.
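As a workaround when the C latency-measurement API is not exposed through bindings, the Python probe callbacks themselves can be timed directly to see where the wall-clock time goes. A minimal sketch in plain Python; `timed_probe` and the probe name are hypothetical helpers of my own, not part of pyds:

```python
import time
from collections import defaultdict
from functools import wraps

probe_time = defaultdict(float)   # cumulative seconds spent per probe
probe_calls = defaultdict(int)    # number of invocations per probe

def timed_probe(name):
    """Decorator that accumulates wall-clock time spent in a probe callback."""
    def decorate(fn):
        @wraps(fn)
        def wrapper(*args, **kwargs):
            t0 = time.perf_counter()
            try:
                return fn(*args, **kwargs)
            finally:
                probe_time[name] += time.perf_counter() - t0
                probe_calls[name] += 1
        return wrapper
    return decorate

# Usage: decorate each pad-probe callback, then dump the totals at EOS.
@timed_probe("tensor_postproc")
def tensor_postproc_probe(pad, info):
    ...  # parse output-tensor-meta, add object meta to the frame, etc.
    # In a real pipeline, return Gst.PadProbeReturn.OK here.
```

Comparing the accumulated per-probe times between the pyds 1.0.2 and 1.1.0 runs would show whether the extra time is spent inside the binding-heavy callbacks.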

Thanks for reporting the issue. Can you share more info on what your app does and which bindings you are using?

There has been no update from you for a while, so we assume this is no longer an issue.
Hence we are closing this topic. If you need further support, please open a new one.
Thanks

Hi @metarefl, thanks for bringing this to our attention. Can you share some more info on your use case, especially how it deviates from our sample apps? We haven’t seen this with the sample apps, so we would like to understand how to replicate the issue.

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.