Need help in choosing GPU for Video Analytics with multi-stream (4+) RTSP inputs & outputs

deepak · July 10, 2020, 9:55am

Please provide complete information as applicable to your setup.

• Hardware Platform (Jetson / GPU)
• DeepStream Version 5.0
• TensorRT Version 7
• NVIDIA GPU Driver Version (450, CUDA 11.00)

We are evaluating suitability of NVIDIA GPUs for video analytics application and have used the deepstream-app as the basis for evaluation and have added support for Nvidia Optical Flow and nvdsanalytics plugins in the pipeline . We have tested the performance on RTX 2080 Ti and Jetson (testing on Jetson has been minimal so far as it doesn’t support Optical Flow). For the purpose of testing, we disable the Optical Flow and nvdsanalytics plugins in the pipeline and limited ourselves to YoloV3 model provided by NVIDIA as part of the samples.

We need to process as many input camera streams (of 1080p x 30FPS over RTSP) as possible (say, 8+) and generate the output RTSP streams with each of the output streams carrying the detection output (OSD).

The problem with RTX 2080 Ti is it limits the number of concurrent encoder sessions to 3.

On the other hand, a Quadro or similar processor do not restrict the number of concurrent sessions. However, it is not clear how many concurrent NVENC sessions can be used on such GPUs to effectively stream 1080p x 30 FPS over RTSP (or even 720p x 30FPS).

Looking at NVIDIA whitepaper on Turing platforms didn’t help. Our own testing so far has been not very encouraging on RTX 2080 Ti (with 1080p or even 720p)

So:

Which GPU is suitable for implementing DeepLearning Inference (using DeepStream)+ RTSP streaming of 4+ and 8+ streams
The performance of DeepStream (deepstream-app) doesn’t seem to be different (and continues to be poor with jitter and significant end-to-end delay as well as buffer caching) when incoming streams of lower resolution (e.g. 720p) is used. Would lowering of incoming frame resolution improve the performance?

Thanks for your inputs.

Fiona.Chen · July 13, 2020, 2:34am

https://docs.nvidia.com/metropolis/deepstream/dev-guide/index.html#page/DeepStream_Development_Guide/deepstream_quick_start.html#

NVIDIA® DeepStream Software Development Kit (SDK) is an accelerated AI framework to build intelligent video analytics (IVA) pipelines. DeepStream runs on NVIDIA® T4 and platforms such as NVIDIA® Jetson™ Nano, NVIDIA® Jetson AGX Xavier™, NVIDIA® Jetson Xavier NX™, NVIDIA® Jetson™ TX1 and TX2.

Fiona.Chen · July 13, 2020, 2:48am

There is some performance data for your refference: https://docs.nvidia.com/metropolis/deepstream/dev-guide/index.html#page/DeepStream_Development_Guide/deepstream_performance.html#

deepak · July 13, 2020, 4:40am

Hi!

Thanks for the information. We had already looked into it. To be more specific:

Some of the Quadro series GPUs do not restrict the number of concurrent NVENC sessions ( https://developer.nvidia.com/video-encode-decode-gpu-support-matrix ). What is the reasonable number of concurrent encoding session possible when using 1080p camera inputs at 30FPS?
In case of Quadro series, what will be the expected FPS when using inputs from 30FPS, H264 1080p RTSP cameras and performing a simple pipeline using mux->nvinfer (YoloV3)->Tracker-Demux (No Tiling)?
On Jetson, the example shown is for Resnet10 (which is highly tuned/pruned model and detects just 4 classes of objects). What will be the FPS throughput when YoloV3 (with 80 classes of objects)?
In general, will reducing the input camera resolution to 720p or reducing the bitrate improve the FPS throughput?

Fiona.Chen · July 13, 2020, 10:38am

There is only some encoder performance data in NVIDIA VIDEO CODEC SDK | NVIDIA Developer, please refer to “NVENC - Hardware-Accelerated Video Encoding” part. There are some stream number data for some GPUs. No data for Quadro GPUs now.

Topic		Replies	Views
Deepstream 4 + yolov3 multi source slow DeepStream SDK	9	1814	October 12, 2021
The number of cameras that DeepStream can support DeepStream SDK tensorrt , camera , gstreamer	3	603	December 15, 2023
How to maximize inferences/sec in a deepstream pipeline DeepStream SDK	13	1061	October 12, 2021
Too much frame drop in deepstream pipeline DeepStream SDK cuda , jetson-inference , gstreamer , deepstream	20	116	February 12, 2025
Hardware buying choices DeepStream SDK	3	381	April 25, 2022
How to use deepstream-nvdsanalytics-test DeepStream SDK	4	389	September 20, 2022
Slow decoding T4 DS 5.0.1 DeepStream SDK	11	476	October 12, 2021
Which on-prem GPU server supports Custom Object Detection on Multi camera Stream using DeepStream 7.0 DeepStream SDK camera , opencv , gstreamer , deepstream	6	85	October 12, 2024
Delay due to nvinfer DeepStream SDK deepstream	11	28	February 25, 2025
Deepstream Multi-Source RTSP Encoding Problem DeepStream SDK camera , gstreamer , nvbugs , encoder	2	716	October 12, 2021

Need help in choosing GPU for Video Analytics with multi-stream (4+) RTSP inputs & outputs

Related topics