Hello, I am planning to deploy a DeepStream 7.1 application on Kubernetes to process multiple RTSP streams. The application will analyze these streams using a YOLO model, generate analytics, and send messages via Kafka using Faust. It will also interface with a small backend.
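For the Kafka leg of that design, nvmsgconv serializes detection metadata to JSON, which a Faust agent would then decode. Here is a rough sketch of what the consumer-side parsing might look like; the pipe-separated object layout follows the DeepStream "minimal" payload schema loosely, and the function name and sample payload are my own illustrative assumptions, not verified nvmsgconv output:

```python
import json

def parse_event(raw: bytes) -> dict:
    """Decode one nvmsgconv-style JSON event from Kafka.

    Assumes a minimal-schema-like layout where each entry in "objects"
    is a pipe-separated string: "trackId|left|top|width|height|label".
    Check the actual nvmsgconv payload your config produces before relying
    on this exact field order.
    """
    event = json.loads(raw)
    detections = []
    for obj in event.get("objects", []):
        track_id, left, top, width, height, label = obj.split("|")
        detections.append({
            "track_id": int(track_id),
            "bbox": (float(left), float(top), float(width), float(height)),
            "label": label,
        })
    return {
        "sensor": event.get("sensorId"),
        "timestamp": event.get("@timestamp"),
        "detections": detections,
    }

# Example payload shaped like a minimal-schema message (illustrative only).
sample = json.dumps({
    "version": "4.0",
    "@timestamp": "2024-01-01T00:00:00.000Z",
    "sensorId": "cam-0",
    "objects": ["7|100.0|200.0|50.0|80.0|person"],
}).encode()

print(parse_event(sample)["detections"][0]["label"])  # person
```

Inside a Faust app this function would simply be called from an `@app.agent` handler on the topic nvmsgbroker publishes to.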
I would appreciate advice and best practices in the following areas:
Deployment Methodology:
Would you recommend deploying via Helm (e.g., following the NVIDIA Helm chart example guide) or directly creating and managing Kubernetes manifests? What are the pros and cons of each approach for a DeepStream 7.1 application?
System Requirements:
Recommended NVIDIA GPU driver version and CUDA version for compatibility with DeepStream 7.1?
Any specific GPU configuration or hardware tips for better performance?
Application Development:
For developing the application, should I use the C APIs or the Python bindings (e.g., following the deepstream_python_apps examples)? Are there notable performance differences or limitations with the Python bindings?
Best Practices and Plugin Recommendations:
Are there specific plugins you would recommend for processing RTSP streams, running inference with YOLO models, and integrating with Kafka?
For example, I plan to use plugins such as nvstreammux, nvinfer, and nvmsgbroker. Are there any additional or alternative plugins I should consider to optimize my pipeline?
What are the best practices for configuring these plugins to ensure scalability and high performance, especially in a Kubernetes environment?
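For context, here is a rough sketch of how the plugins I mentioned would line up. It builds a gst-launch-style description string (which the Python bindings can hand to Gst.parse_launch); the inference config path, resolution, and Kafka connection string are placeholders I made up, not tested values:

```python
def build_pipeline_desc(rtsp_uris,
                        infer_config="config_infer_yolo.txt",
                        kafka_conn="my-broker;9092;ds-events"):
    """Compose a gst-launch-style DeepStream pipeline description.

    One uridecodebin per RTSP source feeds a distinct nvstreammux sink pad;
    nvinfer runs the YOLO model on the batched frames, and nvmsgconv +
    nvmsgbroker push the metadata to Kafka. All paths and the connection
    string are illustrative placeholders.
    """
    sources = " ".join(
        f"uridecodebin uri={uri} ! mux.sink_{i}"
        for i, uri in enumerate(rtsp_uris)
    )
    return (
        f"{sources} "
        # batch-size should match the source count so nvinfer sees full batches
        f"nvstreammux name=mux batch-size={len(rtsp_uris)} "
        f"width=1920 height=1080 batched-push-timeout=40000 ! "
        f"nvinfer config-file-path={infer_config} ! "
        "nvmsgconv ! "
        "nvmsgbroker proto-lib=libnvds_kafka_proto.so "
        f"conn-str={kafka_conn}"
    )

desc = build_pipeline_desc(["rtsp://cam1/stream", "rtsp://cam2/stream"])
```

For RTSP sources specifically, DeepStream also ships source bins such as nvurisrcbin, which exposes RTSP reconnection options; that may matter more than raw plugin choice once cameras start dropping connections in production.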
Additional Documentation/Resources:
Apart from the resources above, are there other guides or documentation that would help optimize this deployment?
I look forward to hearing your suggestions and insights.
Thanks in advance for your help!
Thank you for all the information!
Yes, I’m looking for documentation or guides specifically about deployment with Helm and Kubernetes, especially focused on optimizing setups for DeepStream.
DeepStream pipeline optimization has nothing to do with Helm or Kubernetes. NVIDIA publishes official DeepStream Docker containers on NGC (DeepStream | NVIDIA NGC).
Okay thanks, I understand that the optimization of the DeepStream pipeline is separate from Kubernetes and Helm. However, I am looking for guides or tutorials specifically on deploying DeepStream applications in Kubernetes clusters. Are there any resources or examples that detail this process?
Additionally, I noticed that for DeepStream 7.1, the requirements specify driver version 535.183.06 and CUDA 12.6, but the maximum supported CUDA version for that driver is 12.2. Could you clarify why?
In my humble opinion, there has always been some ambiguity about how a DeepStream pipeline can be scaled from 10 cameras to 1,000 cameras in the cloud. As of today, I am not even sure whether it is possible, or whether it makes sense.
Hello, I deployed a DeepStream 7.1 app into my Kubernetes cluster using the deepstream:7.1-triton-multiarch container.
The GPU I’m using is an NVIDIA H100, with driver version 535.183.06 and CUDA version 12.6.
When I try to execute deepstream-rtsp-in-rtsp-out from the DeepStream Python apps with the command:
I solved the problem. I was running my pod incorrectly.
However, I noticed an issue: when running my application on a machine with an L40S GPU, everything works fine, but with the H100 GPU, I encounter an encoding problem. Could this be related to the absence of NVENC? How can I work around this issue?
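On the NVENC point: as far as I know, the H100 is a compute-focused part that ships without NVENC hardware (it does have NVDEC), while the L40S includes NVENC, which would explain the difference. A common workaround is to fall back to a software encoder when the hardware element is unavailable. A minimal sketch, where the helper name is mine and the registry probe shown in the comment is how you'd check at runtime:

```python
def pick_h264_encoder_branch(nvenc_available: bool) -> str:
    """Return the encoder branch for a gst-launch-style description.

    Falls back to the software x264 encoder when NVENC hardware
    (absent on H100, present on L40S) cannot be used.
    """
    if nvenc_available:
        return "nvv4l2h264enc ! h264parse"
    # Software fallback: CPU-bound, so budget pod CPU requests accordingly.
    return "x264enc tune=zerolatency speed-preset=ultrafast ! h264parse"

# In the real app you would probe the GStreamer registry first, e.g.:
#   from gi.repository import Gst
#   Gst.init(None)
#   has_nvenc = Gst.ElementFactory.find("nvv4l2h264enc") is not None
branch = pick_h264_encoder_branch(False)
```

Note that the element may still register even on a GPU without encoder silicon, so a short trial encode on startup may be a more reliable probe than the registry check alone.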
Thanks