GPU management when deploying DeepStream on Kubernetes

Please provide complete information as applicable to your setup.

• Hardware Platform (Jetson / GPU): GPU
• DeepStream Version: 6.1.1
• JetPack Version (valid for Jetson only):
• TensorRT Version:
• NVIDIA GPU Driver Version (valid for GPU only):
• Issue Type (questions, new requirements, bugs): question
• How to reproduce the issue? (This is for bugs. Include which sample app is used, the configuration file contents, the command line used, and other details for reproducing.)
• Requirement details (This is for new requirements. Include the module name, i.e. for which plugin or which sample application, and the function description.)

Hi, I see a similar question, but it was closed with no definite answer: How to manage thousands of video streams and feed them to DeepStream?

I wanted to ask: what is the recommended approach for using Kubernetes to run multiple instances of DeepStream apps across multiple nodes, with multiple GPUs available on each node?

How can I make the best use of the hardware at my disposal while avoiding repeated manual changes to Helm configs / DeepStream configs?

DeepStream is an SDK for developing inferencing applications. DeepStream applications run in Kubernetes just like any other application. For Kubernetes-specific questions, it seems you need to refer to the Kubernetes documentation.

I know what DeepStream is, thanks…

Let me clarify the situation I'm in:

I have previously used your video-analytics-demo Helm chart, version 0.1.5, as a guide. There I specified the GPU for the DS instance in values.yaml and used that value to set the same gpu-id via a modified created_config.py.

But as you can imagine, that is not a very flexible solution once more streams and containers need to be handled.

In the recent video-analytics-demo Helm chart, version 0.1.8 (DeepStream - Intelligent Video Analytics Demo | NVIDIA NGC), I see GPUs mentioned under

resources:
  limits:
    nvidia.com/gpu: 1

with gpu-id=0 used everywhere it applies.
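For context, this is roughly how that limit sits in a pod/container spec (a minimal sketch; the pod name, container name, and image tag below are placeholders I made up, not taken from the actual chart):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: deepstream-app            # placeholder name
spec:
  containers:
  - name: deepstream              # placeholder name
    image: nvcr.io/nvidia/deepstream:6.1.1-triton   # assumed tag, check NGC
    resources:
      limits:
        nvidia.com/gpu: 1         # device plugin allocates one whole GPU to this pod
```

With this, the Kubernetes scheduler (via the NVIDIA device plugin) picks which physical GPU the pod gets, rather than the chart hardcoding it.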

But it is still not clear to me whether there is a clean way to handle the situation where I have more pods (or containers, or ds-app instances; one DS instance is one pod/container here) to run than GPUs, across more than one node.

I would like to have bigger batches and fewer DS apps running, but that is not possible right now, as some post-processing of the bounding boxes appears to be CPU-bound.

To summarize:
I'm looking for advice that would help me extend your Helm charts to situations where multiple nodes each have multiple GPUs, but the number of pods to run is greater than the total number of GPUs.

I do not expect the streams to be reassigned to different GPUs during their lifetime, and I can assume my pods will need a reasonably constant amount of resources. I'm just hoping for something that would save me the initial guesswork and the manual modification of tens of config files; something that would start the tenth DS instance on the least busy GPU.

I don't think it's a very unusual use case, hence I'm asking for suggestions.

And why here and not on the Kubernetes forums? Because it's the DS config that requires gpu-id to be hardcoded in multiple places, even though it needs to be the same everywhere for the app to run.
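One thing that may help here (this is my understanding of the default NVIDIA device plugin behavior, so please verify for your setup): when a pod requests nvidia.com/gpu, the container only sees the GPU(s) allocated to it, enumerated starting from 0. If that holds, gpu-id=0 can stay hardcoded in every section of the DS config and will still resolve to whichever physical GPU the scheduler picked, e.g.:

```
# deepstream-app config fragment (illustrative; only the gpu-id lines matter here)
[source0]
enable=1
gpu-id=0

[streammux]
gpu-id=0

[primary-gie]
enable=1
gpu-id=0
```

That would remove the need to template gpu-id per pod at all; only the resource request varies.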

GPU time-slicing may help you have pods share GPUs:
https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/gpu-sharing.html
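Per the linked docs, time-slicing is configured via a ConfigMap that the GPU Operator's device plugin consumes; roughly like this (a sketch based on the documentation, so check the exact schema against your gpu-operator version):

```yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: time-slicing-config      # name is your choice
  namespace: gpu-operator
data:
  any: |-
    version: v1
    sharing:
      timeSlicing:
        resources:
        - name: nvidia.com/gpu
          replicas: 4            # each physical GPU is advertised as 4 schedulable GPUs
```

You then point the device plugin at it through the ClusterPolicy (the devicePlugin.config.name / default fields), after which each node advertises replicas × physical GPUs as nvidia.com/gpu, letting more DeepStream pods than GPUs be scheduled while they share the hardware.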