Auto Scaling of Computer Vision Application on Kubernetes

bhavara23 · December 16, 2022, 6:58am

Hello,

We have some questions about deploying and scaling computer vision models on Kubernetes with Nvidia GPUs. If you could point us in the right direction, that would be fantastic.

All of our computer vision models run on a Kubernetes cluster that has two Nvidia V100 GPUs and one Nvidia P100 GPU.

We’re having trouble with horizontal scaling. We can set the number of nodes, but we have not been able to configure the cluster to scale up or down automatically in response to incoming requests, and we are unsure how to proceed.

micuentadecasa · December 21, 2022, 8:51am

I'm also interested in this topic. I'm planning to use Azure AKS on edge devices.
