Amazon Elastic Kubernetes Services Now Offers Native Support for NVIDIA A100 Multi-Instance GPUs

jwitsoe · October 22, 2021, 7:17pm

Originally published at: https://developer.nvidia.com/blog/amazon-elastic-kubernetes-services-now-offers-native-support-for-nvidia-a100-multi-instance-gpus/

Deploying and integrating trained machine learning (ML) models in production remains a hard problem, both for application developers and for the infrastructure teams supporting them. How do you ensure you have right-sized compute resources to support multiple end users, serve multiple disparate workloads at the highest level of performance, automatically balance the load, scale up…

Related topics

Topic	Category	Replies	Views	Activity
One-click Deployment of Triton Inference Server to Simplify AI Inference on Google Kubernetes Engine (GKE)	Technical Blog	0	524	August 23, 2021
AWS Brings NVIDIA A10G Tensor Core GPUs to the Cloud with New EC2 G5 Instances	Technical Blog	0	495	November 12, 2021
Getting Kubernetes ready for the NVIDIA A100 GPU with Multi-Instance GPU	Technical Blog	4	662	November 8, 2022
Getting the Most Out of the NVIDIA A100 GPU with Multi-Instance GPU	Technical Blog	11	1482	January 19, 2023
Deploying AI Deep Learning Models with NVIDIA Triton Inference Server	Technical Blog	0	395	December 18, 2020
Fast and Scalable AI Model Deployment with NVIDIA Triton Inference Server	Technical Blog	0	419	November 9, 2021
NVIDIA AI Enterprise - Optimized, Certified and Supported on VMware vSphere	Technical Blog	0	397	January 6, 2022
Run Multiple AI Models on the Same GPU with Amazon SageMaker Multi-Model Endpoints Powered by NVIDIA Triton Inference Server	Technical Blog	0	403	October 25, 2022
Adobe Scales ML Pipelines for Optimized Delivery of Brand Messages	Technical Blog	0	244	September 14, 2023
Develop ML and AI with Metaflow and Deploy with NVIDIA Triton Inference Server	Technical Blog	2	357	January 5, 2024