Improving GPU Utilization in Kubernetes

Originally published at: Improving GPU Utilization in Kubernetes | NVIDIA Technical Blog

To improve NVIDIA GPU utilization in K8s clusters, we offer new GPU time-slicing APIs, enabling multiple GPU-accelerated workloads to time-slice and run on a single NVIDIA GPU.