ISC 2020: Running Multiple Workloads on a Single A100 GPU

Presenters: DemoTeam, NVIDIA

Today, researchers and developers typically get a dedicated GPU for each workload, even when that workload uses only a fraction of the GPU's compute power. The NVIDIA A100 Tensor Core GPU includes a groundbreaking feature called Multi-Instance GPU (MIG), which partitions the GPU into as many as seven instances, each with its own dedicated compute, memory, and bandwidth. Multiple users can therefore run their workloads on the same GPU at the same time, maximizing per-GPU utilization and user productivity. This demo runs AI and high-performance computing (HPC) workloads simultaneously on a single A100 GPU.
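
As a rough illustration of how separate workloads might be pinned to separate MIG instances, the sketch below parses a device listing and builds one environment per workload. It assumes the listing resembles the output of `nvidia-smi -L` on a MIG-enabled GPU and that a process can be restricted to one instance by setting `CUDA_VISIBLE_DEVICES` to that instance's UUID. The exact listing format can differ between driver versions. The sample text, the UUIDs, the workload names, and the helper functions are all hypothetical and are not taken from the demo.

```python
import re

# Assumed shape of a MIG line in an `nvidia-smi -L` style listing, e.g.
#   "  MIG 3g.20gb Device 0: (UUID: MIG-example-0)"
MIG_LINE = re.compile(
    r"^\s*MIG\s+(\S+)\s+Device\s+(\d+):\s+\(UUID:\s*(\S+?)\)\s*$"
)


def parse_mig_devices(listing):
    """Return (profile, index, uuid) for every MIG line in the listing."""
    devices = []
    for line in listing.splitlines():
        match = MIG_LINE.match(line)
        if match:
            devices.append((match.group(1), int(match.group(2)), match.group(3)))
    return devices


def assign_workloads(workloads, devices):
    """Give each workload its own MIG instance via CUDA_VISIBLE_DEVICES."""
    if len(workloads) > len(devices):
        raise ValueError("more workloads than MIG instances")
    return {
        name: {"CUDA_VISIBLE_DEVICES": uuid}
        for name, (_profile, _index, uuid) in zip(workloads, devices)
    }


# Hypothetical listing: one physical GPU split into two instances.
SAMPLE_LISTING = """GPU 0: A100-SXM4-40GB (UUID: GPU-example)
  MIG 3g.20gb Device 0: (UUID: MIG-example-0)
  MIG 3g.20gb Device 1: (UUID: MIG-example-1)
"""

if __name__ == "__main__":
    instances = parse_mig_devices(SAMPLE_LISTING)
    for name, env in assign_workloads(["ai_training", "hpc_sim"], instances).items():
        print(name, env)
```

On a real system, each environment dictionary could be merged into `os.environ` and passed to `subprocess.Popen(..., env=...)`, so that every workload sees only its own instance.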

