Gpu-operator / MIG feature / ResourceQuota

sebastien.chausson.extern · June 5, 2023, 3:35pm

Hi,
We have one A100-SXM4-80GB GPU server integrated into a Kubernetes cluster.
After reading the documentation, I am wondering how to set a “maximum amount of GPU” on a k8s namespace that could then be consumed by workloads of different sizes.
For instance, let's say we want to allocate one full GPU card (out of the 8 available) to a project/namespace, but we want to give users the option of running workloads on 1g.10gb, 2g.20gb or even 4g.40gb slices (depending on their use case), with the total of all their workloads having to fit in that single GPU (for instance, they could simultaneously start 7 workloads using 1g.10gb, or 3 workloads using 2g.20gb + 1 workload using 1g.10gb, and so on…)
I couldn’t figure out how to set a ResourceQuota on the project namespace to achieve this:
From what I understand, I can set the following in the “hard” section of the ResourceQuota to assign a full GPU to the namespace:

requests.nvidia.com/gpu: '1'

Or the following to restrict the namespace to 4 * 2g.20gb:

requests.nvidia.com/mig-2g.20gb: '4'

But will k8s be able to “understand” that 7 * 1g.10gb workloads fit in a full GPU card?
Will it be able to sum “1g.10gb” and “2g.20gb” GPU slices and infer that the total is under the limit we set?
I would rather not have to set:

requests.nvidia.com/mig-1g.10gb: '7'
requests.nvidia.com/mig-2g.20gb: '4'
requests.nvidia.com/mig-4g.40gb: '2'

Because I guess that in that case the three quotas would be cumulative, right?
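To make the concern concrete, here is a rough sketch of how those three entries would sit in a complete ResourceQuota manifest. The names `gpu-quota` and `my-project` are placeholders I made up, and the resource names assume the MIG profiles are exposed as separate resources (nvidia.com/mig-1g.10gb and so on):

```yaml
apiVersion: v1
kind: ResourceQuota
metadata:
  name: gpu-quota          # placeholder name, not from our cluster
  namespace: my-project    # placeholder namespace
spec:
  hard:
    # As far as I understand, each entry is its own independent counter
    requests.nvidia.com/mig-1g.10gb: "7"
    requests.nvidia.com/mig-2g.20gb: "4"
    requests.nvidia.com/mig-4g.40gb: "2"
```

If my understanding is right, nothing here ties the three counters together, so the namespace could request up to 7 + 4 + 2 = 13 MIG devices at once, which is far more than one physical GPU can provide.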
Thanks for any help
