Hello everyone,
I have a query about the number of virtual machine (VM) instances that can be run on a single virtual GPU (vGPU).
Referring to the “Virtual-GPU-Packaging-and-Licensing-Guide” PDF, specifically at the bottom of page 10, there’s a statement: Maximum 10 concurrent VMs per GPU.
This implies that, for example, it wouldn’t be feasible to operate 16 VMs on an A16 GPU. However, I’m curious about a specific scenario: if I were to run 10 VMs on an A16 GPU, would the remaining 6GB of VRAM be rendered unusable or redundant?
Any insights or clarifications on this matter would be greatly appreciated.