We recently purchased an a40 GPU. However, plan to use it with multiple customer virtual machines. I am wondering if i can still use a VM with high v-ram dedicated from the GPU as a CloudXR server, or if we will need an entire new GPU dedicated for the server.
The number of users that can be supported by a single, virtualized graphics card is largely dependent on the VR application. We know that for typical encoding workloads a single card Turing or Ampere generation card can support a maximum of two users for HMDs with specs similar to those of the Quest 2.