VDI: performance impact of NUMA non-locality (multi-socket servers)

Hi.

I’m currently doing some research on vGPU + VDI.

It looks like ESXi does a really bad job of handling NUMA locality for PCI devices.
Unless you pin your VM to a specific socket and ensure that it really picks the local GPU, you might end up with multiple VMs driving their vGPU across the socket interconnect.
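
For anyone wanting to check their own hosts: a rough pyVmomi sketch along these lines should list which VMs carry a vGPU profile and whether they have any numa.nodeAffinity advanced setting at all (the vCenter hostname and credentials are placeholders, and it assumes a standard pyVmomi install). On the host side, esxcli hardware pci list should show which NUMA node each physical GPU sits on, which is what you'd compare against.

```python
# Rough sketch: list VMs with an NVIDIA vGPU device and show whether they
# have a numa.nodeAffinity advanced setting. Hostname/credentials are placeholders.
import ssl
from pyVim.connect import SmartConnect, Disconnect
from pyVmomi import vim

ctx = ssl._create_unverified_context()  # lab only; use proper certs otherwise
si = SmartConnect(host="vcenter.example.local", user="administrator@vsphere.local",
                  pwd="***", sslContext=ctx)
content = si.RetrieveContent()
view = content.viewManager.CreateContainerView(content.rootFolder,
                                               [vim.VirtualMachine], True)
for vm in view.view:
    if not vm.config:
        continue
    # vGPU devices show up as PCI passthrough devices with a vmiop backing
    profiles = [dev.backing.vgpu
                for dev in vm.config.hardware.device
                if isinstance(dev, vim.vm.device.VirtualPCIPassthrough)
                and isinstance(dev.backing,
                               vim.vm.device.VirtualPCIPassthrough.VmiopBackingInfo)]
    if not profiles:
        continue
    affinity = next((opt.value for opt in vm.config.extraConfig
                     if opt.key == "numa.nodeAffinity"), "not set")
    print(f"{vm.name}: vGPU={profiles} numa.nodeAffinity={affinity}")

view.Destroy()
Disconnect(si)
```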

I haven’t really tested the performance impact here. I’m aware that this depends on the workload.

Has anyone done any testing in mixed VDI environments (typical office users / CAD users)?
How much impact does running through the interconnect have, especially when using NVENC to encode the stream for Citrix/Horizon?

I’m aware that pinning VMs to specific sockets might be the way to go, but that won’t scale in a big environment where VMs are moved, re-specced, etc.
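
If pinning does turn out to be the answer, I’d rather script it than set it by hand per VM. A minimal sketch, assuming the documented numa.nodeAffinity advanced option and with the VM name and node number as placeholders:

```python
# Rough sketch: constrain a VM to NUMA node 0 via the numa.nodeAffinity
# advanced option. VM name, node number and credentials are placeholders;
# the setting applies on the next power cycle as far as I can tell.
import ssl
from pyVim.connect import SmartConnect, Disconnect
from pyVmomi import vim

def set_numa_affinity(si, vm_name, node="0"):
    content = si.RetrieveContent()
    view = content.viewManager.CreateContainerView(content.rootFolder,
                                                   [vim.VirtualMachine], True)
    vm = next(v for v in view.view if v.name == vm_name)
    view.Destroy()
    spec = vim.vm.ConfigSpec(extraConfig=[
        vim.option.OptionValue(key="numa.nodeAffinity", value=node)])
    return vm.ReconfigVM_Task(spec=spec)

ctx = ssl._create_unverified_context()  # lab only
si = SmartConnect(host="vcenter.example.local", user="administrator@vsphere.local",
                  pwd="***", sslContext=ctx)
set_numa_affinity(si, "cad-vdi-01", node="0")
Disconnect(si)
```

Even scripted, the affinity still has to match whichever node the local GPU sits on after a VM is moved or re-specced, which is exactly the part that doesn’t scale on its own.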