Using the Dell vSphere installer (VMware-VMvisor-Installer-6.5.0.update01-6765664.x86_64-DellEMC_Customized-A02), the NVIDIA Grid VIB installed fine (NVIDIA-VMware_ESXi_6.5_Host_Driver_384.73-1OEM.650.0.0.4598673).
However, nvidia-smi returns:
Failed to initialize NVML: Unknown Error
Is there an incompatibility between the two versions that I have installed?
Please try the default bits from VMWare. I don’t think our VIB is tested with Dell installer. In addition please check with dmesg to see if there are any other errors that may indicate a BIOS settings error.
The problem with the default installer from VMWare was that it did not recognize the 10Gb network ports on the server. I’ll try to find another workaround for that.
In the mean time, it does appear that there are some issues from the results of dmesg:
According to this doc: VMware Knowledge Base, the Module Name needs to be "nvidia", but I show it as "None", which might explain why Xorg will not start.
I’m also not sure if the fact that
>esxcli hardware pci list -c 0x0300 -m 0xf
returns the embedded VGA controller as well as the NVIDIA controller is an issue or not…
Description:
When system BIOS has "Memory Mapped I/O Base" set to 56 TB and if the server has GPU cards such as Nvidia M60 as the PCIe Pass-Through device, the virtual machines fails to power on.
Applies to:
ESXi 6.5.x and Dell EMC’s 14th generation PowerEdge servers
Solution:
To resolve this, set the MMIO to 12 TB. To set MMIO, in System BIOS Settings >
Integrated Devices, you have to set "Memory Mapped I/O Base" to 12 TB.
For more information, refer to
VMware Knowledge Base article 2142307