Unable to upgrade drivers on ESXi

Hello,

I am attempting to upgrade the NVIDIA (NVD) drivers on our ESXi host. I am following the steps in the article below.
https://docs.omniverse.nvidia.com/deployment/latest/installing-vgpu-manager.html

With the current driver everything runs fine, but after the upgrade I get this error:

[root@GPU1-core:~] nvidia-smi
NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

I have reverted to the previous driver that was working, and with it the GPUs are detected again:

[root@GPU1:~] nvidia-smi
Wed Mar 6 22:10:52 2024
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.63       Driver Version: 470.63       CUDA Version: N/A      |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA A100-PCI...  On   | 00000000:25:00.0 Off |                  Off |
| N/A   26C    P0    36W / 250W |      0MiB / 40536MiB |      0%      Default |
|                               |                      |             Disabled |
+-------------------------------+----------------------+----------------------+
|   1  NVIDIA A100-PCI...  Off  | 00000000:81:00.0 Off |                  Off |
| N/A   26C    P0    37W / 250W |      0MiB / 40536MiB |      0%      Default |
|                               |                      |             Disabled |
+-------------------------------+----------------------+----------------------+
|   2  NVIDIA A100-PCI...  Off  | 00000000:E2:00.0 Off |                  Off |
| N/A   27C    P0    38W / 250W |      0MiB / 40536MiB |      0%      Default |
|                               |                      |             Disabled |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

I have tried two different newer versions and both fail.

Works:
NVIDIA_bootbank_NVIDIA-VMware_ESXi_7.0.2_Driver_470.63-1OEM.702.0.0.17630552.vib

Fails:
NVD_bootbank_NVD-VMware_ESXi_7.0.2_Driver_535.154.02-1OEM.702.0.0.17630552.vib
NVD_bootbank_NVD-VMware_ESXi_7.0.2_Driver_550.54.10-1OEM.702.0.0.17630552.vib

Any suggested next steps would be appreciated.

The A100 is no longer supported with vGPU. You need to use the NVIDIA AI Enterprise host driver instead. This change was made two years ago, starting with vGPU 14. Please check the release notes for the vGPU 16 release you were trying to install…
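
As a rough illustration of the point above, here is a small Python sketch that pulls the driver version out of the VIB file names quoted in the question and maps the driver branch to a vGPU release. The branch-to-release table (470 = vGPU 13, 510 = vGPU 14, 525 = vGPU 15, 535 = vGPU 16, 550 = vGPU 17) is an assumption that does not come from this thread, so please verify it against the release notes. The names `parse_vib` and `a100_vgpu_status` are hypothetical helpers made up for this example.

```python
import re

# ASSUMPTION: driver branch -> vGPU software release. This table is not
# taken from the thread; confirm it against the NVIDIA release notes.
ASSUMED_VGPU_RELEASE = {470: 13, 510: 14, 525: 15, 535: 16, 550: 17}

# From the reply above: A100 support in vGPU ended starting with vGPU 14.
FIRST_RELEASE_WITHOUT_A100 = 14

# Matches names like
# NVD_bootbank_NVD-VMware_ESXi_7.0.2_Driver_535.154.02-1OEM.702.0.0.17630552.vib
VIB_PATTERN = re.compile(
    r"^(?P<vendor>[A-Za-z]+)_bootbank_.*_Driver_(?P<version>\d+(?:\.\d+)+)-"
)


def parse_vib(filename):
    """Return (vendor prefix, driver version, driver branch) for a VIB name."""
    match = VIB_PATTERN.match(filename)
    if match is None:
        raise ValueError(f"unrecognised VIB file name: {filename}")
    version = match.group("version")
    branch = int(version.split(".")[0])
    return match.group("vendor"), version, branch


def a100_vgpu_status(filename):
    """Describe whether this host driver is expected to drive an A100 as vGPU."""
    _vendor, version, branch = parse_vib(filename)
    release = ASSUMED_VGPU_RELEASE.get(branch)
    if release is None:
        return f"{version}: unknown branch, check the release notes"
    if release >= FIRST_RELEASE_WITHOUT_A100:
        return (f"{version}: vGPU {release}, A100 not supported, "
                "use the NVIDIA AI Enterprise host driver")
    return f"{version}: vGPU {release}, A100 supported"


if __name__ == "__main__":
    for name in (
        "NVIDIA_bootbank_NVIDIA-VMware_ESXi_7.0.2_Driver_470.63-1OEM.702.0.0.17630552.vib",
        "NVD_bootbank_NVD-VMware_ESXi_7.0.2_Driver_535.154.02-1OEM.702.0.0.17630552.vib",
        "NVD_bootbank_NVD-VMware_ESXi_7.0.2_Driver_550.54.10-1OEM.702.0.0.17630552.vib",
    ):
        print(a100_vgpu_status(name))
```

Under those assumptions, the 470.63 VIB falls in a release that still covers the A100, while both VIBs that failed fall in later releases, which is consistent with the behaviour described in the question.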