Hello,
I am attempting to upgrade the NVIDIA (NVD) drivers on our ESXi host, following the steps in the article below.
https://docs.omniverse.nvidia.com/deployment/latest/installing-vgpu-manager.html
With the current driver everything runs fine, but after the upgrade I get the following error:
[root@GPU1-core:~] nvidia-smi
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
I have reverted to the previous, working driver, and with it the GPUs are detected again:
[root@GPU1:~] nvidia-smi
Wed Mar 6 22:10:52 2024
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.63       Driver Version: 470.63       CUDA Version: N/A      |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA A100-PCI...  On   | 00000000:25:00.0 Off |                  Off |
| N/A   26C    P0    36W / 250W |      0MiB / 40536MiB |      0%      Default |
|                               |                      |             Disabled |
+-------------------------------+----------------------+----------------------+
|   1  NVIDIA A100-PCI...  Off  | 00000000:81:00.0 Off |                  Off |
| N/A   26C    P0    37W / 250W |      0MiB / 40536MiB |      0%      Default |
|                               |                      |             Disabled |
+-------------------------------+----------------------+----------------------+
|   2  NVIDIA A100-PCI...  Off  | 00000000:E2:00.0 Off |                  Off |
| N/A   27C    P0    38W / 250W |      0MiB / 40536MiB |      0%      Default |
|                               |                      |             Disabled |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
I have tried two different newer versions, and both fail.
Works:
NVIDIA_bootbank_NVIDIA-VMware_ESXi_7.0.2_Driver_470.63-1OEM.702.0.0.17630552.vib
Fails:
NVD_bootbank_NVD-VMware_ESXi_7.0.2_Driver_535.154.02-1OEM.702.0.0.17630552.vib
NVD_bootbank_NVD-VMware_ESXi_7.0.2_Driver_550.54.10-1OEM.702.0.0.17630552.vib
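To make the difference between the working and failing packages easier to see, here is a small sketch that splits the three VIB file names above into their parts. The layout it assumes (`<vendor>_bootbank_<name>_<version>.vib`) is only inferred from these file names, not taken from VMware or NVIDIA documentation, and `parse_vib` is a hypothetical helper written for this post.

```python
# Split ESXi VIB file names into vendor, package name, and version.
# Assumed layout (inferred from the names in this post only):
#   <vendor>_bootbank_<name>_<version>.vib
def parse_vib(filename):
    # Drop the ".vib" extension if present.
    stem = filename[:-len(".vib")] if filename.endswith(".vib") else filename
    # Everything before "_bootbank_" is treated as the vendor.
    vendor, rest = stem.split("_bootbank_", 1)
    # The last underscore-separated field is treated as the version.
    name, version = rest.rsplit("_", 1)
    return {"vendor": vendor, "name": name, "version": version}


VIBS = [
    "NVIDIA_bootbank_NVIDIA-VMware_ESXi_7.0.2_Driver_470.63-1OEM.702.0.0.17630552.vib",
    "NVD_bootbank_NVD-VMware_ESXi_7.0.2_Driver_535.154.02-1OEM.702.0.0.17630552.vib",
    "NVD_bootbank_NVD-VMware_ESXi_7.0.2_Driver_550.54.10-1OEM.702.0.0.17630552.vib",
]

for vib in VIBS:
    print(parse_vib(vib))
```

Read this way, the failing packages differ from the working one in vendor prefix and package name (NVD rather than NVIDIA), not only in driver version. I do not know whether that matters for how the upgrade replaces the old VIB, but it may be relevant.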
Any suggested next steps would be appreciated.