Driver Error: NVIDIA-SMI failed because it couldn't communicate with the NVIDA Driver

Hey everyone,
as part of my bachelor thesis, I am working on deploying an local LLM, but unfortunately I am having issues with my NVIDIA Driver.
When I type “nvidia-smi” I get the error message, that it couldnt communicate with the driver. I’ve found several solutions in different forums, but unfortunately none of them worked for me.

Ive deployed a VM in the vmWare Environment. The VM contains 4 Tesla T4s, each with 16 GB VRAM in a 16Q Grid.
The Ubuntu Version is 24.04 and the Kernel Version is 6.8
The installed NVIDIA Driver Version is 550.67

The NVIDIA Bug Report is a 2.3 GB File, so I cant upload it here, I will send a OneDrive Share Link.
I’ve also tried it before on a VM with Ubuntu 22.04.

I would be very happy, if someone could help me, because it would help me with my bachelor thesis

Please find the Bug Report with the following OneDrive Link: Microsoft OneDrive

Did you use the proper grid driver?

No, I don’t think so (at least not on the VM, it is installed on the ESXi Host. (I havent really worked with a linux machine yet).
I’ve seen that there is an Grid Driver, but couldn’t find much to install it. I installed the normal NVIDIA Driver.
On VM before with Ubuntu 22.04 I was able to run nvidia-smi and it also didn’t have the grid driver. But on that machine problems started after an apt update.

Could you help me to install the proper driver? I’ve searched on a few websites, but couldnt find much.
There were these instructions in vmWare, but commands are not working on Ubuntu

You first need to download the proper -grid client driver from the vgpu portal where you also acquired the vmware host driver. It is not publically available.

Prior to using the runfile installer, you need to uninstall all nvidia driver packages you installed from repo.

That did the trick, thank you very much.
Nvidia-smi is working again. I hope I can execute the LLM on the Graphic cards now

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.