Hey, after a reboot I received this message: NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver.
5.15.0-43-generic #46~20.04.1-Ubuntu
x86_64 GNU/Linux
You might want to run ‘apt --fix-broken install’ to correct these.
The following packages have unmet dependencies:
nvidia-dkms-510 : Depends: nvidia-kernel-common-510 (>= 510.85.02) but 510.73.08-0ubuntu1 is to be installed
nvidia-docker2 : Depends: nvidia-container-toolkit (>= 1.10.0-1) but 1.9.0-1 is to be installed
nvidia-driver-510 : Depends: nvidia-kernel-common-510 (>= 510.85.02) but 510.73.08-0ubuntu1 is to be installed
E: Unmet dependencies. Try ‘apt --fix-broken install’ with no packages (or specify a solution).
I found the package manager's log file. In the log I saw that every so often it starts unpacking the NVIDIA drivers.
The question is why this happened, and how I can prevent it. term.log (28.8 KB)
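One way to narrow this down, as a rough sketch only: list the NVIDIA unpack events from apt's terminal log to see when they occur. This assumes the log is at /var/log/apt/term.log (the Ubuntu default) and that the periodic unpacking comes from automatic upgrades, which the log itself would have to confirm. nvidia_unpack_lines is a made-up helper name, not an apt tool.

```shell
#!/bin/sh
# Sketch: show NVIDIA package unpack events recorded in apt's terminal log.
# nvidia_unpack_lines is a hypothetical helper; it filters log text on stdin.
nvidia_unpack_lines() {
    grep -E '^Unpacking (lib)?nvidia' || true
}

# Assumed default log location on Ubuntu; it may only be readable by root/adm.
log=/var/log/apt/term.log
if [ -r "$log" ]; then
    nvidia_unpack_lines < "$log"
fi

# If automatic upgrades turn out to be the cause, holding the driver packages
# is one possible way to stop them (needs root; package names are taken from
# the apt error output above):
#   sudo apt-mark hold nvidia-driver-510 nvidia-dkms-510 nvidia-kernel-common-510
```

Held packages stay at their current version until released again with apt-mark unhold, so the driver would then have to be upgraded by hand.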
The first two are missing the kernel modules; check with:
dkms status
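As a small sketch of what to look for in that output: the nvidia module should be listed as installed for the kernel that is currently running (uname -r). The helper name nvidia_module_installed is hypothetical, and the match is deliberately loose because the exact dkms status line format differs between dkms versions.

```shell
#!/bin/sh
# Sketch: check whether DKMS reports an nvidia module installed for a kernel.
# nvidia_module_installed is a hypothetical helper, not part of dkms.
# $1: kernel release (e.g. from `uname -r`); reads `dkms status` text on stdin.
nvidia_module_installed() {
    grep '^nvidia' | grep -F -- "$1" | grep -q 'installed'
}

# Only query the live system when dkms is actually available.
if command -v dkms >/dev/null 2>&1; then
    if dkms status 2>/dev/null | nvidia_module_installed "$(uname -r)"; then
        echo "nvidia module is installed for $(uname -r)"
    else
        echo "no nvidia module installed for $(uname -r)"
    fi
fi
```

If no module is installed for the running kernel, that would match the "couldn't communicate with the NVIDIA driver" message after a reboot into a newer kernel.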
The third is crashing due to GPU errors. This might be due to overheating, or the GPU may be damaged.
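To see such errors yourself, a minimal sketch: the NVIDIA kernel driver reports GPU faults in the kernel log as lines containing "NVRM: Xid", so filtering the log for that string shows whether and when the GPU failed. xid_lines is a hypothetical helper name, and reading dmesg may require root on some systems.

```shell
#!/bin/sh
# Sketch: extract NVIDIA Xid error reports from kernel log text.
# xid_lines is a hypothetical helper; it filters log text on stdin.
xid_lines() {
    grep 'NVRM: Xid' || true
}

# Read the kernel ring buffer if possible; errors (e.g. missing permissions)
# are ignored so the script simply prints nothing in that case.
{ dmesg 2>/dev/null || true; } | xid_lines
```

The number after the PCI address in each line is the Xid code, which can be looked up in NVIDIA's Xid documentation to tell a hardware fault from a software one.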
So, on the first two computers I installed the NVIDIA drivers with the .run file, and that solved the problem.
But this issue happened when I ran a new AI algorithm on the GPU.
Hello again, the third one has probably crashed. nvidia-smi shows:
Unable to determine the device handle for GPU 0000:01:00.0: Unknown Error
nvidia-bug-report.log (1.2 MB)