After a hard shutdown caused by power outage on a linux headless server, I am getting the following error after nvidia-smi “NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.” I installed the driver using Linux-x86_64/515.65.01/NVIDIA-Linux-x86_64-515.65.01.run
Last time when I had to face such a problem, I purged and reinstalled the driver but that caused the server to shutdown after every 5-10 mins. I had physical access to the server so I re-installed the OS and things went back to normal. This time I do not have physical access to the server and do not want to cause any further damage. Please, someone help me to reinstall the driver without causing any further harm to the server.
I am attaching the bug report.
Thank you in advance.
nvidia-bug-report.log (221.3 KB)