I have a gpu box running Ubuntu Server 20.04 LTS with four 2080 Ti’s. Today, after upgrading my system, I purged my previous versions of CUDA toolkit and nvidia driver to install newer versions. I am able to successfully install any of the drivers listed on
ubuntu-drivers devices and can confirm this by successfully running
nvidia-smi on my system, post-reboot. But now my computer randomly shuts down without warning! I have encountered this problem before and have always been able to fixed it by installing another driver version listed on
ubuntu-drivers devices, but this time no luck.
Really at a loss here. I do not know how to fix nor why this happens. Anyone know of a solution to this?
This drivers in question are 418-server, 450, 450-server, 455, and 460.