Ubuntu 24.04 Nvidia Driver unstable after reboot: NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver

OS: Ubuntu 24.04
Kernel: 6.8.0-39-generic
Driver: nvidia-driver-535
Machine: Dell Precision 7670 with an A4500 GPU in it.

I upgraded to Ubuntu 24.04 from 22.04 and am having some trouble getting nvidia-driver-535 working.

I went through the normal installation steps and it appeared to be working: running nvidia-smi showed the expected output. However, after another reboot, it now says “NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.”

These are the steps I took:

sudo apt-get remove --purge 'libnvidia-.*'
sudo apt-get remove --purge '^nvidia-.*'
sudo apt-get remove --purge '^libnvidia-.*'
sudo apt-get remove --purge '^cuda-.*'
sudo apt clean
sudo apt autoremove

Then I ran:

sudo add-apt-repository ppa:graphics-drivers/ppa --yes
sudo apt update
update-pciids
sudo apt install nvidia-driver-535
dkms status
sudo apt reinstall linux-headers-$(uname -r)
sudo update-initramfs -u
sudo reboot

dkms status at this point shows

nvidia/535.183.01, 6.8.0-39-generic, x86_64: installed

It seemed to be working, but after restarting the computer I’m getting an error “NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.”
nvidia-bug-report.log (2.5 MB)
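
Since dkms status reports the module as installed for 6.8.0-39-generic, a few quick checks after the failing reboot might narrow things down. The sketch below is only a rough diagnostic, not a fix. The function names dkms_line_matches and check_nvidia_stack are hypothetical, and the Secure Boot check is an assumption about one possible cause that the post itself does not confirm. It checks whether the booted kernel matches the kernel DKMS built the module for, whether the nvidia module is loaded, and what mokutil reports for Secure Boot.

```shell
#!/bin/sh
# Sketch only: these function names are hypothetical, not part of any NVIDIA tool.

# Succeeds if one line of `dkms status` output reports the nvidia module
# as installed for the given kernel release.
dkms_line_matches() {
    line=$1
    kernel=$2
    case "$line" in
        nvidia/*", $kernel, "*": installed"*) return 0 ;;
        *) return 1 ;;
    esac
}

check_nvidia_stack() {
    kernel=$(uname -r)
    echo "Running kernel: $kernel"

    # 1. Is the DKMS module built for the kernel that actually booted?
    if command -v dkms >/dev/null 2>&1; then
        found=no
        while IFS= read -r line; do
            if dkms_line_matches "$line" "$kernel"; then
                found=yes
            fi
        done <<EOF
$(dkms status 2>/dev/null)
EOF
        echo "DKMS nvidia module installed for this kernel: $found"
    else
        echo "dkms not found"
    fi

    # 2. Is the nvidia kernel module currently loaded?
    if command -v lsmod >/dev/null 2>&1; then
        if lsmod | grep -q '^nvidia '; then
            echo "nvidia kernel module: loaded"
        else
            echo "nvidia kernel module: not loaded"
        fi
    fi

    # 3. Assumption: Secure Boot may prevent an unsigned module from loading.
    if command -v mokutil >/dev/null 2>&1; then
        mokutil --sb-state 2>/dev/null || echo "Secure Boot state unavailable"
    fi
    return 0
}
```

After sourcing the file, running check_nvidia_stack prints the results. If the running kernel differs from the one in the dkms status line, the module was probably not built for the kernel that booted.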

I am trying to use CUDA on 24.10

I had to remove the installed NVIDIA drivers, install the driver using the .run file from the CUDA Toolkit 12.6 Update 3 Downloads page on NVIDIA Developer, and then reboot.