In order to use tensorflow >=1.5.0, I tried to upgrade my Ubuntu 16.04 server with 2 GTX 1070 GPUs from Cuda 8 to Cuda 9.
On my first attempt, I used the local .deb installer for 9.1 but after installation, when I tried nvidia-smi it complained:
“NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.”.
I tried to download and reinstall the driver manually at this stage, but the .run file aborted saying the preinstall failed.
Today, I returned to the problem - this time installing 9.2 (using network installer) after explicitly removing the old cuda 8 install (which I realised I’d forgotten to do first time) and of course the attempted 9.1 install. However the same problem occurs - nvidia-smi reports the same problem.
Note that ‘lspci | grep -i nvidia’ confirms the GPUs are there. Also I get the following error after ‘sudo /sbin/modprobe nvidia’:
“modprobe: ERROR: could not insert ‘nvidia_396’: Exec format error”
I haven’t tried reinstalling the driver - is that what I need to do next?