Hello.
I work with a machine that has run CUDA 9.0 (and previous versions of CUDA down to 7.0) without error; the specs are posted below. I recently attempted to install CUDA 10.0 but hit some installation errors. Other support topics suggested purging my system of NVIDIA packages and doing a fresh reinstall following the official documentation (Installation Guide Linux :: CUDA Toolkit Documentation). I did that and rebooted, but I now get the following error when running nvidia-smi:
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
I attempted to start the driver using
sudo modprobe nvidia
but received the following error message:
modprobe: ERROR: could not insert 'nvidia_418': Package not installed
This is confusing to me, as that driver should have just been installed with my fresh CUDA 10.0 installation, which I did through the package manager installer. Running
dpkg -l | grep nvidia
I get:
ii nvidia-418 418.56-0ubuntu0~gpu14.04.1 amd64 NVIDIA binary driver - version 418.56
ii nvidia-418-dev 418.56-0ubuntu0~gpu14.04.1 amd64 NVIDIA binary Xorg driver development files
ii nvidia-modprobe 418.40.04-0ubuntu1 amd64 Load the NVIDIA kernel driver and create device files
ii nvidia-opencl-icd-418 418.56-0ubuntu0~gpu14.04.1 amd64 NVIDIA OpenCL ICD
ii nvidia-prime 0.6.2.1 amd64 Tools to enable NVIDIA's Prime
ii nvidia-settings 418.56-0ubuntu0~gpu14.04.1 amd64 Tool for configuring the NVIDIA graphics driver
which lists the driver that I supposedly don’t have installed. Attempting to install this driver via apt-get also reports that it is already installed.
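To make the package check above easier to read, the listing can be filtered down to just the driver package itself. This is only a sketch: the function name `installed_nvidia_drivers` is made up for illustration, and it assumes driver packages are named `nvidia-<number>`, as in the listing above.

```shell
# Hypothetical helper: read `dpkg -l` output on stdin and print
# "package version" for each fully installed (status "ii") package
# whose name is exactly nvidia-<digits>, e.g. nvidia-418.
# Assumption: driver packages follow that naming scheme.
installed_nvidia_drivers() {
    awk '$1 == "ii" && $2 ~ /^nvidia-[0-9]+$/ { print $2, $3 }'
}

# Usage on the affected machine:
#   dpkg -l | installed_nvidia_drivers
```

This only confirms what the package database believes is installed; it says nothing about whether a kernel module was built or loaded.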
Running
lsmod | grep nvidia
returns nothing, so the kernel module is not loaded, which may be why nvidia-smi can’t communicate with the driver. But I’m not sure how to get the module installed correctly if that is the case…
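The loaded-module check can be written as a small function, followed by a few standard commands that might show whether a module was ever built for the running kernel. This is a rough sketch rather than a known fix: `nvidia_module_loaded` is a made-up name, and the module name `nvidia_418` is taken from the modprobe error above.

```shell
# Hypothetical helper: read `lsmod` output on stdin and succeed (exit 0)
# if any module whose name starts with "nvidia" is listed, else exit 1.
nvidia_module_loaded() {
    awk '$1 ~ /^nvidia/ { found = 1 } END { exit !found }'
}

# Usage on the affected machine (left commented so nothing runs by accident):
#   lsmod | nvidia_module_loaded || echo "no NVIDIA kernel module loaded"
#   uname -r                  # kernel that is currently running
#   dkms status               # whether DKMS lists an nvidia module for that kernel
#   modinfo nvidia_418        # whether a module file exists for that kernel
#   dmesg | grep -i nvidia    # kernel log messages from the failed load attempt
```

If the package is installed but `modinfo` finds no module for the kernel reported by `uname -r`, that would suggest the module was not built for the running kernel. I have not confirmed that this is the cause here.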
Does anyone know what the next step is at this point? I have already tried uninstalling and reinstalling multiple times, rebooting each time. Any help would be appreciated.
Machine specifications:
Distributor ID: Ubuntu
Description: Ubuntu 14.04.6 LTS
Release: 14.04
Codename: trusty