Hi
I wanted to install Nvidia drivers on a Rocky machine. Coming from Debian, I went to the Nvidia page and followed their tutorial. NVIDIA CUDA Installation Guide for Linux
After following it to the letter, I couldn’t run nvidia-smi Then I looked in their forum, read the Nvidia topics here, but I can’t find a solution.
How can I go back to “the rocky way of doing things”?
Currently I have this:
$ nvidia-smi
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
# ls -l /dev/nv*
crw-------. 1 root root 10, 144 Jun 23 02:06 /dev/nvram
# lsmod | grep -i nv
# rpm -qa | grep -i nv
nvidia-settings-530.30.02-1.el8.x86_64
libnvjpeg-12-1-12.2.0.2-1.x86_64
cuda-nvvp-12-1-12.1.105-1.x86_64
cuda-nvprof-12-1-12.1.105-1.x86_64
nvidia-driver-cuda-libs-530.30.02-1.el8.x86_64
libnvjpeg-devel-12-1-12.2.0.2-1.x86_64
cuda-nvprune-12-1-12.1.105-1.x86_64
nvidia-gds-12-1-12.1.1-1.x86_64
nvidia-driver-cuda-530.30.02-1.el8.x86_64
cuda-nvdisasm-12-1-12.1.105-1.x86_64
nvme-cli-1.16-7.el8.x86_64
nvidia-persistenced-530.30.02-1.el8.x86_64
cuda-nvml-devel-12-1-12.1.105-1.x86_64
libblockdev-nvdimm-2.28-2.el8.x86_64
nvidia-libXNVCtrl-530.30.02-1.el8.x86_64
kmod-nvidia-latest-dkms-530.30.02-1.el8.x86_64
nvidia-modprobe-530.30.02-1.el8.x86_64
libnvjitlink-12-1-12.1.105-1.x86_64
nvidia-detect-525.85.05-1.el8.x86_64
glibc-gconv-extra-2.28-225.el8.x86_64
environment-modules-4.5.2-3.el8.x86_64
nvidia-libXNVCtrl-devel-530.30.02-1.el8.x86_64
nvidia-driver-libs-530.30.02-1.el8.x86_64
nvidia-driver-devel-530.30.02-1.el8.x86_64
libnvvm-samples-12-1-12.1.105-1.x86_64
nvidia-driver-530.30.02-1.el8.x86_64
nvidia-xconfig-530.30.02-1.el8.x86_64
cuda-nvrtc-12-1-12.1.105-1.x86_64
cuda-nvrtc-devel-12-1-12.1.105-1.x86_64
cuda-nvtx-12-1-12.1.105-1.x86_64
nvidia-fs-2.15.3-1.x86_64
nvidia-driver-NVML-530.30.02-1.el8.x86_64
nvidia-kmod-common-530.30.02-1.el8.noarch
nvidia-fs-dkms-2.15.3-1.x86_64
nvidia-driver-NvFBCOpenGL-530.30.02-1.el8.x86_64
libnvjitlink-devel-12-1-12.1.105-1.x86_64
cuda-nvcc-12-1-12.1.105-1.x86_64
nvidia-gds-12.1.1-1.x86_64
dnf-plugin-nvidia-2.0-1.el8.noarch
# lspci | grep NV
86:00.0 3D controller: NVIDIA Corporation GK210GL [Tesla K80] (rev a1)
87:00.0 3D controller: NVIDIA Corporation GK210GL [Tesla K80] (rev a1)
$ nvidia-detect
kmod-nvidia-470xx
WARNING: Xorg log file /var/log/Xorg.0.log does not exist
WARNING: Unable to determine Xorg ABI compatibility
WARNING: The driver for this device does not support the current Xorg version
I tried this as a last resort
$ sudo dnf install $(nvidia-detect)
WARNING: Xorg log file /var/log/Xorg.0.log does not exist
WARNING: Unable to determine Xorg ABI compatibility
WARNING: The driver for this device does not support the current Xorg version
Last metadata expiration check: 0:16:37 ago on Fri 23 Jun 2023 03:00:40 AM EDT.
Error:
Problem: package kmod-nvidia-470xx-470.182.03-2.el8_8.elrepo.x86_64 requires nvidia-x11-drv-470xx = 470.182.03, but none of the providers can be installed
- cannot install the best candidate for the job
- package nvidia-x11-drv-470xx-470.182.03-1.el8_7.elrepo.x86_64 is filtered out by modular filtering
- package nvidia-x11-drv-470xx-470.182.03-2.el8_8.elrepo.x86_64 is filtered out by modular filtering
(try to add '--skip-broken' to skip uninstallable packages or '--nobest' to use not only best candidate packages)
and what I’d like to have is the 530-dkms (correct me if I’m wrong there please).
$ sudo dnf module list nvidia-driver
Last metadata expiration check: 0:11:20 ago on Fri 23 Jun 2023 03:00:40 AM EDT.
cuda-rhel8-x86_64
Name Stream Profiles Summary
nvidia-driver latest default [d], fm, ks, src Nvidia driver for latest branch
nvidia-driver latest-dkms [d][e] default [d] [i], fm, ks Nvidia driver for latest-dkms branch
nvidia-driver open-dkms default [d], fm, ks, src Nvidia driver for open-dkms branch
nvidia-driver 418 default [d], fm, ks, src Nvidia driver for 418 branch
nvidia-driver 418-dkms default [d], fm, ks Nvidia driver for 418-dkms branch
nvidia-driver 440 default [d], fm, ks, src Nvidia driver for 440 branch
nvidia-driver 440-dkms default [d], fm, ks Nvidia driver for 440-dkms branch
nvidia-driver 450 default [d], fm, ks, src Nvidia driver for 450 branch
nvidia-driver 450-dkms default [d], fm, ks Nvidia driver for 450-dkms branch
nvidia-driver 455 default [d], fm, ks, src Nvidia driver for 455 branch
nvidia-driver 455-dkms default [d], fm, ks Nvidia driver for 455-dkms branch
nvidia-driver 460 default [d], fm, ks, src Nvidia driver for 460 branch
nvidia-driver 460-dkms default [d], fm, ks Nvidia driver for 460-dkms branch
nvidia-driver 465 default [d], fm, ks, src Nvidia driver for 465 branch
nvidia-driver 465-dkms default [d], fm, ks Nvidia driver for 465-dkms branch
nvidia-driver 470 default [d], fm, ks, src Nvidia driver for 470 branch
nvidia-driver 470-dkms default [d], fm, ks Nvidia driver for 470-dkms branch
nvidia-driver 495 default [d], fm, ks, src Nvidia driver for 495 branch
nvidia-driver 495-dkms default [d], fm, ks Nvidia driver for 495-dkms branch
nvidia-driver 510 default [d], fm, ks, src Nvidia driver for 510 branch
nvidia-driver 510-dkms default [d], fm, ks Nvidia driver for 510-dkms branch
nvidia-driver 515 default [d], fm, ks, src Nvidia driver for 515 branch
nvidia-driver 515-dkms default [d], fm, ks Nvidia driver for 515-dkms branch
nvidia-driver 515-open default [d], fm, ks, src Nvidia driver for 515-open branch
nvidia-driver 520 default [d], fm, ks, src Nvidia driver for 520 branch
nvidia-driver 520-dkms default [d], fm, ks Nvidia driver for 520-dkms branch
nvidia-driver 520-open default [d], fm, ks, src Nvidia driver for 520-open branch
nvidia-driver 525 default [d], fm, ks, src Nvidia driver for 525 branch
nvidia-driver 525-dkms default [d], fm, ks Nvidia driver for 525-dkms branch
nvidia-driver 525-open default [d], fm, ks, src Nvidia driver for 525-open branch
nvidia-driver 530 default [d], fm, ks, src Nvidia driver for 530 branch
nvidia-driver 530-dkms default [d], fm, ks Nvidia driver for 530-dkms branch
nvidia-driver 530-open default [d], fm, ks, src Nvidia driver for 530-open branch
Hint: [d]efault, [e]nabled, [x]disabled, [i]nstalled
How can I install the driver correctly?