I see several posts, but mostly on Ubuntu with the nvidia-smi
error. Installed are
cuda-11-5-11.5.1-1.x86_64
cuda-11.5.1-1.x86_64
nvidia-driver-cuda-495.29.05-1.fc34.x86_64
nvidia-smi
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
nvidia-settings
Unable to init server: Could not connect: Connection refused
ERROR: The control display is undefined; please run `nvidia-settings --help` for usage information.
lspci -nnk |egrep -A3 -i "3D|VGA"
06:00.0 VGA compatible controller [0300]: Matrox Electronics Systems Ltd. G200eR2 [102b:0534]
DeviceName: Embedded Video
Subsystem: Dell Device [1028:04f8]
Kernel driver in use: mgag200
--
41:00.0 VGA compatible controller [0300]: NVIDIA Corporation GP107GL [Quadro P1000] [10de:1cb1] (rev a1)
Subsystem: NVIDIA Corporation Device [10de:11bc]
Kernel modules: nouveau
41:00.1 Audio device [0403]: NVIDIA Corporation GP107GL High Definition Audio Controller [10de:0fb9] (rev a1)
In the system logs I see:
Dec 1 02:15:57 systemd-udevd[955]: /usr/lib/udev/rules.d/60-nvidia.rules:1 Invalid value "/usr/bin/bash -c '/usr/bin/mknod -Z -m 666 /dev/nvidiactl c $(grep nvidia-frontend /proc/devices | cut -d \ -f 1) 255" for RUN (char 61: invalid substitution type), ignoring, but please fix it.
Dec 1 02:15:57 systemd-udevd[955]: /usr/lib/udev/rules.d/60-nvidia.rules:2 Invalid value "/usr/bin/bash -c 'for i in $(cat /proc/driver/nvidia/gpus/*/information | grep Minor | cut -d \ -f 4); do /usr/bin/mknod -Z -m 666 /dev/nvidia${i} c $(grep nvidia-frontend /proc/devices | cut -d \ -f 1) ${i}; done" for RUN (char 28: invalid substitution type), ignoring, but please fix it.
Dec 1 02:15:57 systemd-udevd[955]: /usr/lib/udev/rules.d/60-nvidia.rules:3 Invalid value "/usr/bin/bash -c '/usr/bin/mknod -Z -m 666 /dev/nvidia-modeset c $(grep nvidia-frontend /proc/devices | cut -d \ -f 1) 254" for RUN (char 66: invalid substitution type), ignoring, but please fix it.
Dec 1 02:15:57 systemd-udevd[955]: /usr/lib/udev/rules.d/60-nvidia.rules:4 Invalid value "/usr/bin/bash -c '/usr/bin/mknod -Z -m 666 /dev/nvidia-uvm c $(grep nvidia-uvm /proc/devices | cut -d \ -f 1) 0'" for RUN (char 62: invalid substitution type), ignoring, but please fix it.
Dec 1 02:15:57 systemd-udevd[955]: /usr/lib/udev/rules.d/60-nvidia.rules:5 Invalid value "/usr/bin/bash -c '/usr/bin/mknod -Z -m 666 /dev/nvidia-uvm-tools c $(grep nvidia-uvm /proc/devices | cut -d \ -f 1) 1'" for RUN (char 68: invalid substitution type), ignoring, but please fix it.
But I don’t know if that’s just a red herring. I’d prefer not to have to reboot. Is there an easy fix for this? I do have GRUB_CMDLINE_LINUX="rd.driver.blacklist=nouveau audit=0"
in /etc/default/grub
When I try running ./NVIDIA-Linux-x86_64-495.44
.run I get:
ERROR: You appear to be running an X server; please exit X before installing. For further details, please see the section INSTALLING THE NVIDIA DRIVER in the README available on the Linux
driver download page at www.nvidia.com.
Edit:
Well I found my answer.
ps -aux|grep X
root 912505 0.1 0.0 2934868 5172 tty1 Ssl+ Nov22 18:32 /usr/libexec/Xorg -background none :0 vt01 -nolisten tcp -novtswitch -auth /var/run/lxdm/lxdm-:0.auth
So I stopped/disabled lxdm
via systemctl
and the installer completed.
nvidia-smi
Wed Dec 1 16:23:18 2021
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 495.44 Driver Version: 495.44 CUDA Version: 11.5 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 Quadro P1000 Off | 00000000:41:00.0 Off | N/A |
| 29% 44C P0 N/A / N/A | 0MiB / 4040MiB | 1% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| No running processes found |
+-----------------------------------------------------------------------------+