RHEL9/NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver

Hi, my server was rebooted (seemingly as a mistake/hard reboot). The configuration is:

lspci | grep -i nvidia
0d:00.0 3D controller: NVIDIA Corporation GA102GL [A40] (rev a1)
b5:00.0 3D controller: NVIDIA Corporation GA102GL [A40] (rev a1)```

I’m not in secure mode:

mokutil --sb-state
SecureBoot disabled

I have the following nvidia drivers installed:

rpm -qa | grep -i Nvidia
dnf-plugin-nvidia-2.0-1.el9.noarch
kmod-nvidia-latest-dkms-545.23.08-1.el9.x86_64
nvidia-driver-545.23.08-1.el9.x86_64
nvidia-driver-cuda-545.23.08-1.el9.x86_64
nvidia-driver-cuda-libs-545.23.08-1.el9.x86_64
nvidia-driver-devel-545.23.08-1.el9.x86_64
nvidia-driver-libs-545.23.08-1.el9.x86_64
nvidia-driver-NvFBCOpenGL-545.23.08-1.el9.x86_64
nvidia-driver-NVML-545.23.08-1.el9.x86_64
nvidia-kmod-common-545.23.08-1.el9.noarch
nvidia-libXNVCtrl-545.23.08-1.el9.x86_64
nvidia-libXNVCtrl-devel-545.23.08-1.el9.x86_64
nvidia-modprobe-545.23.08-1.el9.x86_64
nvidia-persistenced-545.23.08-1.el9.x86_64
nvidia-settings-545.23.08-1.el9.x86_64
nvidia-xconfig-545.23.08-1.el9.x86_64
pcp-pmda-nvidia-gpu-6.0.5-4.el9.x86_64

Is there anything obviously askew? Do I need to tell rhel to load my GPUs after reboot? I attach the
nvidia-bug-report.log (2.3 MB)
I would appreciate any advice! Thanks in advance.

You booted an old kernel, 5.14.0-362.13.1 but the driver is installed for the current kernel, 5.14.0-362.24.1.
Please select the correct kernel in grub menu.

1 Like

Thanks! That was it…

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.