RHEL 8.4 is out, and display is blank

Hi,

I just upgraded to RHEL 8.4. Does the driver need an update, and if so, when will one be available?

The symptom is that nothing shows up on the display.

Kernel: kernel-core-4.18.0-305.el8.x86_64

Drivers:

nvidia-libXNVCtrl-devel-450.119.04-1.el8.x86_64
nvidia-modprobe-450.119.04-1.el8.x86_64
nvidia-driver-450.119.04-1.el8.x86_64
nvidia-libXNVCtrl-450.119.04-1.el8.x86_64
nvidia-driver-libs-450.119.04-1.el8.x86_64
nvidia-driver-devel-450.119.04-1.el8.x86_64
nvidia-persistenced-450.119.04-1.el8.x86_64
nvidia-driver-cuda-libs-450.119.04-1.el8.x86_64
nvidia-driver-cuda-450.119.04-1.el8.x86_64
nvidia-kmod-common-450.119.04-1.el8.noarch
nvidia-xconfig-450.119.04-1.el8.x86_64
nvidia-driver-NVML-450.119.04-1.el8.x86_64
nvidia-settings-450.119.04-1.el8.x86_64
nvidia-driver-NvFBCOpenGL-450.119.04-1.el8.x86_64

GPU:
01:00.0 VGA compatible controller: NVIDIA Corporation GP107 [GeForce GTX 1050 Ti] (rev a1)

OK, now I have time to come back to this. There is no nvidia module at all in lsmod. I don’t know what happened with the screens yesterday; both were blank, but now one of them is alive and shows a basic framebuffer display.

When I look around, I see that no nvidia-driver is loaded.

There is no module for the new kernel:

╰─➤  find /lib/modules -iname '*nvidia*' 
/lib/modules/4.18.0-240.15.1.el8_3.x86_64/weak-updates/nvidia-drm.ko.xz
/lib/modules/4.18.0-240.15.1.el8_3.x86_64/weak-updates/nvidia-modeset.ko.xz
/lib/modules/4.18.0-240.15.1.el8_3.x86_64/weak-updates/nvidia-uvm.ko.xz
/lib/modules/4.18.0-240.15.1.el8_3.x86_64/weak-updates/nvidia.ko.xz
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/extra/drivers/video/nvidia
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/extra/drivers/video/nvidia/nvidia.ko
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/extra/drivers/video/nvidia/nvidia-uvm.ko
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/extra/drivers/video/nvidia/nvidia-modeset.ko
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/extra/drivers/video/nvidia/nvidia-drm.ko
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/weak-updates/drivers/video/nvidia
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/weak-updates/drivers/video/nvidia/nvidia-drm.ko
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/weak-updates/drivers/video/nvidia/nvidia-modeset.ko
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/weak-updates/drivers/video/nvidia/nvidia-uvm.ko
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/weak-updates/drivers/video/nvidia/nvidia.ko
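
For anyone who wants to run the same check across all installed kernels at once, here is a rough sketch. It only looks at file names under /lib/modules, the same way the find above does. The function name and the MODULES_ROOT variable are mine (hypothetical), not something the driver packages provide.

```shell
#!/bin/sh
# List kernel directories that contain no nvidia kernel module files.
# MODULES_ROOT is a hypothetical override so the check can be tried on a
# scratch directory; by default it points at /lib/modules.
MODULES_ROOT="${MODULES_ROOT:-/lib/modules}"

kernels_without_nvidia() {
    for dir in "$MODULES_ROOT"/*/; do
        [ -d "$dir" ] || continue
        # Matches nvidia.ko, nvidia-drm.ko, nvidia.ko.xz, ... (files or symlinks)
        if [ -z "$(find "$dir" -iname 'nvidia*.ko*' 2>/dev/null | head -n 1)" ]; then
            basename "$dir"
        fi
    done
}

kernels_without_nvidia
echo "running kernel: $(uname -r)"
```

If the running kernel shows up in the list, that would match what I am seeing here: the module simply is not there for it.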

So I went ahead and reinstalled the driver to see whether that would trigger a module install for the new kernel:

╰─➤  sudo dnf remove nvidia-driver
╰─➤  sudo dnf module install nvidia-driver:450

And sure, it did:

╰─➤  find /lib/modules -iname '*nvidia*'  
/lib/modules/4.18.0-240.15.1.el8_3.x86_64/weak-updates/nvidia-drm.ko.xz
/lib/modules/4.18.0-240.15.1.el8_3.x86_64/weak-updates/nvidia-modeset.ko.xz
/lib/modules/4.18.0-240.15.1.el8_3.x86_64/weak-updates/nvidia-uvm.ko.xz
/lib/modules/4.18.0-240.15.1.el8_3.x86_64/weak-updates/nvidia.ko.xz
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/extra/drivers/video/nvidia
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/extra/drivers/video/nvidia/nvidia.ko
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/extra/drivers/video/nvidia/nvidia-uvm.ko
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/extra/drivers/video/nvidia/nvidia-modeset.ko
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/extra/drivers/video/nvidia/nvidia-drm.ko
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/weak-updates/drivers/video/nvidia
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/weak-updates/drivers/video/nvidia/nvidia-drm.ko
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/weak-updates/drivers/video/nvidia/nvidia-modeset.ko
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/weak-updates/drivers/video/nvidia/nvidia-uvm.ko
/lib/modules/4.18.0-240.22.1.el8_3.x86_64/weak-updates/drivers/video/nvidia/nvidia.ko
/lib/modules/4.18.0-305.el8.x86_64/extra/drivers/video/nvidia
/lib/modules/4.18.0-305.el8.x86_64/extra/drivers/video/nvidia/nvidia.ko
/lib/modules/4.18.0-305.el8.x86_64/extra/drivers/video/nvidia/nvidia-uvm.ko
/lib/modules/4.18.0-305.el8.x86_64/extra/drivers/video/nvidia/nvidia-modeset.ko
/lib/modules/4.18.0-305.el8.x86_64/extra/drivers/video/nvidia/nvidia-drm.ko

And both of the displays work fine!

So perhaps the nvidia-driver rpm is missing a kernel update hook, so the modules are not reinstalled automatically when the kernel is updated?
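
One way to look into this might be to dump the scriptlets and triggers of whatever package ships the kernel modules. This is only a sketch: the 'kmod-nvidia*' glob is my assumption about how the module packages are named, and it may not match what is installed on a given system. The function name is hypothetical too.

```shell
#!/bin/sh
# Print install scriptlets and triggers of installed nvidia kmod packages.
# ASSUMPTION: the module packages match the glob 'kmod-nvidia*'.
show_nvidia_kmod_hooks() {
    if ! command -v rpm >/dev/null 2>&1; then
        echo "rpm not found"
        return 0
    fi
    pkgs=$(rpm -qa 'kmod-nvidia*')
    if [ -z "$pkgs" ]; then
        echo "no kmod-nvidia packages installed"
        return 0
    fi
    for pkg in $pkgs; do
        echo "== $pkg =="
        rpm -q --scripts "$pkg"    # %pre/%post/%preun/%postun scriptlets
        rpm -q --triggers "$pkg"   # trigger scriptlets, if any
    done
}

show_nvidia_kmod_hooks
```

If nothing in that output reacts to a new kernel being installed, that would be consistent with the missing hook guess, but it would not prove it.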

For those running into this and searching the Red Hat Bugzilla, I filed a bug there too: Bug Access Denied

You need to rebuild the kmod for your RHEL 8.4 kernel. If you don’t know how to do that, reboot into the older 8.3 kernel.
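
Falling back to the older kernel can be done from the GRUB menu at boot, or by changing the default entry with grubby. A minimal sketch follows, assuming the 8.3 kernel from the find output earlier in the thread is still installed. The DRY_RUN variable and the function name are hypothetical, and by default the script only prints the command instead of running it.

```shell
#!/bin/sh
# Make an older, known-good kernel the default boot entry using grubby.
# DRY_RUN is a hypothetical safety switch: with DRY_RUN=1 (the default
# here) the command is only printed, nothing is changed.
DRY_RUN="${DRY_RUN:-1}"

set_default_kernel() {
    kver="$1"
    image="/boot/vmlinuz-$kver"
    if [ "$DRY_RUN" = "1" ]; then
        echo "grubby --set-default $image"
    else
        grubby --set-default "$image"   # needs root
    fi
}

# Kernel version taken from the find output earlier in the thread.
set_default_kernel 4.18.0-240.22.1.el8_3.x86_64
```

Run it with DRY_RUN=0 as root to actually change the default, then reboot.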

The other way is to use akmod-nvidia from RPM Fusion (that means removing the nvidia-driver module provided by NVIDIA and installing akmod-nvidia from rpmfusion-nonfree).

This may work (untested)
#See Configuration - RPM Fusion
dnf shell

module disable nvidia-driver
install akmod-nvidia
run