[Solved] No devices were found for RTX 5080 on Ubuntu 2024 (once worked)

I have a RTX 5080 on Ubuntu 2024 and I cannot make the cuda driver work again.

It once worked but did not any more after a reboot. I used the change to run apt-get upgrade, removed nvidia (12.8) and installed the current one (12.9).

> nvidia-smi
No devices were found
> lsmod | grep nvi
nvidia_uvm           2048000  0
nvidia_drm            131072  0
nvidia_modeset       1548288  1 nvidia_drm
nvidia              90316800  4 nvidia_uvm,nvidia_modeset
drm_ttm_helper         16384  1 nvidia_drm
video                  77824  2 asus_wmi,nvidia_modeset

nouveau is black listed and not loaded.

I’m only remotely on the machine, but AFAIK I have secure boot disabled, which is what also nivdia-but-report.sh says

> grep Secure nvidia-bug-report.log
[ 0.000000] secureboot: **Secure** boot disabled
[ 0.003847] secureboot: **Secure** boot disabled
[ 0.564931] Loaded X.509 cert 'Canonical Ltd. **Secure** Boot Signing: 61482aa2830d0ab2ad5af10b7250da9033ddcef0'
[ 0.564939] Loaded X.509 cert 'Canonical Ltd. **Secure** Boot Signing (2017): 242ade75ac4a15e50d50c84b0d45ff3eae707a03'
[ 0.564945] Loaded X.509 cert 'Canonical Ltd. **Secure** Boot Signing (ESM 2018): 365188c1d374d6b07c3c8f240f8ef722433d6a8b'
[ 0.564952] Loaded X.509 cert 'Canonical Ltd. **Secure** Boot Signing (2019): c0746fd6c5da3ae827864651ad66ae47fe24b3e8'
[ 0.564959] Loaded X.509 cert 'Canonical Ltd. **Secure** Boot Signing (2021 v1): a8d54bbb3825cfb94fa13c9f8a594a195c107b8d'
[ 0.564966] Loaded X.509 cert 'Canonical Ltd. **Secure** Boot Signing (2021 v2): 4cf046892d6fd3c9a5b03f98d845f90851dc6a8c'
[ 0.564972] Loaded X.509 cert 'Canonical Ltd. **Secure** Boot Signing (2021 v3): 100437bb6de6e469b581e61cd66bce3ef4ed53af'
[ 0.564979] Loaded X.509 cert 'Canonical Ltd. **Secure** Boot Signing (Ubuntu Core 2019): c1d57b8f6b743f23ee41f4f7ee292f06eecadfb9'

I tried both, nvidia-driver-570 and 575

> ubuntu-drivers devices
udevadm hwdb is deprecated. Use systemd-hwdb instead.
udevadm hwdb is deprecated. Use systemd-hwdb instead.
udevadm hwdb is deprecated. Use systemd-hwdb instead.
udevadm hwdb is deprecated. Use systemd-hwdb instead.
udevadm hwdb is deprecated. Use systemd-hwdb instead.
udevadm hwdb is deprecated. Use systemd-hwdb instead.
== /sys/devices/pci0000:00/0000:00:01.0/0000:01:00.0 ==
modalias : pci:v000010DEd00002C02sv00001043sd000089D7bc03sc00i00
vendor   : NVIDIA Corporation
driver   : nvidia-driver-570-open - third-party non-free
driver   : nvidia-driver-575-open - third-party non-free
driver   : nvidia-driver-570 - third-party non-free
driver   : nvidia-driver-575 - third-party non-free recommended
driver   : nvidia-driver-570-server-open - distro non-free
driver   : nvidia-driver-570-server - distro non-free
driver   : xserver-xorg-video-nouveau - distro free builtin

With both drivers I have the issue

> sudo dmesg | grep nvidia
[ 2.284334] **nvidia**: loading out-of-tree module taints kernel.
[ 2.284340] **nvidia**: module license 'NVIDIA' taints kernel.
[ 2.284342] **nvidia**: module verification failed: signature and/or required key missing - tainting kernel
[ 2.284342] **nvidia**: module license taints kernel.
[ 2.348674] **nvidia**-nvlink: Nvlink Core is being initialized, major device number 235
[ 2.349695] **nvidia** 0000:01:00.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=io+mem
[ 2.361599] **nvidia**_fs: module using GPL-only symbols uses symbols **nvidia**_p2p_dma_unmap_pages from proprietary module **nvidia**.
[ 2.361601] **nvidia**_fs: Unknown symbol **nvidia**_p2p_dma_unmap_pages (err -2)
[ 2.361607] **nvidia**_fs: module using GPL-only symbols uses symbols **nvidia**_p2p_put_pages_persistent from proprietary module **nvidia**.
[ 2.361607] **nvidia**_fs: Unknown symbol **nvidia**_p2p_put_pages_persistent (err -2)
[ 2.361615] **nvidia**_fs: module using GPL-only symbols uses symbols **nvidia**_p2p_get_pages from proprietary module **nvidia**.
[ 2.361615] **nvidia**_fs: Unknown symbol **nvidia**_p2p_get_pages (err -2)
[ 2.361621] **nvidia**_fs: module using GPL-only symbols uses symbols **nvidia**_p2p_put_pages from proprietary module **nvidia**.
[ 2.361621] **nvidia**_fs: Unknown symbol **nvidia**_p2p_put_pages (err -2)
[ 2.361625] **nvidia**_fs: module using GPL-only symbols uses symbols **nvidia**_p2p_dma_map_pages from proprietary module **nvidia**.
[ 2.361625] **nvidia**_fs: Unknown symbol **nvidia**_p2p_dma_map_pages (err -2)
[ 2.361632] **nvidia**_fs: module using GPL-only symbols uses symbols **nvidia**_p2p_free_dma_mapping from proprietary module **nvidia**.
[ 2.361632] **nvidia**_fs: Unknown symbol **nvidia**_p2p_free_dma_mapping (err -2)
[ 2.361633] **nvidia**_fs: module using GPL-only symbols uses symbols **nvidia**_p2p_free_page_table from proprietary module **nvidia**.
[ 2.361633] **nvidia**_fs: Unknown symbol **nvidia**_p2p_free_page_table (err -2)
[ 2.361637] **nvidia**_fs: module using GPL-only symbols uses symbols **nvidia**_p2p_get_pages_persistent from proprietary module **nvidia**.
[ 2.361638] **nvidia**_fs: Unknown symbol **nvidia**_p2p_get_pages_persistent (err -2)
[ 2.388921] **nvidia**-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms 570.158.01 Mon Jun 2 13:01:45 UTC 2025
[ 2.391761] [drm] [**nvidia**-drm] [GPU ID 0x00000100] Loading driver
[ 2.488819] [drm:nv_drm_load [**nvidia**_drm]] *ERROR* [**nvidia**-drm] [GPU ID 0x00000100] Failed to allocate NvKmsKapiDevice
[ 2.488914] [drm:nv_drm_register_drm_device [**nvidia**_drm]] *ERROR* [**nvidia**-drm] [GPU ID 0x00000100] Failed to register device
[ 18.420023] **nvidia**_uvm: module uses symbols nvUvmInterfaceDisableAccessCntr from proprietary module **nvidia**, inheriting taint.

The other errors, I found are

> grep ERROR nvidia-bug-report.log
2025-06-27T13:21:00.272511+02:00 shady kernel: [drm:nv_drm_load [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to allocate NvKmsKapiDevice
2025-06-27T13:21:00.272511+02:00 shady kernel: [drm:nv_drm_register_drm_device [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to register device
2025-06-27T13:44:59.185374+02:00 shady kernel: [drm:nv_drm_load [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to allocate NvKmsKapiDevice
2025-06-27T13:44:59.185374+02:00 shady kernel: [drm:nv_drm_register_drm_device [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to register device
2025-06-27T14:07:26.737065+02:00 shady kernel: [drm:nv_drm_load [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to allocate NvKmsKapiDevice
2025-06-27T14:07:26.737065+02:00 shady kernel: [drm:nv_drm_register_drm_device [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to register device
2025-06-27T14:16:11.134795+02:00 shady kernel: [drm:nv_drm_load [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to allocate NvKmsKapiDevice
2025-06-27T14:16:11.134796+02:00 shady kernel: [drm:nv_drm_register_drm_device [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to register device
[ 2.488819] kernel: [drm:nv_drm_load [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to allocate NvKmsKapiDevice
[ 2.488914] kernel: [drm:nv_drm_register_drm_device [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to register device
Jun 27 14:16:10 shady kernel: [drm:nv_drm_load [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to allocate NvKmsKapiDevice
Jun 27 14:16:10 shady kernel: [drm:nv_drm_register_drm_device [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to register device
Jun 27 14:07:26 shady kernel: [drm:nv_drm_load [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to allocate NvKmsKapiDevice
Jun 27 14:07:26 shady kernel: [drm:nv_drm_register_drm_device [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to register device
Jun 27 13:44:58 shady kernel: [drm:nv_drm_load [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to allocate NvKmsKapiDevice
Jun 27 13:44:58 shady kernel: [drm:nv_drm_register_drm_device [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to register device
[ 2.488819] [drm:nv_drm_load [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to allocate NvKmsKapiDevice
[ 2.488914] [drm:nv_drm_register_drm_device [nvidia_drm]] ***ERROR*** [nvidia-drm] [GPU ID 0x00000100] Failed to register device
**ERROR**: A supplied argument is invalid

Thanks a lot for help! I have no further idea on what to do …

nvidia-bug-report.log (896.6 KB)

As the log indicates when reading it, I had to install the open driver, with them nvidia-smi works.