Failed to start NVIDIA Persistence Daemon and nvidia-smi cannot communicate with driver on Ubuntu 22.04 (RTX 5000 Ada)

Hello,

I am experiencing an issue with my NVIDIA RTX 5000 Ada GPU on Ubuntu 22.04. When the system starts, I see the following error:

failed to start nvidia persistence daemon

Additionally, when I run nvidia-smi, I get:

nvidia-smi has failed because it couldn't communicate with the NVIDIA driver

Here are the details of my environment:

  • GPU: NVIDIA RTX 5000 Ada

  • OS: Ubuntu 22.04

  • Drivers tested: 535 and 570 (both produce the same issue)

  • Secure Boot: Disabled in BIOS

I have tried reinstalling the drivers multiple times, purging old versions, and rebooting, but the problem persists. lsmod | grep nvidia shows nothing, and modprobe nvidia returns no such device. Secure Boot is confirmed off.

Could you please advise what might be causing this issue and how to resolve it?

Thank you.

1 Like

Same here for a different configuration.

OS: Ubuntu 24.04

GPU: NVIDIA GeForce RTX 4070 Laptop GPU

Tested Nvidia drivers: 575-open and 580-open (recommanded by ubuntu-drivers)

Secure boot: disabled

After purging every packages linked to nvidia, a new nvidia driver installation works the first boot, then the next boots fail.

The nouveau driver works (not so well, but at least booting does not fail).

I also tested blacklisting nouveau for systemd services, but booting still fails.

Working first boot logs for Nvidia:

$ cat 2026-01-09_15_04_boot_logs.log | grep nvidia
janv. 09 15:01:55 deputy-p60 kernel: nvidia: module verification failed: signature and/or required key missing - tainting kernel
janv. 09 15:01:55 deputy-p60 kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 509
janv. 09 15:01:55 deputy-p60 kernel: nvidia 0000:01:00.0: enabling device (0000 -> 0003)
janv. 09 15:01:55 deputy-p60 kernel: nvidia 0000:01:00.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=none
janv. 09 15:01:56 deputy-p60 kernel: nvidia-modeset: Loading NVIDIA UNIX Open Kernel Mode Setting Driver for x86_64  580.95.05  Release Build  (dvs-builder@U22-I3-B17-02-5)  Tue Sep 23 09:42:01 UTC 2025
janv. 09 15:01:56 deputy-p60 kernel: [drm] [nvidia-drm] [GPU ID 0x00000100] Loading driver
janv. 09 15:01:56 deputy-p60 kernel: Modules linked in: snd_soc_hda_codec mac80211(+) snd_hda_ext_core kvm snd_soc_core snd_hda_codec_hdmi nvidia_drm(OE+) libarc4 irqbypass snd_compress polyval_clmulni ac97_bus btusb polyval_generic snd_pcm_dmaengine ghash_clmulni_intel btrtl sha256_ssse3 processor_thermal_device_pci cmdlinepart uvcvideo sha1_ssse3 processor_thermal_device btintel snd_hda_intel aesni_intel processor_thermal_wt_hint spi_nor processor_thermal_rfim mei_pxp mei_hdcp btbcm videobuf2_vmalloc snd_intel_dspcfg nvidia_modeset(OE) iwlwifi crypto_simd uvc mtd ee1004 btmtk snd_intel_sdw_acpi processor_thermal_rapl intel_rapl_msr i915(+) nvidia(OE) videobuf2_memops snd_hda_codec mei_me cryptd intel_rapl_common i2c_i801 spi_intel_pci videobuf2_v4l2 rapl i2c_smbus drm_buddy processor_thermal_wt_req snd_hda_core cfg80211 bluetooth videobuf2_common intel_cstate spi_intel mxm_wmi mei i2c_mux drm_ttm_helper processor_thermal_power_floor snd_hwdep ttm processor_thermal_mbox drm_display_helper cec intel_pmc_core int3403_thermal
janv. 09 15:01:57 deputy-p60 systemd[1]: Starting nvidia-persistenced.service - NVIDIA Persistence Daemon...
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1753]: Verbose syslog connection opened
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1753]: Now running with user ID 122 and group ID 124
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1753]: Started (1753)
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1753]: Failed to query NVIDIA devices. Please ensure that the NVIDIA device files (/dev/nvidia*) exist, and that user 122 has read and write permissions for those files.
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1753]: PID file unlocked.
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1752]: nvidia-persistenced failed to initialize. Check syslog for more details.
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1753]: PID file closed.
janv. 09 15:01:57 deputy-p60 systemd[1]: nvidia-persistenced.service: Control process exited, code=exited, status=1/FAILURE
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1753]: The daemon no longer has permission to remove its runtime data directory /var/run/nvidia-persistenced
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1753]: Shutdown (1753)
janv. 09 15:01:57 deputy-p60 systemd[1]: nvidia-persistenced.service: Failed with result 'exit-code'.
janv. 09 15:01:57 deputy-p60 systemd[1]: Failed to start nvidia-persistenced.service - NVIDIA Persistence Daemon.
janv. 09 15:01:57 deputy-p60 kernel: [drm] Initialized nvidia-drm 0.0.0 for 0000:01:00.0 on minor 2
janv. 09 15:01:57 deputy-p60 kernel: nvidia 0000:01:00.0: [drm] Cannot find any crtc or sizes
janv. 09 15:01:57 deputy-p60 systemd[1]: Starting nvidia-persistenced.service - NVIDIA Persistence Daemon...
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1870]: Verbose syslog connection opened
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1870]: Now running with user ID 122 and group ID 124
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1870]: Started (1870)
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1870]: device 0000:01:00.0 - registered
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1870]: Local RPC services initialized
janv. 09 15:01:57 deputy-p60 systemd[1]: Started nvidia-persistenced.service - NVIDIA Persistence Daemon.
janv. 09 15:01:58 deputy-p60 /usr/libexec/gdm-x-session[2070]: (**) OutputClass "nvidia" ModulePath extended to "/usr/lib/x86_64-linux-gnu/nvidia/xorg,/usr/lib/xorg/modules"
janv. 09 15:01:58 deputy-p60 /usr/libexec/gdm-x-session[2070]: (II) Applying OutputClass "nvidia" to /dev/dri/card2
janv. 09 15:01:58 deputy-p60 /usr/libexec/gdm-x-session[2070]:         loading driver: nvidia
janv. 09 15:01:58 deputy-p60 /usr/libexec/gdm-x-session[2070]: (==) Matched nvidia as autoconfigured driver 0
janv. 09 15:01:58 deputy-p60 /usr/libexec/gdm-x-session[2070]: (II) LoadModule: "nvidia"
janv. 09 15:01:58 deputy-p60 /usr/libexec/gdm-x-session[2070]: (II) Loading /usr/lib/x86_64-linux-gnu/nvidia/xorg/nvidia_drv.so
janv. 09 15:01:58 deputy-p60 /usr/libexec/gdm-x-session[2070]: (II) Module nvidia: vendor="NVIDIA Corporation"
janv. 09 15:01:58 deputy-p60 /usr/libexec/gdm-x-session[2070]: (II) Applying OutputClass "nvidia" options to /dev/dri/card2
janv. 09 15:01:58 deputy-p60 /usr/libexec/gdm-x-session[2070]: (II) Loading sub module "glxserver_nvidia"
janv. 09 15:01:58 deputy-p60 /usr/libexec/gdm-x-session[2070]: (II) LoadModule: "glxserver_nvidia"
janv. 09 15:01:58 deputy-p60 /usr/libexec/gdm-x-session[2070]: (II) Loading /usr/lib/x86_64-linux-gnu/nvidia/xorg/libglxserver_nvidia.so
janv. 09 15:01:58 deputy-p60 /usr/libexec/gdm-x-session[2070]: (II) Module glxserver_nvidia: vendor="NVIDIA Corporation"
janv. 09 15:01:59 deputy-p60 /usr/libexec/gdm-x-session[2070]: (WW) NVIDIA:     '/var/run/nvidia-xdriver-3f32ae9f' Permission denied
janv. 09 15:01:59 deputy-p60 /usr/libexec/gdm-x-session[2070]: (II) NVIDIA(G0): [DRI2]   VDPAU driver: nvidia
janv. 09 15:02:25 deputy-p60 /usr/libexec/gdm-x-session[2955]: (**) OutputClass "nvidia" ModulePath extended to "/usr/lib/x86_64-linux-gnu/nvidia/xorg,/usr/lib/xorg/modules"
janv. 09 15:02:25 deputy-p60 /usr/libexec/gdm-x-session[2955]: (II) Applying OutputClass "nvidia" to /dev/dri/card2
janv. 09 15:02:25 deputy-p60 /usr/libexec/gdm-x-session[2955]:         loading driver: nvidia
janv. 09 15:02:25 deputy-p60 /usr/libexec/gdm-x-session[2955]: (==) Matched nvidia as autoconfigured driver 0
janv. 09 15:02:25 deputy-p60 /usr/libexec/gdm-x-session[2955]: (II) LoadModule: "nvidia"
janv. 09 15:02:25 deputy-p60 /usr/libexec/gdm-x-session[2955]: (II) Loading /usr/lib/x86_64-linux-gnu/nvidia/xorg/nvidia_drv.so
janv. 09 15:02:25 deputy-p60 /usr/libexec/gdm-x-session[2955]: (II) Module nvidia: vendor="NVIDIA Corporation"
janv. 09 15:02:25 deputy-p60 /usr/libexec/gdm-x-session[2955]: (II) Applying OutputClass "nvidia" options to /dev/dri/card2
janv. 09 15:02:25 deputy-p60 /usr/libexec/gdm-x-session[2955]: (II) Loading sub module "glxserver_nvidia"
janv. 09 15:02:25 deputy-p60 /usr/libexec/gdm-x-session[2955]: (II) LoadModule: "glxserver_nvidia"
janv. 09 15:02:25 deputy-p60 /usr/libexec/gdm-x-session[2955]: (II) Loading /usr/lib/x86_64-linux-gnu/nvidia/xorg/libglxserver_nvidia.so
janv. 09 15:02:25 deputy-p60 /usr/libexec/gdm-x-session[2955]: (II) Module glxserver_nvidia: vendor="NVIDIA Corporation"
janv. 09 15:02:25 deputy-p60 /usr/libexec/gdm-x-session[2955]: (WW) NVIDIA:     '/var/run/nvidia-xdriver-e223af8c' Permission denied
janv. 09 15:02:25 deputy-p60 /usr/libexec/gdm-x-session[2955]: (II) NVIDIA(G0): [DRI2]   VDPAU driver: nvidia
janv. 09 15:02:26 deputy-p60 systemd[2798]: Started app-gnome-nvidia\x2dsettings\x2dautostart-3531.scope - Application launched by gnome-session-binary.
janv. 09 15:03:55 deputy-p60 tracker-miner-f[3789]: Could not execute sparql: Unable to insert multiple values on single valued property `nfo:belongsToContainer' for resource file:///media/profchep/SHARED/Desktop/2026-01-09_14_59_systemctl_status_nvidia-persistenced_service.log (old_value: '622', new value: '1068')
janv. 09 15:03:59 deputy-p60 tracker-miner-f[3789]: Could not execute sparql: Unable to insert multiple values on single valued property `nfo:belongsToContainer' for resource file:///media/profchep/SHARED/Desktop/2026-01-09_15_02_systemctl_status_nvidia-persistenced_service.log (old_value: '622', new value: '1068')

systemctl status nvidia-persistencedfor working boot:

â—Ź nvidia-persistenced.service - NVIDIA Persistence Daemon
     Loaded: loaded (/usr/lib/systemd/system/nvidia-persistenced.service; static)
     Active: active (running) since Fri 2026-01-09 15:01:57 PST; 1min 0s ago
    Process: 1855 ExecStart=/usr/bin/nvidia-persistenced --user nvidia-persistenced --no-persistence-mode --verbose (code=exited, status=0/SUCCESS)
   Main PID: 1870 (nvidia-persiste)
      Tasks: 1 (limit: 37970)
     Memory: 292.0K (peak: 1.0M)
        CPU: 1ms
     CGroup: /system.slice/nvidia-persistenced.service
             └─1870 /usr/bin/nvidia-persistenced --user nvidia-persistenced --no-persistence-mode --verbose

janv. 09 15:01:57 deputy-p60 systemd[1]: Starting nvidia-persistenced.service - NVIDIA Persistence Daemon...
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1870]: Verbose syslog connection opened
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1870]: Now running with user ID 122 and group ID 124
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1870]: Started (1870)
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1870]: device 0000:01:00.0 - registered
janv. 09 15:01:57 deputy-p60 nvidia-persistenced[1870]: Local RPC services initialized
janv. 09 15:01:57 deputy-p60 systemd[1]: Started nvidia-persistenced.service - NVIDIA Persistence Daemon.

Failing second boot logs for Nvidia:

$ cat fail_boot_logs.log | grep nvidia
janv. 09 20:13:40 deputy-p60 kernel: nvidia: module verification failed: signature and/or required key missing - tainting kernel
janv. 09 20:13:40 deputy-p60 kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 509
janv. 09 20:13:40 deputy-p60 kernel: nvidia 0000:01:00.0: enabling device (0000 -> 0003)
janv. 09 20:13:40 deputy-p60 kernel: nvidia 0000:01:00.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=none
janv. 09 20:13:40 deputy-p60 kernel: nvidia-modeset: Loading NVIDIA UNIX Open Kernel Mode Setting Driver for x86_64  580.95.05  Release Build  (dvs-builder@U22-I3-B17-02-5)  Tue Sep 23 09:42:01 UTC 2025
janv. 09 20:13:40 deputy-p60 kernel: [drm] [nvidia-drm] [GPU ID 0x00000100] Loading driver
janv. 09 20:13:42 deputy-p60 systemd[1]: Starting nvidia-persistenced.service - NVIDIA Persistence Daemon...
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1244]: Verbose syslog connection opened
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1244]: Now running with user ID 122 and group ID 124
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1244]: Started (1244)
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1244]: Failed to query NVIDIA devices. Please ensure that the NVIDIA device files (/dev/nvidia*) exist, and that user 122 has read and write permissions for those files.
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1244]: PID file unlocked.
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1243]: nvidia-persistenced failed to initialize. Check syslog for more details.
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1244]: PID file closed.
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1244]: The daemon no longer has permission to remove its runtime data directory /var/run/nvidia-persistenced
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1244]: Shutdown (1244)
janv. 09 20:13:42 deputy-p60 systemd[1]: nvidia-persistenced.service: Control process exited, code=exited, status=1/FAILURE
janv. 09 20:13:42 deputy-p60 systemd[1]: nvidia-persistenced.service: Failed with result 'exit-code'.
janv. 09 20:13:42 deputy-p60 systemd[1]: Failed to start nvidia-persistenced.service - NVIDIA Persistence Daemon.
janv. 09 20:14:39 deputy-p60 (udev-worker)[558]: nvidia: Spawned process '/sbin/modprobe nvidia-drm' [920] is taking longer than 59s to complete
janv. 09 20:14:39 deputy-p60 systemd-udevd[513]: nvidia: Worker [558] processing SEQNUM=5298 is taking a long time
janv. 09 20:16:39 deputy-p60 (udev-worker)[558]: nvidia: Spawned process '/sbin/modprobe nvidia-drm' [920] timed out after 2min 59s, killing
janv. 09 20:16:39 deputy-p60 systemd-udevd[513]: nvidia: Worker [558] processing SEQNUM=5298 killed
janv. 09 20:16:39 deputy-p60 systemd-udevd[513]: nvidia: Worker [558] terminated by signal 9 (KILL).
janv. 09 20:17:42 deputy-p60 kernel:  os_acquire_rwlock_write+0x31/0x50 [nvidia]
janv. 09 20:17:42 deputy-p60 kernel:  portSyncRwLockAcquireWrite+0x10/0x40 [nvidia]
janv. 09 20:17:42 deputy-p60 kernel:  rmapiLockAcquire+0x294/0x360 [nvidia]
janv. 09 20:17:42 deputy-p60 kernel:  kgspInitRm_IMPL+0xcad/0x1680 [nvidia]
janv. 09 20:17:42 deputy-p60 kernel:  RmInitAdapter+0xff2/0x1e40 [nvidia]
janv. 09 20:17:42 deputy-p60 kernel:  rm_init_adapter+0xad/0xc0 [nvidia]
janv. 09 20:17:42 deputy-p60 kernel:  nv_open_device+0x222/0xa80 [nvidia]
janv. 09 20:17:42 deputy-p60 kernel:  nvidia_dev_get+0x68/0xd0 [nvidia]
janv. 09 20:17:42 deputy-p60 kernel:  nvkms_open_gpu+0x4f/0x90 [nvidia_modeset]
janv. 09 20:17:42 deputy-p60 kernel:  AllocateDevice+0x48/0x760 [nvidia_modeset]
janv. 09 20:17:42 deputy-p60 kernel:  ? nv_drm_dev_load+0x112/0xd10 [nvidia_drm]
janv. 09 20:17:42 deputy-p60 kernel:  ? __pfx_nv_drm_event_callback+0x10/0x10 [nvidia_drm]
janv. 09 20:17:42 deputy-p60 kernel:  nv_drm_register_drm_device+0xb0/0x230 [nvidia_drm]
janv. 09 20:17:42 deputy-p60 kernel:  ? __pfx_nv_drm_register_drm_device+0x10/0x10 [nvidia_drm]
janv. 09 20:17:42 deputy-p60 kernel:  EnumerateGpus+0x150/0x160 [nvidia_modeset]
janv. 09 20:17:42 deputy-p60 kernel:  ? nv_drm_probe_devices+0x66/0xa0 [nvidia_drm]
janv. 09 20:17:42 deputy-p60 kernel:  ? __pfx_nv_linux_drm_init+0x10/0x10 [nvidia_drm]
janv. 09 20:17:42 deputy-p60 kernel:  ? nv_drm_init+0x31/0x70 [nvidia_drm]
janv. 09 20:17:42 deputy-p60 kernel:  ? nv_linux_drm_init+0xe/0xff0 [nvidia_drm]

systemctl status nvidia-persistencedfor failing boot:

Ă— nvidia-persistenced.service - NVIDIA Persistence Daemon
     Loaded: loaded (/usr/lib/systemd/system/nvidia-persistenced.service; static)
     Active: failed (Result: exit-code) since Fri 2026-01-09 20:13:42 PST; 4min 59s ago
    Process: 1243 ExecStart=/usr/bin/nvidia-persistenced --user nvidia-persistenced --no-persistence-mode --verbose (code=exited, status=1/FAILURE)
    Process: 1246 ExecStopPost=/bin/rm -rf /var/run/nvidia-persistenced (code=exited, status=0/SUCCESS)
        CPU: 4ms

janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1244]: Started (1244)
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1244]: Failed to query NVIDIA devices. Please ensure that the NVIDIA device files (/dev/nvidia*) exist, and that user 122 has read and write permissions for those files.
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1244]: PID file unlocked.
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1243]: nvidia-persistenced failed to initialize. Check syslog for more details.
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1244]: PID file closed.
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1244]: The daemon no longer has permission to remove its runtime data directory /var/run/nvidia-persistenced
janv. 09 20:13:42 deputy-p60 nvidia-persistenced[1244]: Shutdown (1244)
janv. 09 20:13:42 deputy-p60 systemd[1]: nvidia-persistenced.service: Control process exited, code=exited, status=1/FAILURE
janv. 09 20:13:42 deputy-p60 systemd[1]: nvidia-persistenced.service: Failed with result 'exit-code'.
janv. 09 20:13:42 deputy-p60 systemd[1]: Failed to start nvidia-persistenced.service - NVIDIA Persistence Daemon.

Same dpkg –list | grep ii^.*nvidia for both:

ii  libnvidia-cfg1-580:amd64                         580.95.05-0ubuntu0.24.04.2                           amd64        NVIDIA binary OpenGL/GLX configuration library
ii  libnvidia-common-580                             580.95.05-0ubuntu0.24.04.2                           all          Shared files used by the NVIDIA libraries
ii  libnvidia-compute-580:amd64                      580.95.05-0ubuntu0.24.04.2                           amd64        NVIDIA libcompute package
ii  libnvidia-compute-580:i386                       580.95.05-0ubuntu0.24.04.2                           i386         NVIDIA libcompute package
ii  libnvidia-decode-580:amd64                       580.95.05-0ubuntu0.24.04.2                           amd64        NVIDIA Video Decoding runtime libraries
ii  libnvidia-decode-580:i386                        580.95.05-0ubuntu0.24.04.2                           i386         NVIDIA Video Decoding runtime libraries
ii  libnvidia-egl-wayland1:amd64                     1:1.1.17-0ubuntu0~gpu24.04.1                         amd64        Wayland EGL External Platform library -- shared library
ii  libnvidia-egl-wayland1:i386                      1:1.1.17-0ubuntu0~gpu24.04.1                         i386         Wayland EGL External Platform library -- shared library
ii  libnvidia-encode-580:amd64                       580.95.05-0ubuntu0.24.04.2                           amd64        NVENC Video Encoding runtime library
ii  libnvidia-encode-580:i386                        580.95.05-0ubuntu0.24.04.2                           i386         NVENC Video Encoding runtime library
ii  libnvidia-extra-580:amd64                        580.95.05-0ubuntu0.24.04.2                           amd64        Extra libraries for the NVIDIA driver
ii  libnvidia-fbc1-580:amd64                         580.95.05-0ubuntu0.24.04.2                           amd64        NVIDIA OpenGL-based Framebuffer Capture runtime library
ii  libnvidia-fbc1-580:i386                          580.95.05-0ubuntu0.24.04.2                           i386         NVIDIA OpenGL-based Framebuffer Capture runtime library
ii  libnvidia-gl-580:amd64                           580.95.05-0ubuntu0.24.04.2                           amd64        NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
ii  libnvidia-gl-580:i386                            580.95.05-0ubuntu0.24.04.2                           i386         NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
ii  nvidia-compute-utils-580                         580.95.05-0ubuntu0.24.04.2                           amd64        NVIDIA compute utilities
ii  nvidia-dkms-580-open                             580.95.05-0ubuntu0.24.04.2                           amd64        NVIDIA DKMS package (open kernel module)
ii  nvidia-driver-580-open                           580.95.05-0ubuntu0.24.04.2                           amd64        NVIDIA driver (open kernel) metapackage
ii  nvidia-firmware-580-580.95.05                    580.95.05-0ubuntu0.24.04.2                           amd64        Firmware files used by the kernel module
ii  nvidia-kernel-common-580                         580.95.05-0ubuntu0.24.04.2                           amd64        Shared files used with the kernel module
ii  nvidia-kernel-source-580-open                    580.95.05-0ubuntu0.24.04.2                           amd64        NVIDIA kernel source package
ii  nvidia-prime                                     0.8.17.2                                             all          Tools to enable NVIDIA's Prime
ii  nvidia-settings                                  510.47.03-0ubuntu4                                   amd64        Tool for configuring the NVIDIA graphics driver
ii  nvidia-utils-580                                 580.95.05-0ubuntu0.24.04.2                           amd64        NVIDIA driver support binaries
ii  screen-resolution-extra                          0.18.3                                               all          Extension for the nvidia-settings control panel
ii  xserver-xorg-video-nvidia-580                    580.95.05-0ubuntu0.24.04.2                           amd64        NVIDIA binary Xorg driver

bug report file? logs excerpts? lspci output? driver packaging method?

Hello, I answered this post with logs because I have the same problem. Unfortunately, the Askimet automatic spam filter hid temporarily my post, it should come back soon.

My previous message is not hidden anymore.

Concerning lspci, NVIDIA card is identified:

00:00.0 Host bridge: Intel Corporation Device 4649 (rev 02)
00:01.0 PCI bridge: Intel Corporation 12th Gen Core Processor PCI Express x16 Controller #1 (rev 02)
00:02.0 VGA compatible controller: Intel Corporation Alder Lake-P GT1 \[UHD Graphics\] (rev 0c)
00:04.0 Signal processing controller: Intel Corporation Alder Lake Innovation Platform Framework Processor Participant (rev 02)
00:06.0 PCI bridge: Intel Corporation 12th Gen Core Processor PCI Express x4 Controller #0 (rev 02)
00:08.0 System peripheral: Intel Corporation 12th Gen Core Processor Gaussian & Neural Accelerator (rev 02)
00:0a.0 Signal processing controller: Intel Corporation Platform Monitoring Technology (rev 01)
00:0d.0 USB controller: Intel Corporation Alder Lake-P Thunderbolt 4 USB Controller (rev 02)
00:14.0 USB controller: Intel Corporation Alder Lake PCH USB 3.2 xHCI Host Controller (rev 01)
00:14.2 RAM memory: Intel Corporation Alder Lake PCH Shared SRAM (rev 01)
00:14.3 Network controller: Intel Corporation Alder Lake-P PCH CNVi WiFi (rev 01)
00:15.0 Serial bus controller: Intel Corporation Alder Lake PCH Serial IO I2C Controller #0 (rev 01)
00:16.0 Communication controller: Intel Corporation Alder Lake PCH HECI Controller (rev 01)
00:1c.0 PCI bridge: Intel Corporation Device 51bc (rev 01)
00:1d.0 PCI bridge: Intel Corporation Alder Lake PCI Express Root Port #9 (rev 01)
00:1d.2 PCI bridge: Intel Corporation Device 51b2 (rev 01)
00:1f.0 ISA bridge: Intel Corporation Alder Lake PCH eSPI Controller (rev 01)
00:1f.3 Audio device: Intel Corporation Alder Lake PCH-P High Definition Audio Controller (rev 01)
00:1f.4 SMBus: Intel Corporation Alder Lake PCH-P SMBus Host Controller (rev 01)
00:1f.5 Serial bus controller: Intel Corporation Alder Lake-P PCH SPI Controller (rev 01)
01:00.0 VGA compatible controller: NVIDIA Corporation AD106M \[GeForce RTX 4070 Max-Q / Mobile\] (rev a1)
01:00.1 Audio device: NVIDIA Corporation Device 22bd (rev a1)
02:00.0 Non-Volatile memory controller: Sandisk Corp WD Black SN770 / PC SN740 256GB / PC SN560 (DRAM-less) NVMe SSD (rev 01)
03:00.0 Non-Volatile memory controller: Micron Technology Inc 2300 NVMe SSD \[Santana\]
04:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet Controller (rev 15)
05:00.0 SD Host controller: O2 Micro, Inc. SD/MMC Card Reader Controller (rev 01)

I was not able to generate a nvidia report via the script nvidia-bug-report.sh because the process did not terminate and was stuck.

My wild guess is that this is related to some module options in /etc/modprobe.d that are not present there yet when the modules are loaded right after the installation. In particular playing with modeset would be my first action.

Other than that, you are trying drivers from Ubuntu repos, right? Have you tried the ones from Nvidia DC repos?

Hello @morgwai666, thank you for your answers and for your time.

playing with modeset would be my first action

Do you mean disabling the persistenced mode?

you are trying drivers from Ubuntu repos, right?

Yes

Have you tried the ones from Nvidia DC repos?

Not yet.

I succeeded again to boot after another clean install of the 580-open metapackage.
I will try to reboot and see if I still have the problem as the last times.
If I have again boot issues, I will try to install the driver from the NVIDIA DC repos.
Note that I also tried to install the non-open package via apt install, without boot success…

Before things go wrong again, I would like to share some logs of the first (working) boot after a new clean install:

$ cat /var/log/gpu-manager.log
log_file: /var/log/gpu-manager.log
last_boot_file: /var/lib/ubuntu-drivers-common/last_gfx_boot
new_boot_file: /var/lib/ubuntu-drivers-common/last_gfx_boot
can't access /opt/amdgpu-pro/bin/amdgpu-pro-px
Looking for nvidia modules in /lib/modules/6.14.0-32-generic/kernel
Looking for nvidia modules in /lib/modules/6.14.0-32-generic/updates/dkms
Found nvidia.ko module in /lib/modules/6.14.0-32-generic/updates/dkms/nvidia.ko.zst
Looking for amdgpu modules in /lib/modules/6.14.0-32-generic/kernel
Looking for amdgpu modules in /lib/modules/6.14.0-32-generic/updates/dkms
Is nvidia loaded? yes
Was nvidia unloaded? no
Is nvidia blacklisted? no
Is intel loaded? yes
Is radeon loaded? no
Is radeon blacklisted? no
Is amdgpu loaded? no
Is amdgpu blacklisted? no
Is amdgpu versioned? no
Is amdgpu pro stack? no
Is nouveau loaded? no
Is nouveau blacklisted? yes
Is nvidia kernel module available? yes
Is amdgpu kernel module available? no
Vendor/Device Id: 8086:46a3
BusID "PCI:0@0:2:0"
Is boot vga? yes
Vendor/Device Id: 10de:2820
BusID "PCI:1@0:0:0"
Is boot vga? no
Chassis type: "10"
Laptop detected
/etc/u-d-c-nvidia-runtimepm-override found. Will try runtimepm if the kernel supports it.
Linux 6.14 detected.
Is nvidia runtime pm supported for "0x2820"? yes
Trying to create new file: /run/nvidia_runtimepm_supported
Checking power status in /proc/driver/nvidia/gpus/0000:01:00.0/power
Runtime D3 status:          Enabled (fine-grained)
Is nvidia runtime pm enabled for "0x2820"? yes
Trying to create new file: /run/nvidia_runtimepm_enabled
can't access /run/u-d-c-nvidia-drm-was-loaded file
can't access /run/u-d-c-nvidia-drm-was-loaded file
can't access /run/u-d-c-nvidia-drm-was-loaded file
can't access /run/u-d-c-nvidia-drm-was-loaded file
can't access /run/u-d-c-nvidia-drm-was-loaded file
can't access /run/u-d-c-nvidia-drm-was-loaded file
can't access /run/u-d-c-nvidia-drm-was-loaded file
can't access /run/u-d-c-nvidia-drm-was-loaded file
can't access /run/u-d-c-nvidia-drm-was-loaded file
can't access /run/u-d-c-nvidia-drm-was-loaded file
can't access /run/u-d-c-nvidia-drm-was-loaded file
can't access /run/u-d-c-nvidia-drm-was-loaded file
can't access /run/u-d-c-nvidia-drm-was-loaded file
Takes 260ms to wait for nvidia udev rules completed.
Skipping "/dev/dri/card1", driven by "nvidia-drm"
Skipping "/dev/dri/card2", driven by "i915"
Skipping "/dev/dri/card1", driven by "nvidia-drm"
Skipping "/dev/dri/card2", driven by "i915"
Skipping "/dev/dri/card1", driven by "nvidia-drm"
Skipping "/dev/dri/card2", driven by "i915"
Skipping "/dev/dri/card1", driven by "nvidia-drm"
Found "/dev/dri/card2", driven by "i915"
output 0:
        card2-eDP-1
Number of connected outputs for /dev/dri/card2: 1
Does it require offloading? no
last cards number = 2
Has amd? no
Has intel? yes
Has nvidia? yes
How many cards? 2
Has the system changed? No
Intel IGP detected
NVIDIA hybrid system
Creating /usr/share/X11/xorg.conf.d/11-nvidia-offload.conf
Setting power control to "auto" in /sys/bus/pci/devices/0000:01:00.0/power/control

and attached the nvidia-bug-report.shresult

nvidia-bug-report.log.gz (456.5 KB)

no no: I mean modeset option to nvidia-drm module (on or off). grep through the files in /etc/modprobe.d/ to see if it is explicitly set to any value and try changing it. If it’s not explicitly set then the default is off, so in such case try enabling it.
If this doesn’t help then try playing with other options defined in /etc/modprobe.d/ for any NV module.

I’ve just had a look at your bug report file and it turns you have modeset enabled in the grub config:

2026-01-12T12:18:47.490486-08:00 deputy-p60 kernel: Command line: BOOT_IMAGE=/boot/vmlinuz-6.14.0-32-generic root=UUID=4a8c1557-dddd-4d2e-b1bc-8d04474ddeb1 ro rd.driver.blacklist=nouveau modprobe.blacklist=nouveau nvidia-drm.modeset=1

So to disable it, you need to edit probably /etc/default/grub (at least on Debian, probably the same on Ubuntu but not 100% sure).

you need to edit probably /etc/default/grub

Effectively, I can probably edit the file by editing the variable GRUB_CMDLINE_LINUX=“” to GRUB_CMDLINE_LINUX=“nvidia-drm.modeset=0”(see https://askubuntu.com/a/780545/1077387 to know the difference between GRUB_CMDLINE_LINUX and GRUB_CMDLINE_LINUX_DEFAULT)

Is modifying the value options nvidia_drm modeset=0 in /etc/modprobe.d/nvidia-graphics-drivers-kms.confnot sufficient?

Probably not: the fact that it is on kernel’s command line suggests that Ubuntu’s packaging also added NV modules to your initrd and the modules are loaded before even the rootFS is mounted. Generally you should set it in both places to the same value in case modules are removed and then loaded again.

1 Like

Set the nvidia-drm mode in /etc/default/grub as GRUB_CMDLINE_LINUX=“nvidia-drm.modeset=0”is sufficient to enable me to boot again for the linux kernel 6.14.0-32-generic.

Thank you for your help, I hope it will help also @jin1987.jin

2 Likes

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.