[560] Kubuntu 24.04.2 LTS fails to resume from sleep

Kubuntu 24.04.02 (newest at this point) fails to resume from sleep.

When I tried to resume my session there is only black screen, however system still works I think, when I disconnect or connect USB monitors, system makes standard sounds.

looking at syslogs, there is a lot of

kernel: NVRM: RmHandleDNotifierEvent: RmHandleDNotifierEvent: Failed to handle ACPI D-Notifier event, status=0x11
kernel: NVRM: rm_power_source_change_event: rm_power_source_change_event: Failed to handle Power Source change event, status=0x11

before I had to HARD RESET my PC.

And half an hour before, when PC went into sleep:

2025-03-13T14:00:09.895783+01:00 ku-xps systemd[1]: Starting nvidia-suspend.service - NVIDIA system suspend actions...
2025-03-13T14:00:09.897816+01:00 ku-xps suspend: nvidia-suspend.service
2025-03-13T14:00:09.897890+01:00 ku-xps logger[285716]: <13>Mar 13 14:00:09 suspend: nvidia-suspend.service
2025-03-13T14:00:09.926837+01:00 ku-xps wpa_supplicant[1819]: wlp0s20f3: CTRL-EVENT-DSCP-POLICY clear_all
2025-03-13T14:00:09.957911+01:00 ku-xps wpa_supplicant[1819]: wlp0s20f3: CTRL-EVENT-DSCP-POLICY clear_all
2025-03-13T14:00:09.957970+01:00 ku-xps wpa_supplicant[1819]: nl80211: deinit ifname=wlp0s20f3 disabled_11b_rates=0
2025-03-13T14:00:10.142739+01:00 ku-xps kernel: NVRM: _issueRpcAndWait: rpcSendMessage failed with status 0x00000011 for fn 10!
2025-03-13T14:00:10.142748+01:00 ku-xps kernel: NVRM: rpcRmApiFree_GSP: GspRmFree failed: hClient=0xc1d00065; hObject=0xbfef0012; paramsStatus=0x00000000; status=0x00000011
2025-03-13T14:00:10.142749+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: status == NV_OK @ rs_client.c:843
2025-03-13T14:00:10.142750+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: status == NV_OK @ rs_server.c:257
2025-03-13T14:00:10.142751+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: status == NV_OK @ rs_server.c:1287
2025-03-13T14:00:10.142751+01:00 ku-xps kernel: NVRM: _issueRpcAndWait: rpcSendMessage failed with status 0x00000011 for fn 10!
2025-03-13T14:00:10.142752+01:00 ku-xps kernel: NVRM: rpcRmApiFree_GSP: GspRmFree failed: hClient=0xc1d00065; hObject=0xbfef0029; paramsStatus=0x00000000; status=0x00000011
2025-03-13T14:00:10.142868+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: status == NV_OK @ rs_client.c:843
2025-03-13T14:00:10.142966+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: status == NV_OK @ rs_server.c:257
2025-03-13T14:00:10.142968+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: status == NV_OK @ rs_server.c:1287
2025-03-13T14:00:10.142969+01:00 ku-xps kernel: NVRM: _issueRpcAndWait: rpcSendMessage failed with status 0x00000011 for fn 10!
2025-03-13T14:00:10.142969+01:00 ku-xps kernel: NVRM: rpcRmApiFree_GSP: GspRmFree failed: hClient=0xc1d00065; hObject=0xbfef0002; paramsStatus=0x00000000; status=0x00000011
2025-03-13T14:00:10.142970+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:852
2025-03-13T14:00:10.142970+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1301
2025-03-13T14:00:10.142971+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:489
2025-03-13T14:00:10.142971+01:00 ku-xps kernel: NVRM: mmuWalkUnmap: Failed to unmap VA Range 0x12019b000 to 0x12019dfff. Status = 0x00000040
2025-03-13T14:00:10.142971+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_unmap.c:72
2025-03-13T14:00:10.142972+01:00 ku-xps kernel: NVRM: nvAssertOkFailedNoLog: Assertion failed: Generic Error: Invalid state [NV_ERR_INVALID_STATE] (0x00000040) returned from mmuWalkUnmap(userCtx.pGpuState->pWalk, vaLo, vaHi) @ gpu_vaspace.c:2288
2025-03-13T14:00:10.142973+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pEntries != NULL @ gmmu_walk.c:852
2025-03-13T14:00:10.142973+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: progress == indexHi_tmp - indexLo_tmp + 1 @ mmu_walk.c:1301
2025-03-13T14:00:10.142973+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:489
2025-03-13T14:00:10.142974+01:00 ku-xps kernel: NVRM: mmuWalkUnmap: Failed to unmap VA Range 0x12019b000 to 0x12019dfff. Status = 0x00000040
2025-03-13T14:00:10.142975+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ mmu_walk_unmap.c:72
2025-03-13T14:00:10.142975+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ gpu_vaspace.c:5080
2025-03-13T14:00:10.142975+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: pKernelBus->pReadToFlush != NULL || pKernelBus->virtualBar2[GPU_GFID_PF].pCpuMapping != NULL @ kern_bus_gv100.c:388

and some more errors from NVRM and:

2025-03-13T14:01:16.727799+01:00 ku-xps kernel: ------------[ cut here ]------------
2025-03-13T14:01:16.727808+01:00 ku-xps kernel: WARNING: CPU: 5 PID: 285719 at /var/lib/dkms/nvidia/560.35.05/build/nvidia/nv.c:4353 nv_set_system_power_state+0x30d/0x480 [nvidia]
2025-03-13T14:01:16.727809+01:00 ku-xps kernel: Modules linked in: udp_diag tcp_diag ib_core inet_diag xt_conntrack nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 bridge stp llc xfrm_user xfrm_algo xt_addrtype nft_compat nf_tables rfcomm snd_seq_dummy snd_hrtimer vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) ccm overlay qrtr cmac algif_hash algif_skcipher af_alg bnep dell_rbu snd_hda_codec_hdmi xe snd_ctl_led snd_hda_codec_realtek drm_gpuvm drm_exec snd_hda_codec_generic gpu_sched drm_suballoc_helper drm_ttm_helper sunrpc intel_uncore_frequency intel_uncore_frequency_common binfmt_misc snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel snd_sof_intel_hda_mlink soundwire_cadence x86_pkg_temp_thermal intel_powerclamp snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils snd_soc_hdac_hda snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi coretemp soundwire_generic_allocation soundwire_bus snd_soc_core snd_compress kvm_intel ac97_bus snd_pcm_dmaengine snd_hda_intel snd_intel_dspcfg kvm
2025-03-13T14:01:16.727810+01:00 ku-xps kernel:  snd_intel_sdw_acpi iwlmvm snd_usb_audio snd_hda_codec cmdlinepart spi_nor snd_hda_core snd_usbmidi_lib irqbypass rapl mac80211 mtd mei_hdcp mei_pxp snd_ump snd_hwdep intel_rapl_msr snd_pcm libarc4 uvcvideo i915 dell_laptop videobuf2_vmalloc snd_seq_midi uvc snd_seq_midi_event videobuf2_memops snd_rawmidi videobuf2_v4l2 dell_wmi btusb snd_seq hid_sensor_als btrtl videodev sch_fq_codel iwlwifi hid_sensor_trigger btintel snd_seq_device dell_smbios industrialio_triggered_buffer nls_iso8859_1 videobuf2_common btbcm intel_cstate snd_timer drm_buddy kfifo_buf dcdbas btmtk dell_wmi_sysman nvidia_drm(OE) hid_sensor_iio_common snd mei_me ttm i2c_i801 spi_intel_pci bluetooth dell_wmi_ddv dell_smm_hwmon mc firmware_attributes_class ledtrig_audio dell_wmi_descriptor wmi_bmof spi_intel nvidia_modeset(OE) i2c_smbus soundcore cfg80211 nvidia_uvm(OE) mei industrialio ecdh_generic drm_display_helper processor_thermal_device_pci cec processor_thermal_device processor_thermal_wt_hint processor_thermal_rfim dptf_power
2025-03-13T14:01:16.727810+01:00 ku-xps kernel:  processor_thermal_rapl int3403_thermal intel_rapl_common processor_thermal_wt_req intel_pmc_core processor_thermal_power_floor processor_thermal_mbox intel_vsec rc_core int340x_thermal_zone pmt_telemetry igen6_edac i2c_algo_bit pmt_class intel_hid int3400_thermal acpi_thermal_rel acpi_pad sparse_keymap acpi_tad joydev input_leds mac_hid serio_raw typec_displayport nvidia(OE) ecc msr parport_pc ppdev lp parport efi_pstore nfnetlink dmi_sysfs ip_tables x_tables autofs4 btrfs blake2b_generic xor raid6_pq libcrc32c hid_logitech_hidpp hid_logitech_dj usbhid hid_sensor_custom hid_sensor_hub intel_ishtp_hid nvme nvme_core nvme_auth hid_multitouch hid_generic ahci libahci rtsx_pci_sdmmc crct10dif_pclmul crc32_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel video sha256_ssse3 sha1_ssse3 ucsi_acpi thunderbolt psmouse typec_ucsi intel_lpss_pci rtsx_pci intel_ish_ipc intel_lpss xhci_pci intel_ishtp idma64 typec xhci_pci_renesas vmd i2c_hid_acpi i2c_hid hid wmi pinctrl_tigerlake aesni_intel crypto_simd
2025-03-13T14:01:16.727811+01:00 ku-xps kernel:  cryptd
2025-03-13T14:01:16.727812+01:00 ku-xps kernel: CPU: 5 PID: 285719 Comm: nvidia-sleep.sh Tainted: G           OEL     6.8.0-55-generic #57-Ubuntu
2025-03-13T14:01:16.727812+01:00 ku-xps kernel: Hardware name: Dell Inc. XPS 15 9520/0YD3W1, BIOS 1.29.0 12/11/2024
2025-03-13T14:01:16.727812+01:00 ku-xps kernel: RIP: 0010:nv_set_system_power_state+0x30d/0x480 [nvidia]
2025-03-13T14:01:16.727813+01:00 ku-xps kernel: Code: 24 70 06 00 00 4d 85 e4 75 d1 e9 c6 fd ff ff 0f 0b e9 00 fe ff ff 48 8b 3d 58 37 3d 00 4c 89 fe e8 c8 f0 d7 ee e9 01 ff ff ff <0f> 0b 4c 89 f7 e8 69 26 b6 ef 4d 85 ff 74 0d e8 7f 4f 12 00 84 c0
2025-03-13T14:01:16.727813+01:00 ku-xps kernel: RSP: 0018:ffffaed0ff783c80 EFLAGS: 00010206
2025-03-13T14:01:16.727814+01:00 ku-xps kernel: RAX: 0000000000000011 RBX: 0000000000000001 RCX: 0000000000000000
2025-03-13T14:01:16.727814+01:00 ku-xps kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffaed0ff783bc0
2025-03-13T14:01:16.727814+01:00 ku-xps kernel: RBP: ffffaed0ff783cb8 R08: 0000000000000000 R09: 0000000000000000
2025-03-13T14:01:16.727815+01:00 ku-xps kernel: R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000011
2025-03-13T14:01:16.727815+01:00 ku-xps kernel: R13: ffff9a492bdcf000 R14: ffff9a492bdcf648 R15: 0000000000000000
2025-03-13T14:01:16.727815+01:00 ku-xps kernel: FS:  00007a30c5cdd740(0000) GS:ffff9a506f080000(0000) knlGS:0000000000000000
2025-03-13T14:01:16.727816+01:00 ku-xps kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
2025-03-13T14:01:16.727816+01:00 ku-xps kernel: CR2: 00007a30c5c04650 CR3: 00000001a4da0000 CR4: 0000000000f50ef0
2025-03-13T14:01:16.727816+01:00 ku-xps kernel: PKRU: 55555554
2025-03-13T14:01:16.727817+01:00 ku-xps kernel: Call Trace:
2025-03-13T14:01:16.727817+01:00 ku-xps kernel:  <TASK>
2025-03-13T14:01:16.727817+01:00 ku-xps kernel:  ? show_regs+0x6d/0x80
2025-03-13T14:01:16.727818+01:00 ku-xps kernel:  ? __warn+0x89/0x160
2025-03-13T14:01:16.727818+01:00 ku-xps kernel:  ? nv_set_system_power_state+0x30d/0x480 [nvidia]
2025-03-13T14:01:16.727819+01:00 ku-xps kernel:  ? report_bug+0x17e/0x1b0
2025-03-13T14:01:16.727819+01:00 ku-xps kernel:  ? handle_bug+0x51/0xa0
2025-03-13T14:01:16.727819+01:00 ku-xps kernel:  ? exc_invalid_op+0x18/0x80
2025-03-13T14:01:16.727819+01:00 ku-xps kernel:  ? asm_exc_invalid_op+0x1b/0x20
2025-03-13T14:01:16.727820+01:00 ku-xps kernel:  ? nv_set_system_power_state+0x30d/0x480 [nvidia]
2025-03-13T14:01:16.727820+01:00 ku-xps kernel:  nv_procfs_write_suspend+0x106/0x1c0 [nvidia]
2025-03-13T14:01:16.727820+01:00 ku-xps kernel:  proc_reg_write+0x69/0xb0
2025-03-13T14:01:16.727821+01:00 ku-xps kernel:  vfs_write+0xfd/0x480
2025-03-13T14:01:16.727821+01:00 ku-xps kernel:  ksys_write+0x73/0x100
2025-03-13T14:01:16.727821+01:00 ku-xps kernel:  __x64_sys_write+0x19/0x30
2025-03-13T14:01:16.727822+01:00 ku-xps kernel:  x64_sys_call+0x7e/0x25a0
2025-03-13T14:01:16.727822+01:00 ku-xps kernel:  do_syscall_64+0x7f/0x180
2025-03-13T14:01:16.727822+01:00 ku-xps kernel:  ? syscall_exit_to_user_mode+0x86/0x260
2025-03-13T14:01:16.727823+01:00 ku-xps kernel:  ? do_syscall_64+0x8c/0x180
2025-03-13T14:01:16.727823+01:00 ku-xps kernel:  ? do_user_addr_fault+0x333/0x670
2025-03-13T14:01:16.727823+01:00 ku-xps kernel:  ? irqentry_exit_to_user_mode+0x7b/0x260
2025-03-13T14:01:16.727824+01:00 ku-xps kernel:  ? irqentry_exit+0x43/0x50
2025-03-13T14:01:16.727824+01:00 ku-xps kernel:  ? exc_page_fault+0x94/0x1b0
2025-03-13T14:01:16.727824+01:00 ku-xps kernel:  entry_SYSCALL_64_after_hwframe+0x78/0x80
2025-03-13T14:01:16.727824+01:00 ku-xps kernel: RIP: 0033:0x7a30c5b1c574
2025-03-13T14:01:16.727825+01:00 ku-xps kernel: Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d d5 ea 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89
2025-03-13T14:01:16.727825+01:00 ku-xps kernel: RSP: 002b:00007ffe8e84d098 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
2025-03-13T14:01:16.727826+01:00 ku-xps kernel: RAX: ffffffffffffffda RBX: 0000000000000008 RCX: 00007a30c5b1c574
2025-03-13T14:01:16.727826+01:00 ku-xps kernel: RDX: 0000000000000008 RSI: 0000574a125a3b80 RDI: 0000000000000001
2025-03-13T14:01:16.727826+01:00 ku-xps kernel: RBP: 00007ffe8e84d0c0 R08: 00007a30c5c03b20 R09: 0000000000000410
2025-03-13T14:01:16.727827+01:00 ku-xps kernel: R10: 0000000000000001 R11: 0000000000000202 R12: 0000000000000008
2025-03-13T14:01:16.727827+01:00 ku-xps kernel: R13: 0000574a125a3b80 R14: 00007a30c5c045c0 R15: 00007a30c5c01ee0
2025-03-13T14:01:16.727827+01:00 ku-xps kernel:  </TASK>
2025-03-13T14:01:16.727828+01:00 ku-xps kernel: ---[ end trace 0000000000000000 ]---
2025-03-13T14:01:16.727828+01:00 ku-xps kernel: ------------[ cut here ]------------

and more repeated:

2025-03-13T14:01:24.581442+01:00 ku-xps kernel: NVRM: _issueRpcAndWait: rpcSendMessage failed with status 0x00000011 for fn 76!
2025-03-13T14:01:24.581443+01:00 ku-xps kernel: NVRM: nvCheckOkFailedNoLog: Check failed: GPU not in full power [NV_ERR_GPU_NOT_FULL_POWER] (0x00000011) returned from pRmApi->Control(pRmApi, pGpu->hInternalClient, pGpu->hInternalSubdevice, NV2080_CTRL_CMD_INTERNAL_FIFO_TOGGLE_ACTIVE_CHANNEL_SCHEDULING, &fifoToggleActiveChannelSchedulingParam, sizeof(fifoToggleActiveChannelSchedulingParam)) @ kernel_fifo_init.c:142
2025-03-13T14:01:24.581444+01:00 ku-xps kernel: NVRM: gpuStateUnload_IMPL: Failed to unload engine with descriptor index: 0x4 and descriptor: 0xf3e15500
2025-03-13T14:01:24.581444+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: 0 @ gpu.c:3060
2025-03-13T14:01:24.581445+01:00 ku-xps kernel: NVRM: _issueRpcAndWait: rpcSendMessage failed with status 0x00000011 for fn 103!
2025-03-13T14:01:24.581446+01:00 ku-xps kernel: NVRM: rpcRmApiAlloc_GSP: GspRmAlloc failed: hClient=0xc1e0007e; hParent=0x00000000; hObject=0x00000000; hClass=0x00000000; paramsSize=0x0000006c; paramsStatus=0x00000000; status=0x00000011
2025-03-13T14:01:24.581447+01:00 ku-xps kernel: NVRM: _issueRpcAndWait: rpcSendMessage failed with status 0x00000011 for fn 103!
2025-03-13T14:01:24.581448+01:00 ku-xps kernel: NVRM: rpcRmApiAlloc_GSP: GspRmAlloc failed: hClient=0xc1e0007e; hParent=0xc1e0007e; hObject=0x0000000a; hClass=0x00000080; paramsSize=0x00000038; paramsStatus=0x00000000; status=0x00000011
2025-03-13T14:01:24.581449+01:00 ku-xps kernel: NVRM: _issueRpcAndWait: rpcSendMessage failed with status 0x00000011 for fn 103!
2025-03-13T14:01:24.581450+01:00 ku-xps kernel: NVRM: rpcRmApiAlloc_GSP: GspRmAlloc failed: hClient=0xc1e0007e; hParent=0x0000000a; hObject=0x0000000b; hClass=0x00002080; paramsSize=0x00000004; paramsStatus=0x00000000; status=0x00000011
2025-03-13T14:01:24.581451+01:00 ku-xps kernel: NVRM: nvCheckFailedNoLog: Check failed: pDst != NULL @ mem_utils.c:705
2025-03-13T14:01:24.581452+01:00 ku-xps kernel: NVRM: nvAssertOkFailedNoLog: Assertion failed: Ran out of a critical resource, other than memory [NV_ERR_INSUFFICIENT_RESOURCES] (0x0000001A) returned from memmgrMemSet(GPU_GET_MEMORY_MANAGER(pGpu), &dest, 0, (NvU32)memdescGetSize(pMemDesc), TRANSFER_FLAGS_NONE) @ gmmu_walk.c:98
2025-03-13T14:01:24.581452+01:00 ku-xps kernel: NVRM: memdescDestroy: Destroying unfreed memory FFFF9A4913746420
2025-03-13T14:01:24.581453+01:00 ku-xps kernel: NVRM: memdescDestroy: Please call memdescFree()
2025-03-13T14:01:24.581453+01:00 ku-xps kernel: NVRM: nvAssertFailedNoLog: Assertion failed: NV_OK == status @ mmu_walk.c:1397
2025-03-13T14:01:24.581454+01:00 ku-xps kernel: NVRM: nvAssertOkFailedNoLog: Assertion failed: Ran out of a critical resource, other than memory [NV_ERR_INSUFFICIENT_RESOURCES] (0x0000001A) returned from _mmuWalkLevelInstAcquire(pWalk, &pWalk->root, vaLo, vaHi, NV_TRUE, NV_FALSE, bCommit, &bChanged, &pLevelInst, NV_FALSE ) @ mmu_walk.c:892

looks like Nvidia GFX problem?

(Dell XPS 9520, RTX 3050 Ti) nvidia-driver-560-open 6.8.0-55-generic #57-Ubuntu SMP PREEMPT_DYNAMIC

Anyone experienced those problems? updating nvidia drivers to 570 could help ?

yeah if I’m correct this problem has been fixed in either 565 or 570