Device driver crash (unable to handle page fault) after suspend-&-resume with version 555.58.02 on Linux kernel v6.9.9

Since the last driver update to 555.xx I can’t suspend-&-resume my laptop anymore. Suspend works, but on resume the device driver crashes and all displays stay black (no matter whether I am on X or switch to a VT). Looking into the logs after reboot, I see unhandled page faults caused by the nvidia device driver, followed by multiple follow-on errors.

I’ve meanwhile tested multiple permutations of module parameters. All of these fail in the same/similar way:

options nvidia-drm fbdev=1
options nvidia-drm fbdev=0
options nvidia NVreg_PreserveVideoMemoryAllocations=1
options nvidia NVreg_PreserveVideoMemoryAllocations=0

Other than that, I have the following settings:

options nvidia-drm modeset=1
options nvidia NVreg_DynamicPowerManagement=0x02 NVreg_EnableGpuFirmware=0
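
When testing several permutations like these, it can help to collapse all `options` lines into one view per module, so that a typo or a conflicting entry stands out. The following is only a minimal sketch: the function name `parse_modprobe_options` is hypothetical, it assumes that a later line overrides an earlier one for the same key, and it treats `nvidia-drm` and `nvidia_drm` as the same module name. It only reads configuration text; the values the loaded driver actually uses are typically visible in `/proc/driver/nvidia/params`.

```python
from collections import defaultdict


def parse_modprobe_options(text):
    """Collect `options <module> key=value ...` lines into {module: {key: value}}.

    Assumption for this sketch: a later line wins over an earlier one for the
    same key. Dashes in module names are normalized to underscores.
    """
    merged = defaultdict(dict)
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines and comment lines.
        if not line or line.startswith("#"):
            continue
        parts = line.split()
        # Only `options <module> <param>...` lines are of interest here.
        if len(parts) < 3 or parts[0] != "options":
            continue
        module = parts[1].replace("-", "_")
        for token in parts[2:]:
            key, _, value = token.partition("=")
            merged[module][key] = value
    return dict(merged)


# One of the tested permutations plus the fixed settings from this post.
sample = """\
options nvidia-drm modeset=1
options nvidia-drm fbdev=1
options nvidia NVreg_PreserveVideoMemoryAllocations=1
options nvidia NVreg_DynamicPowerManagement=0x02 NVreg_EnableGpuFirmware=0
"""

for module, params in parse_modprobe_options(sample).items():
    print(module, params)
```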

Distribution: Fedora 40
Kernel: 6.9.9-200.fc40.x86_64
GPU: NVIDIA T1200 Laptop GPU
Driver: 555.58.02

Jul 18 19:32:34 kernel: BUG: unable to handle page fault for address: ffffbe3894fa9604
Jul 18 19:32:34 kernel: #PF: supervisor read access in kernel mode
Jul 18 19:32:34 kernel: #PF: error_code(0x0000) - not-present page
Jul 18 19:32:34 kernel: PGD 100000067 P4D 100000067 PUD 1002ab067 PMD 0
Jul 18 19:32:34 kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
Jul 18 19:32:34 kernel: CPU: 0 PID: 562 Comm: irq/145-nvidia Tainted: P           OE      6.9.9-200.fc40.x86_64 #1
Jul 18 19:32:34 kernel: Hardware name: LENOVO 20Y4S32J06/20Y4S32J06, BIOS N40ET46W (1.28 ) 05/07/2024
Jul 18 19:32:34 kernel: RIP: 0010:_nv012398rm+0xbd/0x130 [nvidia]
Jul 18 19:32:34 kernel: Code: 8b 45 20 41 bf 01 00 00 00 41 89 54 24 20 41 89 44 24 24 4c 89 e6 4c 89 ef e8 ef 78 65 00 49 89 c4 48 85 c0 74 5f 49 8b 0c 24 <8b> 41 04 0f ae e8 41 39 44 24 20 74 dc 8b 41 08 0f b7 d8 25 00 00
Jul 18 19:32:34 kernel: RSP: 0018:ffffbe388084fce0 EFLAGS: 00010286
Jul 18 19:32:34 kernel: RAX: ffff9ceb8c9412d8 RBX: 0000000000000001 RCX: ffffbe3894fa9600
Jul 18 19:32:34 kernel: RDX: fffffffffffffff0 RSI: ffff9ceac2763008 RDI: ffff9ceac27638e8
Jul 18 19:32:34 kernel: RBP: ffff9ceae705abf0 R08: 0000000000000000 R09: 0000000000000020
Jul 18 19:32:34 kernel: R10: ffff9ceae705ac24 R11: ffffffffc0df3cd0 R12: ffff9ceb8c9412d8
Jul 18 19:32:34 kernel: R13: ffff9ceac27638e8 R14: ffff9ceac2763008 R15: 0000000000000000
Jul 18 19:32:34 kernel: FS:  0000000000000000(0000) GS:ffff9cf1fb000000(0000) knlGS:0000000000000000
Jul 18 19:32:34 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 18 19:32:34 kernel: CR2: ffffbe3894fa9604 CR3: 00000002a0428002 CR4: 0000000000f70ef0
Jul 18 19:32:34 kernel: PKRU: 55555554
Jul 18 19:32:34 kernel: Call Trace:
Jul 18 19:32:34 kernel:  <TASK>
Jul 18 19:32:34 kernel:  ? __die_body.cold+0x19/0x27
Jul 18 19:32:34 kernel:  ? page_fault_oops+0x15a/0x2c0
Jul 18 19:32:34 kernel:  ? search_module_extables+0x19/0x60
Jul 18 19:32:34 kernel:  ? search_bpf_extables+0x5f/0x80
Jul 18 19:32:34 kernel:  ? exc_page_fault+0x170/0x180
Jul 18 19:32:34 kernel:  ? asm_exc_page_fault+0x26/0x30
Jul 18 19:32:34 kernel:  ? _nv012399rm+0x1f0/0x1f0 [nvidia]
Jul 18 19:32:34 kernel:  ? _nv012398rm+0xbd/0x130 [nvidia]
Jul 18 19:32:34 kernel:  _nv035255rm+0x19c/0x2f0 [nvidia]
Jul 18 19:32:34 kernel:  _nv030184rm+0x25/0x130 [nvidia]
Jul 18 19:32:34 kernel:  _nv021803rm+0xa2/0x2af [nvidia]
Jul 18 19:32:34 kernel:  _nv031947rm+0x60/0xc0 [nvidia]
Jul 18 19:32:34 kernel:  _nv012544rm+0x27f/0x3f0 [nvidia]
Jul 18 19:32:34 kernel:  _nv031957rm+0x167/0x1d0 [nvidia]
Jul 18 19:32:34 kernel:  _nv000735rm+0x113/0x150 [nvidia]
Jul 18 19:32:34 kernel:  rm_isr_bh+0x20/0x5c [nvidia]
Jul 18 19:32:34 kernel:  nvidia_isr_kthread_bh+0x1f/0x50 [nvidia]
Jul 18 19:32:34 kernel:  irq_thread_fn+0x20/0x60
Jul 18 19:32:34 kernel:  irq_thread+0x1b3/0x2f0
Jul 18 19:32:34 kernel:  ? __pfx_irq_thread_fn+0x10/0x10
Jul 18 19:32:34 kernel:  ? __pfx_irq_thread_dtor+0x10/0x10
Jul 18 19:32:34 kernel:  ? __pfx_irq_thread+0x10/0x10
Jul 18 19:32:34 kernel:  kthread+0xcf/0x100
Jul 18 19:32:34 kernel:  ? __pfx_kthread+0x10/0x10
Jul 18 19:32:34 kernel:  ret_from_fork+0x31/0x50
Jul 18 19:32:34 kernel:  ? __pfx_kthread+0x10/0x10
Jul 18 19:32:34 kernel:  ret_from_fork_asm+0x1a/0x30
Jul 18 19:32:34 kernel:  </TASK>
Jul 18 19:32:34 kernel: Modules linked in: uinput snd_seq_dummy snd_hrtimer rfcomm xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_nat_tftp nft_redir nf_conntrack_tftp bridge stp llc snd_ctl_led snd_soc_skl_hda_dsp snd_soc_h>
Jul 18 19:32:34 kernel:  soundwire_intel snd_sof_intel_hda_mlink soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils snd_soc_hdac_hda snd_soc_acpi_intel_match soundwire_generic_allocation snd_>
Jul 18 19:32:34 kernel:  processor_thermal_wt_req gpu_sched thunderbolt i2c_i801 platform_profile processor_thermal_power_floor snd_timer mei processor_thermal_mbox drm_ttm_helper spi_intel i2c_smbus idma64 intel_soc_dts_iosf snd>
Jul 18 19:32:34 kernel: CR2: ffffbe3894fa9604
Jul 18 19:32:34 kernel: ---[ end trace 0000000000000000 ]---
Jul 18 19:32:34 kernel: RIP: 0010:_nv012398rm+0xbd/0x130 [nvidia]
Jul 18 19:32:34 kernel: Code: 8b 45 20 41 bf 01 00 00 00 41 89 54 24 20 41 89 44 24 24 4c 89 e6 4c 89 ef e8 ef 78 65 00 49 89 c4 48 85 c0 74 5f 49 8b 0c 24 <8b> 41 04 0f ae e8 41 39 44 24 20 74 dc 8b 41 08 0f b7 d8 25 00 00
Jul 18 19:32:34 kernel: RSP: 0018:ffffbe388084fce0 EFLAGS: 00010286
Jul 18 19:32:34 kernel: RAX: ffff9ceb8c9412d8 RBX: 0000000000000001 RCX: ffffbe3894fa9600
Jul 18 19:32:34 kernel: RDX: fffffffffffffff0 RSI: ffff9ceac2763008 RDI: ffff9ceac27638e8
Jul 18 19:32:34 kernel: RBP: ffff9ceae705abf0 R08: 0000000000000000 R09: 0000000000000020
Jul 18 19:32:34 kernel: R10: ffff9ceae705ac24 R11: ffffffffc0df3cd0 R12: ffff9ceb8c9412d8
Jul 18 19:32:34 kernel: R13: ffff9ceac27638e8 R14: ffff9ceac2763008 R15: 0000000000000000
Jul 18 19:32:34 kernel: FS:  0000000000000000(0000) GS:ffff9cf1fb000000(0000) knlGS:0000000000000000
Jul 18 19:32:34 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 18 19:32:34 kernel: CR2: ffffbe3894fa9604 CR3: 00000002a0428002 CR4: 0000000000f70ef0
Jul 18 19:32:34 kernel: PKRU: 55555554
Jul 18 19:32:34 kernel: note: irq/145-nvidia[562] exited with irqs disabled
Jul 18 19:32:34 kernel: kernel tried to execute NX-protected page - exploit attempt? (uid: 0)
Jul 18 19:32:34 kernel: BUG: unable to handle page fault for address: ffff9ceaf1bcaf40
Jul 18 19:32:34 kernel: #PF: supervisor instruction fetch in kernel mode
Jul 18 19:32:34 kernel: #PF: error_code(0x0011) - permissions violation
Jul 18 19:32:34 kernel: PGD 2a1801067 P4D 2a1801067 PUD 10249c063 PMD 131adf063 PTE 8000000131bca163
Jul 18 19:32:34 kernel: Oops: 0011 [#2] PREEMPT SMP NOPTI
Jul 18 19:32:34 kernel: CPU: 0 PID: 562 Comm: irq/145-nvidia Tainted: P      D    OE      6.9.9-200.fc40.x86_64 #1
Jul 18 19:32:34 kernel: Hardware name: LENOVO 20Y4S32J06/20Y4S32J06, BIOS N40ET46W (1.28 ) 05/07/2024
Jul 18 19:32:34 kernel: RIP: 0010:0xffff9ceaf1bcaf40
Jul 18 19:32:34 kernel: Code: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 <00> 40 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
Jul 18 19:32:34 kernel: RSP: 0018:ffffbe388084feb0 EFLAGS: 00010286
Jul 18 19:32:34 kernel: RAX: ffff9ceaf1bcaf40 RBX: ffffffff8f15452d RCX: 00000000000001c0
Jul 18 19:32:34 kernel: RDX: 0000000000000001 RSI: 0000000000000282 RDI: ffffbe388084feb0
Jul 18 19:32:34 kernel: RBP: ffff9ceaf1bcaf40 R08: 0000000000000000 R09: 312f717269203a65
Jul 18 19:32:34 kernel: R10: ffffbe388084fd08 R11: 7269203a65746f6e R12: ffff9ceaf1bcbc1c
Jul 18 19:32:34 kernel: R13: ffff9ceaf1ba5a01 R14: 0000000000000000 R15: 0000000000000000
Jul 18 19:32:34 kernel: FS:  0000000000000000(0000) GS:ffff9cf1fb000000(0000) knlGS:0000000000000000
Jul 18 19:32:34 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 18 19:32:34 kernel: CR2: ffff9ceaf1bcaf40 CR3: 00000002a0428002 CR4: 0000000000f70ef0
Jul 18 19:32:34 kernel: PKRU: 55555554
Jul 18 19:32:34 kernel: Call Trace:
Jul 18 19:32:34 kernel:  <TASK>
Jul 18 19:32:34 kernel:  ? __die_body.cold+0x19/0x27
Jul 18 19:32:34 kernel:  ? page_fault_oops+0x15a/0x2c0
Jul 18 19:32:34 kernel:  ? search_module_extables+0x19/0x60
Jul 18 19:32:34 kernel:  ? exc_page_fault+0x170/0x180
Jul 18 19:32:34 kernel:  ? asm_exc_page_fault+0x26/0x30
Jul 18 19:32:34 kernel:  ? task_work_run+0x4d/0x90
Jul 18 19:32:34 kernel:  ? task_work_run+0x59/0x90
Jul 18 19:32:34 kernel:  ? do_exit+0x31b/0xac0
Jul 18 19:32:34 kernel:  ? make_task_dead+0x90/0x90
Jul 18 19:32:34 kernel:  ? rewind_stack_and_make_dead+0x16/0x20
Jul 18 19:32:34 kernel:  </TASK>
Jul 18 19:32:34 kernel: Modules linked in: uinput snd_seq_dummy snd_hrtimer rfcomm xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_nat_tftp nft_redir nf_conntrack_tftp bridge stp llc snd_ctl_led snd_soc_skl_hda_dsp snd_soc_h>
Jul 18 19:32:34 kernel:  soundwire_intel snd_sof_intel_hda_mlink soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils snd_soc_hdac_hda snd_soc_acpi_intel_match soundwire_generic_allocation snd_>
Jul 18 19:32:34 kernel:  processor_thermal_wt_req gpu_sched thunderbolt i2c_i801 platform_profile processor_thermal_power_floor snd_timer mei processor_thermal_mbox drm_ttm_helper spi_intel i2c_smbus idma64 intel_soc_dts_iosf snd>
Jul 18 19:32:34 kernel: CR2: ffff9ceaf1bcaf40
Jul 18 19:32:34 kernel: ---[ end trace 0000000000000000 ]---
Jul 18 19:32:34 kernel: RIP: 0010:_nv012398rm+0xbd/0x130 [nvidia]
Jul 18 19:32:34 kernel: Code: 8b 45 20 41 bf 01 00 00 00 41 89 54 24 20 41 89 44 24 24 4c 89 e6 4c 89 ef e8 ef 78 65 00 49 89 c4 48 85 c0 74 5f 49 8b 0c 24 <8b> 41 04 0f ae e8 41 39 44 24 20 74 dc 8b 41 08 0f b7 d8 25 00 00
Jul 18 19:32:34 kernel: RSP: 0018:ffffbe388084fce0 EFLAGS: 00010286
Jul 18 19:32:34 kernel: RAX: ffff9ceb8c9412d8 RBX: 0000000000000001 RCX: ffffbe3894fa9600
Jul 18 19:32:34 kernel: RDX: fffffffffffffff0 RSI: ffff9ceac2763008 RDI: ffff9ceac27638e8
Jul 18 19:32:34 kernel: RBP: ffff9ceae705abf0 R08: 0000000000000000 R09: 0000000000000020
Jul 18 19:32:34 kernel: R10: ffff9ceae705ac24 R11: ffffffffc0df3cd0 R12: ffff9ceb8c9412d8
Jul 18 19:32:34 kernel: R13: ffff9ceac27638e8 R14: ffff9ceac2763008 R15: 0000000000000000
Jul 18 19:32:34 kernel: FS:  0000000000000000(0000) GS:ffff9cf1fb000000(0000) knlGS:0000000000000000
Jul 18 19:32:34 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 18 19:32:34 kernel: CR2: ffff9ceaf1bcaf40 CR3: 00000002a0428002 CR4: 0000000000f70ef0
Jul 18 19:32:34 kernel: PKRU: 55555554
Jul 18 19:32:34 kernel: note: irq/145-nvidia[562] exited with irqs disabled
Jul 18 19:32:34 kernel: Fixing recursive fault but reboot is needed!
Jul 18 19:32:34 kernel: BUG: scheduling while atomic: irq/145-nvidia/562/0x00000000
Jul 18 19:32:34 kernel: Modules linked in: uinput snd_seq_dummy snd_hrtimer rfcomm xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_nat_tftp nft_redir nf_conntrack_tftp bridge stp llc snd_ctl_led snd_soc_skl_hda_dsp snd_soc_h>
Jul 18 19:32:34 kernel:  soundwire_intel snd_sof_intel_hda_mlink soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils snd_soc_hdac_hda snd_soc_acpi_intel_match soundwire_generic_allocation snd_>
Jul 18 19:32:34 kernel:  processor_thermal_wt_req gpu_sched thunderbolt i2c_i801 platform_profile processor_thermal_power_floor snd_timer mei processor_thermal_mbox drm_ttm_helper spi_intel i2c_smbus idma64 intel_soc_dts_iosf snd>
Jul 18 19:32:34 kernel: CPU: 0 PID: 562 Comm: irq/145-nvidia Tainted: P      D    OE      6.9.9-200.fc40.x86_64 #1
Jul 18 19:32:34 kernel: Hardware name: LENOVO 20Y4S32J06/20Y4S32J06, BIOS N40ET46W (1.28 ) 05/07/2024
Jul 18 19:32:34 kernel: Call Trace:
Jul 18 19:32:34 kernel:  <TASK>
Jul 18 19:32:34 kernel:  dump_stack_lvl+0x5d/0x80
Jul 18 19:32:34 kernel:  __schedule_bug.cold+0x42/0x4e
Jul 18 19:32:34 kernel:  __schedule+0x10f2/0x1710
Jul 18 19:32:34 kernel:  ? __wake_up_klogd.part.0+0x3c/0x60
Jul 18 19:32:34 kernel:  ? vprintk_emit+0x176/0x2a0
Jul 18 19:32:34 kernel:  ? _printk+0x6c/0x87
Jul 18 19:32:34 kernel:  do_task_dead+0x42/0x50
Jul 18 19:32:34 kernel:  make_task_dead.cold+0xd9/0xe5
Jul 18 19:32:34 kernel:  rewind_stack_and_make_dead+0x16/0x20
Jul 18 19:32:34 kernel: RIP: 0000:0x0
Jul 18 19:32:34 kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6.
Jul 18 19:32:34 kernel: RSP: 0000:0000000000000000 EFLAGS: 00000000 ORIG_RAX: 0000000000000000
Jul 18 19:32:34 kernel: RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
Jul 18 19:32:34 kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
Jul 18 19:32:34 kernel: RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
Jul 18 19:32:34 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
Jul 18 19:32:34 kernel: R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
Jul 18 19:32:34 kernel:  </TASK>
Jul 18 19:32:36 kernel: nvidia-modeset: WARNING: GPU:0: Lost display notification (0:0x00000000); continuing.

This is definitely a regression; I used to be able to suspend and resume that same machine for days.

After reboot: nvidia-bug-report.log.gz (1.6 MB)
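
Logs like the one above repeat the register dump several times, which makes the actual sequence of faults hard to see. As a rough sketch (not an official tool; `summarize_faults` and the regular expressions are hypothetical helpers written for this thread's log format), the following pairs each `BUG:` line with the first `RIP:` line that follows it:

```python
import re

BUG_RE = re.compile(r"BUG: (.+)$")
RIP_RE = re.compile(r"RIP: [0-9a-f]{4}:(\S+)")


def summarize_faults(log_text):
    """Return one dict per `BUG:` line, with the first RIP seen after it (or None)."""
    faults = []
    current = None
    for line in log_text.splitlines():
        bug = BUG_RE.search(line)
        if bug:
            current = {"bug": bug.group(1), "rip": None}
            faults.append(current)
            continue
        rip = RIP_RE.search(line)
        # Keep only the first RIP after each BUG; later repeats are ignored.
        if rip and current is not None and current["rip"] is None:
            current["rip"] = rip.group(1)
    return faults


# Shortened excerpt of the log posted above.
sample = """\
Jul 18 19:32:34 kernel: BUG: unable to handle page fault for address: ffffbe3894fa9604
Jul 18 19:32:34 kernel: RIP: 0010:_nv012398rm+0xbd/0x130 [nvidia]
Jul 18 19:32:34 kernel: RIP: 0010:_nv012398rm+0xbd/0x130 [nvidia]
Jul 18 19:32:34 kernel: BUG: unable to handle page fault for address: ffff9ceaf1bcaf40
Jul 18 19:32:34 kernel: RIP: 0010:0xffff9ceaf1bcaf40
Jul 18 19:32:34 kernel: BUG: scheduling while atomic: irq/145-nvidia/562/0x00000000
"""

for fault in summarize_faults(sample):
    print(fault["bug"], "->", fault["rip"])
```

On the excerpt, this makes it easier to see that the first fault is inside the nvidia module and that the later ones happen while the already-crashed IRQ thread is being torn down.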

Seeing the same thing on 555.58.02. Suspend no longer works - the same error messages as the previous poster.

Hi,
We have seen a similar issue internally and have already filed bug 4590535 for tracking purposes.
We will share further updates once we receive them from the engineering team.

I am seeing similar behavior with 555.58.02. Cannot resume after a sleep.

Kernel: 6.10.0-arch1-2
GPU: NVIDIA GeForce GTX 1050 Ti
Driver Version: 555.58.02

This is what appears on my screen when trying to boot:

Jul 23 16:10:48 arch kernel: BUG: kernel NULL pointer dereference, address: 000000000000032c
Jul 23 16:10:48 arch kernel: #PF: supervisor read access in kernel mode
Jul 23 16:10:48 arch kernel: #PF: error_code(0x0000) - not-present page
Jul 23 16:10:48 arch kernel: PGD 0 P4D 0 
Jul 23 16:10:48 arch kernel: Oops: Oops: 0000 [#2] PREEMPT SMP NOPTI
Jul 23 16:10:48 arch kernel: CPU: 0 PID: 680 Comm: irq/98-nvidia Tainted: P      D    OE      6.10.0-arch1-2 #1 ec818e96762f5a8ef3adc527a4740ba5b3ca4df5
Jul 23 16:10:48 arch kernel: Hardware name: Micro-Star International Co., Ltd. MS-7C95/PRO B550M-VC WIFI (MS-7C95), BIOS H.A0 10/13/2023
Jul 23 16:10:48 arch kernel: RIP: 0010:exit_shm+0x4b/0x1d0
Jul 23 16:10:48 arch kernel: Code: 89 fd 53 48 83 ec 08 4c 89 e7 e8 00 fe 91 00 48 8b 85 28 08 00 00 4c 39 f8 0f 84 52 01 00 00 4c 8b b5 28 08 00 00 49 8b 5e 10 <8b> 93 2c 03 00 00 85 d2 0f 84 04 01 00 00 48 85 db 0f 84 fb 00 00
Jul 23 16:10:48 arch kernel: RSP: 0018:ffffa3c210257ea0 EFLAGS: 00010282
Jul 23 16:10:48 arch kernel: RAX: ffff8a5a04cf2ba8 RBX: 0000000000000000 RCX: 00000000000001c0
Jul 23 16:10:48 arch kernel: RDX: 0000000000000001 RSI: 0000000000000286 RDI: ffffa3c210257e78
Jul 23 16:10:48 arch kernel: RBP: ffff8a5a04cf2380 R08: 0000000000000000 R09: ffffa3c210257d00
Jul 23 16:10:48 arch kernel: R10: ffffffff8e0b21e8 R11: 0000000000000003 R12: ffff8a5a04cf2c9c
Jul 23 16:10:48 arch kernel: R13: ffff8a5a1730ca01 R14: ffff8a5a04cf2ba8 R15: 0000000000000000
Jul 23 16:10:48 arch kernel: FS:  0000000000000000(0000) GS:ffff8a5c8fc00000(0000) knlGS:0000000000000000
Jul 23 16:10:48 arch kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 23 16:10:48 arch kernel: CR2: 000000000000032c CR3: 000000013c020000 CR4: 0000000000f50ef0
Jul 23 16:10:48 arch kernel: PKRU: 55555554
Jul 23 16:10:48 arch kernel: Call Trace:
Jul 23 16:10:48 arch kernel:  <TASK>
Jul 23 16:10:48 arch kernel:  ? __die_body.cold+0x19/0x27
Jul 23 16:10:48 arch kernel:  ? page_fault_oops+0x15a/0x2d0
Jul 23 16:10:48 arch kernel:  ? desc_update_last_finalized+0x4c/0x90
Jul 23 16:10:48 arch kernel:  ? exc_page_fault+0x81/0x190
Jul 23 16:10:48 arch kernel:  ? asm_exc_page_fault+0x26/0x30
Jul 23 16:10:48 arch kernel:  ? exit_shm+0x4b/0x1d0
Jul 23 16:10:48 arch kernel:  ? task_work_run+0x5c/0x90
Jul 23 16:10:48 arch kernel:  ? do_exit+0x318/0xad0
Jul 23 16:10:48 arch kernel:  ? make_task_dead+0x90/0x90
Jul 23 16:10:48 arch kernel:  ? rewind_stack_and_make_dead+0x16/0x20
Jul 23 16:10:48 arch kernel:  </TASK>
Jul 23 16:10:48 arch kernel: Modules linked in: snd_seq_dummy snd_hrtimer snd_seq snd_seq_device rfcomm cmac algif_hash algif_skcipher af_alg bnep vfat fat pkcs8_key_parser amd_atl intel_rapl_msr intel_rapl_common nvidia_drm(POE) nvidia_uvm(POE) nvidia_modeset(POE) kvm_amd kvm mt7921e mt7921_common crct10dif_pclmul mt792x_lib crc32_pclmul polyval_clmulni snd_hda_codec_realtek mt76_connac_lib polyval_generic snd_hda_codec_generic gf128mul mt76 snd_hda_codec_hdmi snd_hda_scodec_component nvidia(POE) ghash_clmulni_intel sha512_ssse3 mac80211 sha256_ssse3 snd_hda_intel sha1_ssse3 btusb snd_intel_dspcfg aesni_intel btrtl snd_intel_sdw_acpi btintel crypto_simd libarc4 snd_hda_codec cryptd btbcm btmtk rapl snd_hda_core r8169 pcspkr wmi_bmof sp5100_tco cfg80211 snd_hwdep bluetooth realtek snd_pcm k10temp i2c_piix4 mdio_devres snd_timer rfkill ccp libphy snd soundcore gpio_amdpt gpio_generic joydev mousedev mac_hid i2c_dev crypto_user dm_mod loop nfnetlink zram ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 hid_generic usbhid amdgpu
Jul 23 16:10:48 arch kernel:  amdxcp i2c_algo_bit drm_ttm_helper ttm drm_exec gpu_sched drm_suballoc_helper drm_buddy nvme drm_display_helper nvme_core crc32c_intel cec xhci_pci nvme_auth video xhci_pci_renesas wmi
Jul 23 16:10:48 arch kernel: CR2: 000000000000032c
Jul 23 16:10:48 arch kernel: ---[ end trace 0000000000000000 ]---
Jul 23 16:10:48 arch kernel: RIP: 0010:_nv012398rm+0xbd/0x130 [nvidia]
Jul 23 16:10:48 arch kernel: Code: 8b 45 20 41 bf 01 00 00 00 41 89 54 24 20 41 89 44 24 24 4c 89 e6 4c 89 ef e8 ef 78 65 00 49 89 c4 48 85 c0 74 5f 49 8b 0c 24 <8b> 41 04 0f ae e8 41 39 44 24 20 74 dc 8b 41 08 0f b7 d8 25 00 00
Jul 23 16:10:48 arch kernel: RSP: 0018:ffffa3c210257cc0 EFLAGS: 00010286
Jul 23 16:10:48 arch kernel: RAX: ffff8a5a1ffda1d8 RBX: 0000000000000001 RCX: ffffa3c211fbac00
Jul 23 16:10:48 arch kernel: RDX: fffffffffffffff0 RSI: ffff8a5a1edf4008 RDI: ffff8a5a1edf48e8
Jul 23 16:10:48 arch kernel: RBP: ffff8a5a1edcdbe0 R08: 0000000000000000 R09: 0000000000000020
Jul 23 16:10:48 arch kernel: R10: ffff8a5a1edcdc24 R11: ffffffffc1f84a10 R12: ffff8a5a1ffda1d8
Jul 23 16:10:48 arch kernel: R13: ffff8a5a1edf48e8 R14: ffff8a5a1edf4008 R15: 0000000000000000
Jul 23 16:10:48 arch kernel: FS:  0000000000000000(0000) GS:ffff8a5c8fc00000(0000) knlGS:0000000000000000
Jul 23 16:10:48 arch kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 23 16:10:48 arch kernel: CR2: 000000000000032c CR3: 000000013c020000 CR4: 0000000000f50ef0
Jul 23 16:10:48 arch kernel: PKRU: 55555554
Jul 23 16:10:48 arch kernel: note: irq/98-nvidia[680] exited with irqs disabled
Jul 23 16:10:48 arch kernel: Fixing recursive fault but reboot is needed!
Jul 23 16:10:48 arch kernel: BUG: scheduling while atomic: irq/98-nvidia/680/0x00000000
Jul 23 16:10:48 arch kernel: Modules linked in: snd_seq_dummy snd_hrtimer snd_seq snd_seq_device rfcomm cmac algif_hash algif_skcipher af_alg bnep vfat fat pkcs8_key_parser amd_atl intel_rapl_msr intel_rapl_common nvidia_drm(POE) nvidia_uvm(POE) nvidia_modeset(POE) kvm_amd kvm mt7921e mt7921_common crct10dif_pclmul mt792x_lib crc32_pclmul polyval_clmulni snd_hda_codec_realtek mt76_connac_lib polyval_generic snd_hda_codec_generic gf128mul mt76 snd_hda_codec_hdmi snd_hda_scodec_component nvidia(POE) ghash_clmulni_intel sha512_ssse3 mac80211 sha256_ssse3 snd_hda_intel sha1_ssse3 btusb snd_intel_dspcfg aesni_intel btrtl snd_intel_sdw_acpi btintel crypto_simd libarc4 snd_hda_codec cryptd btbcm btmtk rapl snd_hda_core r8169 pcspkr wmi_bmof sp5100_tco cfg80211 snd_hwdep bluetooth realtek snd_pcm k10temp i2c_piix4 mdio_devres snd_timer rfkill ccp libphy snd soundcore gpio_amdpt gpio_generic joydev mousedev mac_hid i2c_dev crypto_user dm_mod loop nfnetlink zram ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 hid_generic usbhid amdgpu
Jul 23 16:10:48 arch kernel:  amdxcp i2c_algo_bit drm_ttm_helper ttm drm_exec gpu_sched drm_suballoc_helper drm_buddy nvme drm_display_helper nvme_core crc32c_intel cec xhci_pci nvme_auth video xhci_pci_renesas wmi
Jul 23 16:10:48 arch kernel: CPU: 0 PID: 680 Comm: irq/98-nvidia Tainted: P      D    OE      6.10.0-arch1-2 #1 ec818e96762f5a8ef3adc527a4740ba5b3ca4df5
Jul 23 16:10:48 arch kernel: Hardware name: Micro-Star International Co., Ltd. MS-7C95/PRO B550M-VC WIFI (MS-7C95), BIOS H.A0 10/13/2023
Jul 23 16:10:48 arch kernel: Call Trace:
Jul 23 16:10:48 arch kernel:  <TASK>
Jul 23 16:10:48 arch kernel:  dump_stack_lvl+0x5d/0x80
Jul 23 16:10:48 arch kernel:  __schedule_bug.cold+0x42/0x4e
Jul 23 16:10:48 arch kernel:  __schedule+0x109b/0x1520
Jul 23 16:10:48 arch kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jul 23 16:10:48 arch kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jul 23 16:10:48 arch kernel:  ? vprintk_emit+0x176/0x2a0
Jul 23 16:10:48 arch kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jul 23 16:10:48 arch kernel:  ? _printk+0x6c/0x87
Jul 23 16:10:48 arch kernel:  do_task_dead+0x42/0x50
Jul 23 16:10:48 arch kernel:  make_task_dead.cold+0xd9/0xe5
Jul 23 16:10:48 arch kernel:  rewind_stack_and_make_dead+0x16/0x20
Jul 23 16:10:48 arch kernel: RIP: 0000:0x0
Jul 23 16:10:48 arch kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6.
Jul 23 16:10:48 arch kernel: RSP: 0000:0000000000000000 EFLAGS: 00000000 ORIG_RAX: 0000000000000000
Jul 23 16:10:48 arch kernel: RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
Jul 23 16:10:48 arch kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
Jul 23 16:10:48 arch kernel: RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
Jul 23 16:10:48 arch kernel: R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
Jul 23 16:10:48 arch kernel: R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
Jul 23 16:10:48 arch kernel:  </TASK>
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: reserve 0x400000 from 0xf47f800000 for PSP TMR
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: RAS: optional ras ta ucode is not available
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: RAP: optional rap ta ucode is not available
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: SMU is resuming...
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: dpm has been disabled
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: SMU is resumed successfully!
Jul 23 16:10:48 arch kernel: [drm] DMUB hardware initialized: version=0x01010028
Jul 23 16:10:48 arch kernel: [drm] kiq ring mec 2 pipe 1 q 0
Jul 23 16:10:48 arch kernel: [drm] VCN decode and encode initialized successfully(under DPG Mode).
Jul 23 16:10:48 arch kernel: [drm] JPEG decode initialized successfully.
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 11 on hub 0
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 8
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 8
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 8
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 8
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 8
Jul 23 16:10:48 arch kernel: amdgpu 0000:30:00.0: [drm] Cannot find any crtc or sizes
Jul 23 16:10:48 arch kernel: usb 1-8: reset full-speed USB device number 4 using xhci_hcd
Jul 23 16:10:48 arch kernel: usb 1-9: reset high-speed USB device number 5 using xhci_hcd
Jul 23 16:10:48 arch kernel: usb 1-7: reset high-speed USB device number 3 using xhci_hcd
Jul 23 16:10:48 arch kernel: Bluetooth: hci0: HW/SW Version: 0x008a008a, Build Time: 20240219111427
Jul 23 16:10:48 arch kernel: OOM killer enabled.
Jul 23 16:10:48 arch kernel: Restarting tasks ... done.
Jul 23 16:10:48 arch kernel: random: crng reseeded on system resumption

nvidia-bug-report.log.gz (643.6 KB)

What is the short-term recommendation?

Good question. I’ve since stopped suspending my laptop… which sucks big time. I, at least, have not found any other “workaround”.

Having the same issue on Bazzite (based on Fedora Atomic Desktop). The only “workaround” I’ve found is to boot an older kernel, 6.9.8-204; that’s the last version with which I can reliably suspend/resume using NVIDIA driver 555.58.02.

Can confirm that using an older kernel (6.6.42 - Arch linux-lts) seems to restore suspend and restore functionality with 555.58.02.

I take that back - kernel 6.6.42 in combination with 555.58.02 still causes the kernel page fault issue on suspend and restore.

Issue persists with kernel 6.10.3-202 and nvidia-driver-560.31.02-1.

RTX 4070 Ti Super.

Using the GSP firmware through options nvidia NVreg_EnableGpuFirmware=1 improved resume stability for me here.
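
To confirm whether the GSP firmware is really in use after setting that option, the driver README suggests looking at the `GSP Firmware Version` field of `nvidia-smi -q`. A small sketch for checking that field is shown below; the function name `gsp_firmware_version` is hypothetical, the sample strings are illustrative rather than real output from any machine in this thread, and the exact layout of the `nvidia-smi` output is an assumption.

```python
import re


def gsp_firmware_version(nvidia_smi_q_output):
    """Return the reported GSP firmware version, or None if missing or N/A."""
    match = re.search(
        r"^\s*GSP Firmware Version\s*:\s*(\S+)",
        nvidia_smi_q_output,
        re.MULTILINE,
    )
    if not match or match.group(1) == "N/A":
        return None
    return match.group(1)


# Illustrative snippets only; real text would come from running `nvidia-smi -q`.
with_gsp = "    GSP Firmware Version                  : 555.58.02\n"
without_gsp = "    GSP Firmware Version                  : N/A\n"

print(gsp_firmware_version(with_gsp))
print(gsp_firmware_version(without_gsp))
```

On a real system, the input could be obtained with `subprocess.run(["nvidia-smi", "-q"], capture_output=True, text=True).stdout`.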

I have the same error, and I’ve been unable to suspend my desktop for months now.

My system details:

GeForce GTX 1050 Ti
Driver: 560.35.03
Kernel: 6.11.0-8-generic
System: Ubuntu 24.10

I was running Ubuntu 24.04 with kernel 6.8 and driver 555.58.02 and had exactly the same issue.

BUG: unable to handle page fault for address: ffffb92f94fcac04
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
kernel tried to execute NX-protected page - exploit attempt? (uid: 0)
BUG: unable to handle page fault for address: ffffb92f82eafe90
#PF: supervisor instruction fetch in kernel mode
#PF: error_code(0x0011) - permissions violation
Fixing recursive fault but reboot is needed!
BUG: scheduling while atomic: irq/79-nvidia/1289/0x00000000
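
For readers comparing these traces, the `error_code(...)` values can be decoded from the standard x86 page-fault error code bits. The sketch below covers only the low five bits (the helper name `decode_pf_error_code` is hypothetical) and reproduces the wording the kernel prints next to the two codes seen in this thread:

```python
def decode_pf_error_code(code):
    """Decode the low bits of an x86 page-fault error code.

    Bit 0: 0 = page not present, 1 = protection (permissions) violation
    Bit 1: 0 = read, 1 = write
    Bit 2: 0 = supervisor access, 1 = user access
    Bit 3: reserved bit set in a paging structure
    Bit 4: fault was caused by an instruction fetch
    """
    if code & 0x10:
        access = "instruction fetch"
    elif code & 0x02:
        access = "write"
    else:
        access = "read"
    return {
        "cause": "permissions violation" if code & 0x01 else "not-present page",
        "access": access,
        "mode": "user" if code & 0x04 else "supervisor",
        "reserved_bit_set": bool(code & 0x08),
    }


for code in (0x0000, 0x0011):
    print(f"0x{code:04x}", decode_pf_error_code(code))
```

Read this way, `0x0000` is a supervisor read of an unmapped address, and `0x0011` is a supervisor instruction fetch from a page that is mapped but not executable, which matches the "NX-protected page" message.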

Here is the full log:
nvidia_bug.log (343.7 KB)

However, I have another brand-new system, with GeForce RTX 4050 and the latest Intel Core Ultra 7 155H, with exactly the same software and suspend/restore works fine.

I’m happy to help with some debugging if someone can provide instructions, and any workaround would be appreciated.

GSP firmware is not included in all GPUs, so this doesn’t help me.

https://download.nvidia.com/XFree86/Linux-x86_64/510.39.01/README/gsp.html

My computer has been affected by this bug for some time. I simply downgraded the drivers (sticking to 535.xx) and did not analyze it in depth.

Today I tried several things I found to see if any of them would help me resume from suspend.
At least in my tests, if the system is suspended from a Wayland session, it resumes without problems.

My system:
GeForce GTX 1050
Driver: 560.35.03
Kernel: 6.11.3-arch1-1
System: Archlinux

Some configs:

/etc/default/grub
GRUB_CMDLINE_LINUX_DEFAULT="loglevel=3 quiet nouveau.modeset=0 nvidia_drm.modeset=1 nvidia_drm.fbdev=1"
/etc/modprobe.d/nvidia.conf
options nvidia_drm modeset=1
options nvidia_drm fbdev=1
options nvidia NVreg_PreserveVideoMemoryAllocations=1
options nvidia NVreg_TemporaryFilePath=/var/tmp
/etc/mkinitcpio.conf
MODULES=(nvidia nvidia_drm nvidia_modeset nvidia_uvm)
HOOKS=(base udev autodetect microcode modconf keyboard keymap consolefont block filesystems fsck)

And these services are enabled: nvidia-hibernate.service nvidia-resume.service nvidia-suspend.service
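
For anyone reproducing this setup, a quick way to double-check that the three units are enabled is to ask `systemctl is-enabled` for each one. This is only a sketch: `systemctl_is_enabled` and `disabled_units` are hypothetical helper names, and the query function is injectable so the logic can be tried without systemd.

```python
import subprocess

SERVICES = (
    "nvidia-suspend.service",
    "nvidia-resume.service",
    "nvidia-hibernate.service",
)


def systemctl_is_enabled(unit):
    """Return the state string printed by `systemctl is-enabled <unit>`."""
    result = subprocess.run(
        ["systemctl", "is-enabled", unit],
        capture_output=True,
        text=True,
        check=False,  # a non-zero exit status is expected for disabled units
    )
    return result.stdout.strip()


def disabled_units(units=SERVICES, query=systemctl_is_enabled):
    """Return the units whose reported state is anything other than 'enabled'."""
    return [unit for unit in units if query(unit) != "enabled"]
```

Calling `disabled_units()` on the affected machine should return an empty list if all three services are enabled.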

The bug still happens, but as far as I can see the system can recover:

oct 17 18:13:39 florencia kernel: ------------[ cut here ]------------
oct 17 18:13:39 florencia kernel: WARNING: CPU: 2 PID: 9108 at include/linux/rwsem.h:80 follow_pte+0x1de/0x200
oct 17 18:13:39 florencia kernel: Modules linked in: snd_seq_dummy snd_hrtimer snd_seq snd_seq_device cfg80211 rfkill ipt_REJECT nf_reject_ipv4 xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c xt_tcpudp iptable_filter kvm_amd ccp kvm snd_hda_codec_realtek snd_hda_codec_generic crct10dif_pclmul snd_hda_scodec_component crc32_pclmul snd_hda_codec_hdmi polyval_clmulni polyval_generic at24 ghash_clmulni_intel snd_hda_intel sha512_ssse3 snd_intel_dspcfg sha256_ssse3 snd_intel_sdw_acpi sha1_ssse3 aesni_intel snd_hda_codec sp5100_tco wmi_bmof gf128mul snd_hda_core crypto_simd snd_hwdep mxm_wmi cryptd r8169 k10temp fam15h_power snd_pcm i2c_piix4 realtek i2c_smbus snd_timer mdio_devres snd libphy soundcore mousedev joydev mac_hid tcp_bbr sch_cake xpad ff_memless i2c_dev sg crypto_user acpi_cpufreq loop dm_mod nfnetlink ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 sr_mod crc32c_intel xhci_pci cdrom xhci_pci_renesas nvidia_uvm(POE) nvidia_drm(POE) drm_ttm_helper ttm nvidia_modeset(POE) video uas wmi usb_storage
oct 17 18:13:39 florencia kernel:  hid_generic usbhid nvidia(POE)
oct 17 18:13:39 florencia kernel: CPU: 2 UID: 0 PID: 9108 Comm: nvidia-sleep.sh Tainted: P        W  OE      6.11.3-arch1-1 #1 1400000003000000474e55000681d53aa6c7b79b
oct 17 18:13:39 florencia kernel: Tainted: [P]=PROPRIETARY_MODULE, [W]=WARN, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
oct 17 18:13:39 florencia kernel: Hardware name: To be filled by O.E.M. To be filled by O.E.M./M5A97 LE R2.0, BIOS 2701 03/24/2016
oct 17 18:13:39 florencia kernel: RIP: 0010:follow_pte+0x1de/0x200
oct 17 18:13:39 florencia kernel: Code: 9e b0 00 48 81 e2 00 00 00 c0 48 09 c2 48 f7 d2 48 85 fa 75 20 e8 42 f1 ff ff 48 8b 35 7b b6 5c 01 48 81 e6 00 00 00 c0 eb 8d <0f> 0b 48 3b 1f 0f 83 50 fe ff ff bd ea ff ff ff eb b6 49 8b 3c 24
oct 17 18:13:39 florencia kernel: RSP: 0018:ffffb0758448b7e0 EFLAGS: 00010246
oct 17 18:13:39 florencia kernel: RAX: 0000000000000000 RBX: 00007a90dfc17000 RCX: ffffb0758448b820
oct 17 18:13:39 florencia kernel: RDX: ffffb0758448b818 RSI: 00007a90dfc17000 RDI: ffff94adccaf4170
oct 17 18:13:39 florencia kernel: RBP: ffffb0758448b860 R08: ffffb0758448b9b8 R09: 0000000000000000
oct 17 18:13:39 florencia kernel: R10: 0000000000400006 R11: 0000000000000000 R12: ffffb0758448b820
oct 17 18:13:39 florencia kernel: R13: ffffb0758448b818 R14: ffff94adc0079b80 R15: 0000000000000000
oct 17 18:13:39 florencia kernel: FS:  00007bf4b4130b80(0000) GS:ffff94b0eeb00000(0000) knlGS:0000000000000000
oct 17 18:13:39 florencia kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
oct 17 18:13:39 florencia kernel: CR2: 00005f3613cd1048 CR3: 00000001cae16000 CR4: 00000000000406f0
oct 17 18:13:39 florencia kernel: Call Trace:
oct 17 18:13:39 florencia kernel:  <TASK>
oct 17 18:13:39 florencia kernel:  ? follow_pte+0x1de/0x200
oct 17 18:13:39 florencia kernel:  ? __warn.cold+0x8e/0xe8
oct 17 18:13:39 florencia kernel:  ? follow_pte+0x1de/0x200
oct 17 18:13:39 florencia kernel:  ? report_bug+0xff/0x140
oct 17 18:13:39 florencia kernel:  ? handle_bug+0x3c/0x80
oct 17 18:13:39 florencia kernel:  ? exc_invalid_op+0x17/0x70
oct 17 18:13:39 florencia kernel:  ? asm_exc_invalid_op+0x1a/0x20
oct 17 18:13:39 florencia kernel:  ? follow_pte+0x1de/0x200
oct 17 18:13:39 florencia kernel:  follow_phys+0x49/0x110
oct 17 18:13:39 florencia kernel:  untrack_pfn+0x55/0x120
oct 17 18:13:39 florencia kernel:  unmap_single_vma+0xa6/0xe0
oct 17 18:13:39 florencia kernel:  zap_page_range_single+0x122/0x1d0
oct 17 18:13:39 florencia kernel:  unmap_mapping_range+0x116/0x140
oct 17 18:13:39 florencia kernel:  nv_revoke_gpu_mappings_locked+0x47/0x70 [nvidia 1400000003000000474e5500625d9666e35647df]
oct 17 18:13:39 florencia kernel:  nv_set_system_power_state+0x1cd/0x470 [nvidia 1400000003000000474e5500625d9666e35647df]
oct 17 18:13:39 florencia kernel:  nv_procfs_write_suspend+0xef/0x170 [nvidia 1400000003000000474e5500625d9666e35647df]
oct 17 18:13:39 florencia kernel:  proc_reg_write+0x5d/0xa0
oct 17 18:13:39 florencia kernel:  vfs_write+0xf8/0x460
oct 17 18:13:39 florencia kernel:  ? filp_flush+0x52/0x80
oct 17 18:13:39 florencia kernel:  ? do_dup2+0xad/0x130
oct 17 18:13:39 florencia kernel:  ksys_write+0x6d/0xf0
oct 17 18:13:39 florencia kernel:  do_syscall_64+0x82/0x190
oct 17 18:13:39 florencia kernel:  ? path_openat+0xaaa/0x11f0
oct 17 18:13:39 florencia kernel:  ? __alloc_pages_noprof+0x184/0x350
oct 17 18:13:39 florencia kernel:  ? __mod_memcg_lruvec_state+0xa0/0x150
oct 17 18:13:39 florencia kernel:  ? __lruvec_stat_mod_folio+0x83/0xd0
oct 17 18:13:39 florencia kernel:  ? set_ptes.isra.0+0x41/0x90
oct 17 18:13:39 florencia kernel:  ? do_anonymous_page+0xfe/0x870
oct 17 18:13:39 florencia kernel:  ? __pte_offset_map+0x1b/0x180
oct 17 18:13:39 florencia kernel:  ? __handle_mm_fault+0xbe7/0x1030
oct 17 18:13:39 florencia kernel:  ? __count_memcg_events+0x58/0xf0
oct 17 18:13:39 florencia kernel:  ? count_memcg_events.constprop.0+0x1a/0x30
oct 17 18:13:39 florencia kernel:  ? handle_mm_fault+0x1bb/0x2c0
oct 17 18:13:39 florencia kernel:  ? do_user_addr_fault+0x36c/0x620
oct 17 18:13:39 florencia kernel:  ? exc_page_fault+0x81/0x190
oct 17 18:13:39 florencia kernel:  entry_SYSCALL_64_after_hwframe+0x76/0x7e
oct 17 18:13:39 florencia kernel: RIP: 0033:0x7bf4b3f737a4
oct 17 18:13:39 florencia kernel: Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d c5 28 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89
oct 17 18:13:39 florencia kernel: RSP: 002b:00007ffffed24018 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
oct 17 18:13:39 florencia kernel: RAX: ffffffffffffffda RBX: 0000000000000008 RCX: 00007bf4b3f737a4
oct 17 18:13:39 florencia kernel: RDX: 0000000000000008 RSI: 00005f3613cd0c40 RDI: 0000000000000001
oct 17 18:13:39 florencia kernel: RBP: 00007ffffed24040 R08: 0000000000000410 R09: 0000000000000001
oct 17 18:13:39 florencia kernel: R10: 0000000000000004 R11: 0000000000000202 R12: 0000000000000008
oct 17 18:13:39 florencia kernel: R13: 00005f3613cd0c40 R14: 00007bf4b404f5c0 R15: 00007bf4b404cea0
oct 17 18:13:39 florencia kernel:  </TASK>
oct 17 18:13:39 florencia kernel: ---[ end trace 0000000000000000 ]---
oct 17 18:13:39 florencia systemd[1]: nvidia-suspend.service: Deactivated successfully.
oct 17 18:13:39 florencia systemd[1]: Finished NVIDIA system suspend actions.
oct 17 18:13:39 florencia systemd[1]: nvidia-suspend.service: Consumed 514ms CPU time, 234.1M memory peak.
oct 17 18:13:39 florencia systemd[1]: Starting System Suspend...
oct 17 18:13:39 florencia systemd-sleep[9123]: User sessions remain unfrozen on explicit request ($SYSTEMD_SLEEP_FREEZE_USER_SESSIONS=0).
oct 17 18:13:39 florencia systemd-sleep[9123]: This is not recommended, and might result in unexpected behavior, particularly
oct 17 18:13:39 florencia systemd-sleep[9123]: in suspend-then-hibernate operations or setups with encrypted home directories.
oct 17 18:13:39 florencia systemd-sleep[9123]: Performing sleep operation 'suspend'...
oct 17 18:13:39 florencia kernel: PM: suspend entry (deep)

On an X11 session, the only thing I could do after suspend was reboot.