First, I would like to emphasize that “systemctl suspend” and “systemctl hibernate” work without issues. However, I have a problem resuming from “systemctl hybrid-sleep”. On resume, the display shows the same image as before suspending, but the system is completely unresponsive (although I can ssh in perfectly fine) and Xorg is spinning at 100% on one CPU thread.
The system is a Lenovo ThinkPad X1E Gen.1 with a GTX 1050 Ti Max-Q, running the 440.100 driver and the 5.7.7 kernel. It is set up with PRIME working (as per the instructions here on the forum).
The following troubleshooting was attempted:
Tried enabling and disabling the i915 module in initramfs
Tried running the Intel card with the “intel” and “modesetting” drivers
Tried running the NVIDIA driver with and without systemd power management
Tried boot parameters acpi_osi=! "acpi_osi=Windows 2015" and acpi_osi=Linux
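Since the quoting of "acpi_osi=Windows 2015" is easy to get wrong in a bootloader config, it may be worth confirming which acpi_osi values the running kernel actually received. Below is a minimal Python sketch, not something from the original setup. The helper name acpi_osi_values is hypothetical, and it assumes that shlex's double-quote handling is close enough to the kernel's own command-line parsing for this purpose.

```python
import shlex


def acpi_osi_values(cmdline):
    """Return every acpi_osi= value found in a kernel command line string."""
    values = []
    # shlex.split honours double quotes, so "acpi_osi=Windows 2015"
    # stays a single token instead of being split at the space.
    for token in shlex.split(cmdline):
        key, sep, value = token.partition("=")
        if key == "acpi_osi" and sep:
            values.append(value)
    return values


# Example with a made-up command line; on a real system, pass in the
# contents of /proc/cmdline instead.
example = 'root=/dev/sda1 rw acpi_osi=! "acpi_osi=Windows 2015"'
print(acpi_osi_values(example))
```

If the list comes back empty or contains only part of the string (for example just "Windows"), the parameter most likely did not survive the bootloader's quoting.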
Hi @henriker, did you ever solve this issue? I am running into something similar when resuming from hybrid-sleep. Sleep and regular hibernate work just fine.
Specs:
Arch Linux
Kernel 5.11.2
RTX 3090 with 460.56 drivers
Mar 05 20:52:52 benxiao-arch01 kernel: nvidia-modeset: ERROR: GPU:0: Failed detecting connected display devices
Mar 05 20:52:52 benxiao-arch01 kernel: nvidia-modeset: ERROR: GPU:0: Failed detecting connected display devices
Mar 05 20:52:52 benxiao-arch01 kernel: nvidia-modeset: WARNING: GPU:0: Failure processing EDID for display device Acer XB271HU (DP-2).
Mar 05 20:52:52 benxiao-arch01 upowerd[1487]: treating change event as add on /sys/devices/pci0000:00/0000:00:01.2/0000:01:00.0/0000:02:08.0/0000:09:00.3/usb3/3-4
Mar 05 20:52:52 benxiao-arch01 kernel: nvidia-modeset: WARNING: GPU:0: Unable to read EDID for display device Acer XB271HU (DP-2)
Mar 05 20:52:52 benxiao-arch01 kernel: nvidia-modeset: ERROR: GPU:0: Failure reading maximum pixel clock value for display device DP-2.
Mar 05 20:52:52 benxiao-arch01 kernel: BUG: kernel NULL pointer dereference, address: 0000000000000224
Mar 05 20:52:52 benxiao-arch01 kernel: #PF: supervisor read access in kernel mode
Mar 05 20:52:52 benxiao-arch01 kernel: #PF: error_code(0x0000) - not-present page
Mar 05 20:52:52 benxiao-arch01 kernel: PGD 0 P4D 0
Mar 05 20:52:52 benxiao-arch01 kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
Mar 05 20:52:52 benxiao-arch01 kernel: CPU: 9 PID: 990 Comm: Xorg Tainted: P OE 5.11.2-arch1-1 #1
Mar 05 20:52:52 benxiao-arch01 kernel: Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./X570 Taichi, BIOS P4.00 01/19/2021
Mar 05 20:52:52 benxiao-arch01 kernel: RIP: 0010:_nv002315kms+0x57/0xa0 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: Code: 28 00 48 89 46 18 48 8b 87 e8 05 00 00 48 8b 80 a8 01 00 00 48 85 c0 74 52 53 48 89 f3 e8 b1 54 56 c6 ba 04 00 00 00 48 63 ca <8b> 74 c8 08 85 f6 74 29 48 8d 04 c8 0f b7 48 08 66 89 4b 20 0f b7
Mar 05 20:52:52 benxiao-arch01 kernel: RSP: 0018:ffffadec01e97998 EFLAGS: 00010206
Mar 05 20:52:52 benxiao-arch01 kernel: RAX: 00000000000001fc RBX: ffffadec01e97c90 RCX: 0000000000000004
Mar 05 20:52:52 benxiao-arch01 kernel: RDX: 0000000000000004 RSI: ffffadec01e97c90 RDI: ffff9cf6aba63008
Mar 05 20:52:52 benxiao-arch01 kernel: RBP: ffffadec01e85008 R08: 0000000000000280 R09: ffffadec00715008
Mar 05 20:52:52 benxiao-arch01 kernel: R10: 0000000000000000 R11: 0000000000000001 R12: ffff9cf68b939c08
Mar 05 20:52:52 benxiao-arch01 kernel: R13: 0000000000000001 R14: ffff9cf6b7002814 R15: 0000000000000000
Mar 05 20:52:52 benxiao-arch01 kernel: FS: 00007f6ea29d3940(0000) GS:ffff9cfd9ec40000(0000) knlGS:0000000000000000
Mar 05 20:52:52 benxiao-arch01 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 05 20:52:52 benxiao-arch01 kernel: CR2: 0000000000000224 CR3: 0000000123876000 CR4: 0000000000350ee0
Mar 05 20:52:52 benxiao-arch01 kernel: Call Trace:
Mar 05 20:52:52 benxiao-arch01 kernel: ? _nv000076kms+0x118/0x170 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: ? _nv002327kms+0xb3/0x5b0 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: ? _nv002457kms+0x53/0x60 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: ? __d_alloc+0x22/0x1e0
Mar 05 20:52:52 benxiao-arch01 kernel: ? _nv000742kms+0xfc/0x370 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: ? _nv000742kms+0xbf/0x370 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: ? desc_read_finalized_seq+0x37/0x90
Mar 05 20:52:52 benxiao-arch01 kernel: ? _nv002771kms+0x433/0x600 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: ? _nv002771kms+0x3fc/0x600 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: ? _nv000742kms+0x40/0x40 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: ? nvkms_alloc+0x61/0x90 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: ? _nv000742kms+0x40/0x40 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: ? _nv000744kms+0x2a/0x40 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: ? nvKmsIoctl+0x96/0x1d0 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: ? nvkms_ioctl+0x10f/0x180 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: ? nvidia_frontend_unlocked_ioctl+0x37/0x50 [nvidia]
Mar 05 20:52:52 benxiao-arch01 kernel: ? __x64_sys_ioctl+0x83/0xb0
Mar 05 20:52:52 benxiao-arch01 kernel: ? do_syscall_64+0x33/0x40
Mar 05 20:52:52 benxiao-arch01 kernel: ? entry_SYSCALL_64_after_hwframe+0x44/0xa9
Mar 05 20:52:52 benxiao-arch01 kernel: Modules linked in: snd_seq_dummy snd_hrtimer snd_seq rfcomm ccm hid_logitech_hidpp mousedev joydev hid_logitech_dj uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_common videodev snd_usb_audio snd_usbmidi_lib ses snd_rawmidi enclosure snd_seq_device scsi_>
Mar 05 20:52:52 benxiao-arch01 kernel: xt_addrtype snd_compress xt_tcpudp kvm ac97_bus snd_pcm_dmaengine irqbypass snd_pcm drm_kms_helper crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel snd_timer wmi_bmof mxm_wmi crypto_simd cryptd glue_helper xt_conntrack cec snd rapl syscopyarea sysfillrect sysimg>
Mar 05 20:52:52 benxiao-arch01 kernel: CR2: 0000000000000224
Mar 05 20:52:52 benxiao-arch01 kernel: ---[ end trace 2fe4d965292e60bc ]---
Mar 05 20:52:52 benxiao-arch01 kernel: RIP: 0010:_nv002315kms+0x57/0xa0 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: Code: 28 00 48 89 46 18 48 8b 87 e8 05 00 00 48 8b 80 a8 01 00 00 48 85 c0 74 52 53 48 89 f3 e8 b1 54 56 c6 ba 04 00 00 00 48 63 ca <8b> 74 c8 08 85 f6 74 29 48 8d 04 c8 0f b7 48 08 66 89 4b 20 0f b7
Mar 05 20:52:52 benxiao-arch01 kernel: RSP: 0018:ffffadec01e97998 EFLAGS: 00010206
Mar 05 20:52:52 benxiao-arch01 kernel: RAX: 00000000000001fc RBX: ffffadec01e97c90 RCX: 0000000000000004
Mar 05 20:52:52 benxiao-arch01 kernel: RDX: 0000000000000004 RSI: ffffadec01e97c90 RDI: ffff9cf6aba63008
Mar 05 20:52:52 benxiao-arch01 kernel: RBP: ffffadec01e85008 R08: 0000000000000280 R09: ffffadec00715008
Mar 05 20:52:52 benxiao-arch01 kernel: R10: 0000000000000000 R11: 0000000000000001 R12: ffff9cf68b939c08
Mar 05 20:52:52 benxiao-arch01 kernel: R13: 0000000000000001 R14: ffff9cf6b7002814 R15: 0000000000000000
Mar 05 20:52:52 benxiao-arch01 kernel: FS: 00007f6ea29d3940(0000) GS:ffff9cfd9ec40000(0000) knlGS:0000000000000000
Mar 05 20:52:52 benxiao-arch01 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 05 20:52:52 benxiao-arch01 kernel: CR2: 0000000000000224 CR3: 0000000123876000 CR4: 0000000000350ee0
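For anyone comparing traces across reports, here is a minimal Python sketch that pulls the headline facts (faulting address, process, and the symbol and module at RIP) out of a journal excerpt like the one above. The function name summarize_oops and the regular expressions are my own assumptions, written against the line formats visible in this log; other kernel versions may format these lines differently.

```python
import re


def summarize_oops(log_text):
    """Extract a few headline facts from a kernel oops in journal output."""
    patterns = {
        # Address the kernel tried to read when it faulted.
        "fault_address": r"NULL pointer dereference, address: ([0-9a-f]+)",
        # Name of the process that was running (the Comm: field).
        "process": r"Comm: (\S+)",
        # Symbol and module named on the RIP line.
        "rip_symbol": r"RIP: [0-9a-f]+:(\S+?)\+0x",
        "rip_module": r"RIP: [0-9a-f]+:\S+ \[(\w+)\]",
    }
    summary = {}
    for name, pattern in patterns.items():
        match = re.search(pattern, log_text)
        # Fields that do not appear in the text are reported as None.
        summary[name] = match.group(1) if match else None
    return summary
```

Feeding it the output of journalctl -k for the affected boot should make it quick to see whether two reports crash in the same module.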
I have the exact same issue when resuming from hybrid-sleep: the X server process takes 100% CPU and can’t be killed, even with SIGKILL. Restarting the display manager from an ssh session works, but only after a long timeout for the systemd unit. I didn’t see anything suspicious in the kernel log, though. The only error I have is in Xorg.0.log, which says
(EE) NVIDIA(GPU-0): Failed to query object info.
systemctl suspend and systemctl hibernate work just fine.
My specs:
Arch Linux with 5.11.6 kernel
GTX 1650 Ti GPU with driver 460.56