Issues resuming from hybrid-sleep

Hi,

First, I would like to emphasize that “systemctl suspend” and “systemctl hibernate” work without issues. However, I have an issue resuming from “systemctl hybrid-sleep”. When resuming, the display shows the same image as before suspending, but the system is completely unresponsive (although I can ssh in perfectly fine) and Xorg is spinning at 100% on one CPU thread.

dmesg shows the following error:

nvidia-modeset: ERROR: GPU:0: Failed detecting connected display devices

Xorg.log has the following:

[ 417.567] (II) modeset(0): EDID vendor "AUO", prod id 13035
[ 417.567] (II) modeset(0): Printing DDC gathered Modelines:
[ 417.567] (II) modeset(0): Modeline "3840x2160"x0.0 533.30 3840 3888 3920 4000 2160 2163 2168 2222 -hsync -vsync (133.3 kHz eP)
[ 417.583] (WW) NVIDIA(GPU-0): Failed to enter interactive mode.
[ 417.583] (--) NVIDIA(GPU-0): DFP-0: disconnected
[ 417.583] (--) NVIDIA(GPU-0): DFP-0: Internal DisplayPort
[ 417.583] (--) NVIDIA(GPU-0): DFP-0: 1440.0 MHz maximum pixel clock
[ 417.583] (--) NVIDIA(GPU-0):
[ 417.583] (--) NVIDIA(GPU-0): DFP-1: disconnected
[ 417.583] (--) NVIDIA(GPU-0): DFP-1: Internal DisplayPort
[ 417.583] (--) NVIDIA(GPU-0): DFP-1: 1440.0 MHz maximum pixel clock
[ 417.583] (--) NVIDIA(GPU-0):
[ 417.583] (--) NVIDIA(GPU-0): DFP-2: disconnected
[ 417.583] (--) NVIDIA(GPU-0): DFP-2: Internal TMDS
[ 417.583] (--) NVIDIA(GPU-0): DFP-2: 165.0 MHz maximum pixel clock
[ 417.583] (--) NVIDIA(GPU-0):
[ 417.583] (EE) NVIDIA(GPU-0): Failed to query object info.
[ 417.583] (EE) NVIDIA(GPU-0): Failed to query object info.
[ 417.590] (EE) NVIDIA(GPU-0): Failed to query object info.

The system is a Lenovo ThinkPad X1E Gen.1 with a GTX 1050 Ti Max-Q, running the 440.100 driver and the 5.7.7 kernel. It is set up with Prime working (as per the instructions here on the forum).

The following troubleshooting has been attempted:
Tried enabling and disabling the i915 module in initramfs
Tried running the Intel card with the “intel” and “modesetting” drivers
Tried running nvidia with and without systemd power management
Tried boot parameters acpi_osi=! "acpi_osi=Windows 2015" and acpi_osi=Linux

I’m at a loss so any help is greatly appreciated.

Hi @henriker, did you ever solve this issue? I am running into something similar when resuming from hybrid-sleep. Sleep and regular hibernate work just fine.

Specs:
Arch Linux
Kernel 5.11.2
RTX 3090 with 460.56 drivers

Mar 05 20:52:52 benxiao-arch01 kernel: nvidia-modeset: ERROR: GPU:0: Failed detecting connected display devices
Mar 05 20:52:52 benxiao-arch01 kernel: nvidia-modeset: ERROR: GPU:0: Failed detecting connected display devices
Mar 05 20:52:52 benxiao-arch01 kernel: nvidia-modeset: WARNING: GPU:0: Failure processing EDID for display device Acer XB271HU (DP-2).
Mar 05 20:52:52 benxiao-arch01 upowerd[1487]: treating change event as add on /sys/devices/pci0000:00/0000:00:01.2/0000:01:00.0/0000:02:08.0/0000:09:00.3/usb3/3-4
Mar 05 20:52:52 benxiao-arch01 kernel: nvidia-modeset: WARNING: GPU:0: Unable to read EDID for display device Acer XB271HU (DP-2)
Mar 05 20:52:52 benxiao-arch01 kernel: nvidia-modeset: ERROR: GPU:0: Failure reading maximum pixel clock value for display device DP-2.
Mar 05 20:52:52 benxiao-arch01 kernel: BUG: kernel NULL pointer dereference, address: 0000000000000224
Mar 05 20:52:52 benxiao-arch01 kernel: #PF: supervisor read access in kernel mode
Mar 05 20:52:52 benxiao-arch01 kernel: #PF: error_code(0x0000) - not-present page
Mar 05 20:52:52 benxiao-arch01 kernel: PGD 0 P4D 0 
Mar 05 20:52:52 benxiao-arch01 kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
Mar 05 20:52:52 benxiao-arch01 kernel: CPU: 9 PID: 990 Comm: Xorg Tainted: P           OE     5.11.2-arch1-1 #1
Mar 05 20:52:52 benxiao-arch01 kernel: Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./X570 Taichi, BIOS P4.00 01/19/2021
Mar 05 20:52:52 benxiao-arch01 kernel: RIP: 0010:_nv002315kms+0x57/0xa0 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: Code: 28 00 48 89 46 18 48 8b 87 e8 05 00 00 48 8b 80 a8 01 00 00 48 85 c0 74 52 53 48 89 f3 e8 b1 54 56 c6 ba 04 00 00 00 48 63 ca <8b> 74 c8 08 85 f6 74 29 48 8d 04 c8 0f b7 48 08 66 89 4b 20 0f b7
Mar 05 20:52:52 benxiao-arch01 kernel: RSP: 0018:ffffadec01e97998 EFLAGS: 00010206
Mar 05 20:52:52 benxiao-arch01 kernel: RAX: 00000000000001fc RBX: ffffadec01e97c90 RCX: 0000000000000004
Mar 05 20:52:52 benxiao-arch01 kernel: RDX: 0000000000000004 RSI: ffffadec01e97c90 RDI: ffff9cf6aba63008
Mar 05 20:52:52 benxiao-arch01 kernel: RBP: ffffadec01e85008 R08: 0000000000000280 R09: ffffadec00715008
Mar 05 20:52:52 benxiao-arch01 kernel: R10: 0000000000000000 R11: 0000000000000001 R12: ffff9cf68b939c08
Mar 05 20:52:52 benxiao-arch01 kernel: R13: 0000000000000001 R14: ffff9cf6b7002814 R15: 0000000000000000
Mar 05 20:52:52 benxiao-arch01 kernel: FS:  00007f6ea29d3940(0000) GS:ffff9cfd9ec40000(0000) knlGS:0000000000000000
Mar 05 20:52:52 benxiao-arch01 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 05 20:52:52 benxiao-arch01 kernel: CR2: 0000000000000224 CR3: 0000000123876000 CR4: 0000000000350ee0
Mar 05 20:52:52 benxiao-arch01 kernel: Call Trace:
Mar 05 20:52:52 benxiao-arch01 kernel:  ? _nv000076kms+0x118/0x170 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel:  ? _nv002327kms+0xb3/0x5b0 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel:  ? _nv002457kms+0x53/0x60 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel:  ? __d_alloc+0x22/0x1e0
Mar 05 20:52:52 benxiao-arch01 kernel:  ? _nv000742kms+0xfc/0x370 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel:  ? _nv000742kms+0xbf/0x370 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel:  ? desc_read_finalized_seq+0x37/0x90
Mar 05 20:52:52 benxiao-arch01 kernel:  ? _nv002771kms+0x433/0x600 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel:  ? _nv002771kms+0x3fc/0x600 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel:  ? _nv000742kms+0x40/0x40 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel:  ? nvkms_alloc+0x61/0x90 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel:  ? _nv000742kms+0x40/0x40 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel:  ? _nv000744kms+0x2a/0x40 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel:  ? nvKmsIoctl+0x96/0x1d0 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel:  ? nvkms_ioctl+0x10f/0x180 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel:  ? nvidia_frontend_unlocked_ioctl+0x37/0x50 [nvidia]
Mar 05 20:52:52 benxiao-arch01 kernel:  ? __x64_sys_ioctl+0x83/0xb0
Mar 05 20:52:52 benxiao-arch01 kernel:  ? do_syscall_64+0x33/0x40
Mar 05 20:52:52 benxiao-arch01 kernel:  ? entry_SYSCALL_64_after_hwframe+0x44/0xa9
Mar 05 20:52:52 benxiao-arch01 kernel: Modules linked in: snd_seq_dummy snd_hrtimer snd_seq rfcomm ccm hid_logitech_hidpp mousedev joydev hid_logitech_dj uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_common videodev snd_usb_audio snd_usbmidi_lib ses snd_rawmidi enclosure snd_seq_device scsi_>
Mar 05 20:52:52 benxiao-arch01 kernel:  xt_addrtype snd_compress xt_tcpudp kvm ac97_bus snd_pcm_dmaengine irqbypass snd_pcm drm_kms_helper crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel snd_timer wmi_bmof mxm_wmi crypto_simd cryptd glue_helper xt_conntrack cec snd rapl syscopyarea sysfillrect sysimg>
Mar 05 20:52:52 benxiao-arch01 kernel: CR2: 0000000000000224
Mar 05 20:52:52 benxiao-arch01 kernel: ---[ end trace 2fe4d965292e60bc ]---
Mar 05 20:52:52 benxiao-arch01 kernel: RIP: 0010:_nv002315kms+0x57/0xa0 [nvidia_modeset]
Mar 05 20:52:52 benxiao-arch01 kernel: Code: 28 00 48 89 46 18 48 8b 87 e8 05 00 00 48 8b 80 a8 01 00 00 48 85 c0 74 52 53 48 89 f3 e8 b1 54 56 c6 ba 04 00 00 00 48 63 ca <8b> 74 c8 08 85 f6 74 29 48 8d 04 c8 0f b7 48 08 66 89 4b 20 0f b7
Mar 05 20:52:52 benxiao-arch01 kernel: RSP: 0018:ffffadec01e97998 EFLAGS: 00010206
Mar 05 20:52:52 benxiao-arch01 kernel: RAX: 00000000000001fc RBX: ffffadec01e97c90 RCX: 0000000000000004
Mar 05 20:52:52 benxiao-arch01 kernel: RDX: 0000000000000004 RSI: ffffadec01e97c90 RDI: ffff9cf6aba63008
Mar 05 20:52:52 benxiao-arch01 kernel: RBP: ffffadec01e85008 R08: 0000000000000280 R09: ffffadec00715008
Mar 05 20:52:52 benxiao-arch01 kernel: R10: 0000000000000000 R11: 0000000000000001 R12: ffff9cf68b939c08
Mar 05 20:52:52 benxiao-arch01 kernel: R13: 0000000000000001 R14: ffff9cf6b7002814 R15: 0000000000000000
Mar 05 20:52:52 benxiao-arch01 kernel: FS:  00007f6ea29d3940(0000) GS:ffff9cfd9ec40000(0000) knlGS:0000000000000000
Mar 05 20:52:52 benxiao-arch01 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 05 20:52:52 benxiao-arch01 kernel: CR2: 0000000000000224 CR3: 0000000123876000 CR4: 0000000000350ee0
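
For anyone who wants to compare their own logs against the ones posted in this thread, here is a minimal sketch in Python that pulls the nvidia-modeset messages and the oops location out of saved journal text. The function name `summarize` and the two regular expressions are my own assumptions, written only against the line formats quoted above; other driver versions may word things differently.

```python
import re

# Both patterns are assumptions based on the log lines quoted in this thread.
MODESET = re.compile(r"nvidia-modeset: (ERROR|WARNING): (.*)")
OOPS_RIP = re.compile(r"RIP: \d+:(\S+) \[(\w+)\]")


def summarize(lines):
    """Return (modeset_messages, oops_locations) found in journal lines.

    modeset_messages: list of (level, message) tuples, in log order.
    oops_locations:   list of unique (symbol+offset, module) tuples.
    """
    messages, rips = [], []
    for line in lines:
        m = MODESET.search(line)
        if m:
            messages.append((m.group(1), m.group(2).strip()))
            continue
        r = OOPS_RIP.search(line)
        if r:
            location = (r.group(1), r.group(2))
            # The kernel prints the RIP line twice per oops; keep it once.
            if location not in rips:
                rips.append(location)
    return messages, rips
```

To use it, save the kernel log to a text file (the file name is up to you) and pass the open file to `summarize`. If the second list is empty, the driver reported errors without a kernel oops, which seems to be the case in some of the reports here.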

I have the exact same issue when resuming from hybrid-sleep: X server process takes 100% CPU and can’t be killed even with SIGKILL. Restarting the display manager from an ssh session works after a long timeout for the systemd unit. I didn’t get anything fishy in the kernel log, though. The only error I have is in the Xorg.0.log, which says

(EE) NVIDIA(GPU-0): Failed to query object info.

systemd suspend and systemd hibernate work just fine.

My spec:
Arch Linux with 5.11.6 kernel
GPU 1650Ti with driver 460.56

Hi,

Same problem here, same log entries as OP.

Lenovo Ideapad Z500, kernel 5.10.27, Nvidia drivers 460.xx.

Can someone from Nvidia please help?

Thanks

I have the same issue.

Device: MSI GL63-8SE
Kernel: 5.12.18-1-ck-skylake
Driver: nvidia-dkms-470.63.01
GPU: RTX 2060 Mobile

If someone at Nvidia can help, it would be greatly appreciated.

Thanks in advance.

Just confirming that I have the same issue. Tried modesetting driver, changing acpi into firmware, and also the solution from “Systemd's suspend-then-hibernate not working in NVIDIA Optimus laptop” (#4 by charles15). Nothing worked. Hibernation and sleep work just fine.

Kernel: Linux 6.0.11-arch1-1 x86_64
GPU: RTX 3070
Nvidia driver: 525.60.11-2