It used to fine to suspend and resume daily. I tried to revert kernel also driver but no good result, suspend state go well but resume show black screen. Black mean monitor get no signal, not black with emit background light. In resume state, cannot move to any other tty, but i can reboot by REISUB. Make sure that nvidia-suspend, resume service enabled in preset and session. Now on kernel 6.14.2-1-liquorix-amd64, driver 570.144, initrd module signed by initramfs-tools. Simple kernel log show error when resume from suspend:
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: AER: Multiple Uncorrectable (Non-Fatal) error message received from 0000:00:02.0
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: PCIe Bus Error: severity=Uncorrectable (Non-Fatal), type=Transaction Layer, (Requester ID)
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: device [8086:2f04] error status/mask=00004000/00000000
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: [14] CmpltTO (First)
May 16 13:07:07 Debian6 kernel: nvidia 0000:03:00.0: AER: can't recover (no error_detected callback)
May 16 13:07:07 Debian6 kernel: snd_hda_intel 0000:03:00.1: AER: can't recover (no error_detected callback)
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: AER: device recovery failed
May 16 13:07:07 Debian6 flatpak[21983]: [OPVVZ] 2025/05/16 13:07:07 INFO: Exiting
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: AER: Uncorrectable (Non-Fatal) error message received from 0000:00:02.0
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: PCIe Bus Error: severity=Uncorrectable (Non-Fatal), type=Transaction Layer, (Requester ID)
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: device [8086:2f04] error status/mask=00004000/00000000
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: [14] CmpltTO (First)
May 16 13:07:07 Debian6 kernel: nvidia 0000:03:00.0: AER: can't recover (no error_detected callback)
May 16 13:07:07 Debian6 kernel: snd_hda_intel 0000:03:00.1: AER: can't recover (no error_detected callback)
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: AER: device recovery failed
May 16 13:07:07 Debian6 kernel: NVRM: GPU at PCI:0000:03:00: GPU-1f2b10aa-227e-690f-c37d-2369dd14d913
May 16 13:07:07 Debian6 kernel: NVRM: Xid (PCI:0000:03:00): 79, GPU has fallen off the bus.
May 16 13:07:07 Debian6 kernel: NVRM: GPU 0000:03:00.0: GPU has fallen off the bus.
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: AER: Uncorrectable (Non-Fatal) error message received from 0000:00:02.0
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: PCIe Bus Error: severity=Uncorrectable (Non-Fatal), type=Transaction Layer, (Requester ID)
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: device [8086:2f04] error status/mask=00004000/00000000
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: [14] CmpltTO (First)
May 16 13:07:07 Debian6 kernel: nvidia 0000:03:00.0: AER: can't recover (no error_detected callback)
May 16 13:07:07 Debian6 kernel: snd_hda_intel 0000:03:00.1: AER: can't recover (no error_detected callback)
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: AER: device recovery failed
May 16 13:07:07 Debian6 flatpak[21983]: [monitor] 2025/05/16 13:07:07 INFO: Syncthing exited: exit status 1
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: AER: Uncorrectable (Non-Fatal) error message received from 0000:00:02.0
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: PCIe Bus Error: severity=Uncorrectable (Non-Fatal), type=Transaction Layer, (Requester ID)
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: device [8086:2f04] error status/mask=00004000/00000000
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: [14] CmpltTO (First)
May 16 13:07:07 Debian6 kernel: nvidia 0000:03:00.0: AER: can't recover (no error_detected callback)
May 16 13:07:07 Debian6 kernel: snd_hda_intel 0000:03:00.1: AER: can't recover (no error_detected callback)
May 16 13:07:07 Debian6 kernel: pcieport 0000:00:02.0: AER: device recovery failed
May 16 13:07:08 Debian6 flatpak[21983]: [monitor] 2025/05/16 13:07:08 WARNING: 4 restarts in 40.680352234s; not retrying further
lines 4148-4182/4195 100%
I was really mad and tried 3-4 kernels version, 3 driver 570.xxx, nothing worked for me, at least in Debian Sid. With many disks, i tried swap root disk with Fedora, Gentoo, Arch, Alpine, not tried with Windows, but only single disk Fedora 41 worked as intended. I really have no idea what did Fedora dev cooked, it’s not even latest version. All test were on Gnome, both X11 and Wayland. However note that I cannot get that log error Xid 79 on X11 session, but visual result is the same. My genuine guess is that only fedora tweaked systemd-sleep somehow but i’m not sure.
Here’s bug report on Debian system:
[nvidia-bug-report.log.gz|attachment]
(upload://42MI5btDHGw1xcghabpCaOYECpc.gz) (1.8 MB)
Params for anyone fast look
CreateImexChannel0: 0
DeviceFileGID: 0
DeviceFileMode: 438
DeviceFileUID: 0
DmaRemapPeerMmio: 1
DynamicPowerManagement: 3
DynamicPowerManagementVideoMemoryThreshold: 200
EnableDbgBreakpoint: 0
EnableGpuFirmware: 0
EnableGpuFirmwareLogs: 2
EnableMSI: 1
EnablePCIeGen3: 0
EnablePCIERelaxedOrderingMode: 0
EnableResizableBar: 0
EnableS0ixPowerManagement: 0
EnableStreamMemOPs: 0
EnableUserNUMAManagement: 1
ExcludedGpus: ""
GpuBlacklist: ""
GrdmaPciTopoCheckOverride: 0
IgnoreMMIOCheck: 0
ImexChannelCount: 2048
InitializeSystemMemoryAllocations: 1
KMallocHeapMaxSize: 0
MemoryPoolSize: 0
ModifyDeviceFiles: 1
NvLinkDisable: 0
OpenRmEnableUnsupportedGpus: 1
PreserveVideoMemoryAllocations: 1
RegisterPCIDriver: 1
RegistryDwords: ""
RegistryDwordsPerDevice: ""
ResmanDebugLevel: 4294967295
RmLogonRC: 1
RmMsg: ""
RmNvlinkBandwidthLinkCount: 0
RmProfilingAdminOnly: 1
S0ixPowerManagementVideoMemoryThreshold: 256
TemporaryFilePath: ""
UsePageAttributeTable: 4294967295
VMallocHeapMaxSize: 0
Anyone please give me a light how to make suspend back to “normal” on this system. My pc was almost burned after alot of reboot suspend reboot suspend