System crashes when playing (Xid: 8, 13, 16, 79)

nvidia-bug-report.log.gz (858.9 KB)

Recently I have been getting complete system crashes when playing certain games. So far it has happened in Path of Exile (both the Steam and the standalone version) and in Vampire Survivors. I have not noticed it in other games.

[Bug]: Without warning, all monitors freeze. Sound keeps playing for a short while. The external monitor flickers, then either it alone or both it and the laptop display go black, and the external monitor shows as disconnected. Finally the sound stops, the system is completely frozen, and it requires a reset.

[Expected behaviour]: The games keep running as usual.

[Steps to reproduce]: Open one of the games and play for longer than 5 minutes.

[Attempts to fix]: I have changed the in-game graphics settings and tried with Gamemode both running and not running. I have tried the games with Vulkan as well as DX11/DX12. I have tried fullscreen, borderless window, and windowed mode. I have tried with only the game open, as well as with other applications running. I can’t find a common denominator for when or how the crashes happen. I asked on Reddit and was recommended to come here with some unusual Xid logs:

May 27 16:28:43 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 8, pid=36496, name=PathOfExileStea, Channel 00000036
May 28 11:54:52 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6de
May 28 11:55:05 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6df
May 28 11:55:13 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6e0
May 28 11:55:21 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6e1
May 28 11:55:33 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6e2
May 28 11:55:46 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6e3
May 28 11:55:58 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6e4
May 28 11:56:13 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6e5
May 28 11:56:25 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6e6
May 28 11:56:43 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6e7
May 28 11:56:58 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6e8
May 28 11:57:17 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6e9
May 28 11:57:35 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6ea
May 28 11:57:46 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6eb
May 28 11:58:00 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6ec
May 28 11:58:10 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6ed
May 28 12:33:33 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 79, pid='<unknown>', name=<unknown>, GPU has fallen off the bus.

May 29 15:57:32 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 13, pid='<unknown>', name=<unknown>, Graphics Exception on GPC 0: SAVE_RESTORE_ADDR_OOB
May 29 15:57:32 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 13, pid='<unknown>', name=<unknown>, Graphics Exception: ESR 0x500900=0x80000001
May 29 15:57:32 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 13, pid='<unknown>', name=<unknown>, Graphics Exception on GPC 1: SAVE_RESTORE_ADDR_OOB
May 29 15:57:32 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 13, pid='<unknown>', name=<unknown>, Graphics Exception: ESR 0x508900=0x80000001
May 29 15:57:32 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 13, pid='<unknown>', name=<unknown>, Graphics Exception on GPC 2: SAVE_RESTORE_ADDR_OOB
May 29 15:57:32 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 13, pid='<unknown>', name=<unknown>, Graphics Exception: ESR 0x510900=0x80000001
May 29 15:57:32 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=19465, name=PathOfExile.exe, Graphics Exception: ChID 0027, Class 0000b197, Offset 000034a8, Data 80000000

The concerns raised were the ‘GPU has fallen off the bus’ message (Xid 79), as well as the many entries with unknown pid and name.
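To make the log easier to skim, here is a minimal Python sketch that tallies the Xid events by code and process name. The function name `summarize_xids`, the regular expression, and the `SAMPLE` list are my own assumptions based only on the line format shown above; this is not part of any NVIDIA tooling.

```python
import re
from collections import Counter

# Matches kernel log lines of the form seen above:
#   NVRM: Xid (PCI:0000:01:00): 79, pid='<unknown>', name=<unknown>, ...
# The pattern is an assumption derived from those lines only.
XID_RE = re.compile(
    r"NVRM: Xid \((?P<pci>PCI:[0-9a-fA-F:.]+)\): (?P<code>\d+), "
    r"pid=(?P<pid>[^,]+), name=(?P<name>[^,]+)"
)


def summarize_xids(lines):
    """Count Xid events per (Xid code, process name) pair."""
    counts = Counter()
    for line in lines:
        match = XID_RE.search(line)
        if match:
            key = (int(match.group("code")), match.group("name").strip())
            counts[key] += 1
    return counts


# A few lines copied from the log above, used as sample input.
SAMPLE = [
    "May 27 16:28:43 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 8, pid=36496, name=PathOfExileStea, Channel 00000036",
    "May 28 11:54:52 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 16, pid='<unknown>', name=<unknown>, Head 00000003 Count 0000e6de",
    "May 28 12:33:33 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 79, pid='<unknown>', name=<unknown>, GPU has fallen off the bus.",
    "May 29 15:57:32 berry-garden kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=19465, name=PathOfExile.exe, Graphics Exception: ChID 0027, Class 0000b197, Offset 000034a8, Data 80000000",
]

summary = summarize_xids(SAMPLE)
for (code, name), count in sorted(summary.items()):
    print(f"Xid {code:>3}  {name:<20} x{count}")
```

The function accepts any iterable of lines, so it could also be fed `sys.stdin` when piping in the output of `journalctl -k`, assuming the journal lines have the same format as the ones pasted here.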

OS: EndeavourOS
CPU: i7-4710HQ
GPU: GTX 980M
RAM: 24 GB

Any ideas?