Hi,
I’m experiencing full system hangs when running demanding titles via Steam/Proton on my RTX PRO 6000 Blackwell Max-Q. This has happened across multiple games including Hogwarts Legacy (DX12), Cyberpunk 2077 (DX12), Total War: Warhammer III (DX11/Vulkan), and other GPU-intensive titles. The symptom is always the same: the display freezes completely while audio continues playing. The Wayland compositor becomes unresponsive, TTY switching is blocked, and only a hard reboot recovers the system.
Looking at the kernel log from the previous boot, the crash follows a pattern of Xid 32 errors during gameplay, eventually culminating in a fatal Xid 56 followed by a DRM flip event timeout:
[13132.342106] NVRM: Xid (PCI:0000:01:00): 32, pid=27514, name=HogwartsLegacy., channel 0x00000014 intr0 00040000
[13132.355013] NVRM: Xid (PCI:0000:01:00): 32, pid=27514, name=HogwartsLegacy., channel 0x00000014 intr0 00040000
[13641.741034] NVRM: Xid (PCI:0000:01:00): 32, pid=30738, name=HogwartsLegacy., channel 0x00000014 intr0 00040000
[13641.753002] NVRM: Xid (PCI:0000:01:00): 32, pid=30738, name=HogwartsLegacy., channel 0x00000014 intr0 00040000
[14901.195861] NVRM: Xid (PCI:0000:01:00): 56, CMDre 00000007 00000000 00000000 00000001 00000000
[14920.411036] [drm:nv_drm_atomic_commit [nvidia_drm]] *ERROR* [nvidia-drm] [GPU ID 0x00000100] Flip event timeout on head 0
The Xid 32 errors occur in pairs during gameplay across two separate game launches. About 21 minutes after the last Xid 32, the Xid 56 fires, and 19 seconds later the DRM flip timeout locks up all display output.
How to reproduce:
- Boot with
nvidia_drm.modeset=1 nvidia_drm.fbdev=1 pcie_aspm=off - Start a Wayland session (niri compositor) on a 4K 240 Hz display via DisplayPort
- Launch any demanding Proton title (Hogwarts Legacy, Cyberpunk 2077, Total War: Warhammer III, etc.)
- Play for some minutes until the screen freezes
- System is fully unresponsive — no TTY, no recovery without hard reboot
The kernel log above was captured during a Hogwarts Legacy session, but the same crash pattern has occurred with Cyberpunk 2077, Total War: Warhammer III, and other demanding titles. It does not appear to be game-specific — rather it seems to affect any GPU-intensive Proton workload.
One thing to note: during the first game launch in this particular session, a vLLM inference server was consuming ~87 GB of the 96 GB VRAM, so the initial Xid 32 errors may be related to memory pressure. However, the server was stopped before the final crash, leaving ~95 GB free. The fatal Xid 56 occurred with plenty of available VRAM. The crashes with other titles listed above happened without any VRAM contention.
System details:
- GPU: NVIDIA RTX PRO 6000 Blackwell Max-Q Workstation Edition (GB202GL, rev a1)
- VBIOS: 98.02.6A.00.03
- Driver: 595.58.03 (Open Kernel Module)
- CUDA: 13.2
- OS: Arch Linux (rolling)
- Kernel: 6.19.10-1-cachyos (PREEMPT_DYNAMIC, clang 22.1.1)
- CPU: AMD Ryzen 7 9800X3D
- RAM: 32 GB
- Display: ASUS ROG Swift PG32UCDMR, 3840x2160 @ 240 Hz, DisplayPort, scale 2.0
- Compositor: niri (Wayland, no X server)
- Proton: tested with proton-cachyos 10.0-20260324 and GE-Proton10-34 (Wine 10.0 Staging)
- Affected games: Hogwarts Legacy (DX12), Cyberpunk 2077 (DX12), Total War: Warhammer III (DX11/Vulkan), and other demanding titles — all via VKD3D-Proton/DXVK
The nvidia-bug-report.log.gz was generated after reboot since the crash required a hard power cycle. It is attached below.
Thanks for looking into this.
nvidia-bug-report.log.gz (752.9 KB)