Xid109 CTX SWITCH TIMEOUT Driver Crashes In Many Applications

Cannot use Linux for anything GPU heavy reliably in the last month or so… I have met many other users facing same issue and would like to bring it to light

Example of errors, always Xid 109:
NVRM: Xid (PCI:0000:01:00): 109, pid=168149, name=r5apex_dx12.exe, Ch 00000076, errorString CTX SWITCH TIMEOUT, Info 0x3c046
NVRM: Xid (PCI:0000:01:00): 109, pid=23382, name=cs2, Ch 000000b6, errorString CTX SWITCH TIMEOUT, Info 0x25c05d
NVRM: Xid (PCI:0000:01:00): 109, pid=‘’, name=, Ch 000000a6, errorString CTX SWITCH TIMEOUT, Info 0x26c058

Can consistently reproduce by playing ~1-2 games of CS2 Arms Race, the map Baggage will crash 90% of the time mid-game after a few minutes. Also has occured in compute heavy AI stuff, and in games like Apex Legends running through proton (interestingly, once Apex crashes after 10-45 mins, the game will not run for longer than 5 without another Xid 109 happening). Occasionally X11/KDE Plasma won’t recover from the crash and a full hard reboot on crash is required. This is so consistent that I can reboot, open nothing but Steam/Counter Strike 2, and have the game crash with Xid109 within 10 minutes, so testing fixes is easy.

Attempts to Debug:
-Went back to various kernel versions, that were stable for GPU usage when I used them last
-Tried 545.29.06, the beta 550.40.07, and the latest Vulkan Dev driver ( 535.43.09)
-Ensured things like power management, ReBar, etc. had no effect on reproducing the issue
-Had a friend with a 3060ti and near identical arch install (besides a Ryzen vs. my Intel, everything like driver version, graphics settings, resolution, vulkan/mesa stuff, and kernel were all the same between us) try to reproduce, and they could not
-Discussed with others also having the issue, they have tried countless other kernels, and have a variety of platforms that also are affected (AMD Ryzen, 40xx series as well, etc.), so my specific hardware is not the culprit
-Ensured my GPU is stable and in fully functional condition (passed GPU memory stress test with flying colors, can run heavy loads all night in Windows , ran stress tests, etc.)

Description of Crash
When the crash happens the screen freezes but audio, etc. continues to play in the background, and it takes ~15 seconds for the system to recover enough to alt-tab or switch terminals most of the time, with a hard (reset button) restart required occasionally. Sometimes in Proton apps the screen will freeze, then render a few frames after a few seconds, then freeze again, always with Xid 109 in dmesg after the crash. This happens independent of whether an app is run with DX11 or DX12 in Proton (all dxvk in the end), and with native Vulkan games like CS2. I have only had it happen during CUDA loads a few times but have not recently done any work with compute lately.

Bug report attached! I ran the bug tool immediately after reproducing the crash issue.
nvidia-bug-report.log.gz (937.6 KB)

I would really like to use my GPU again, so anything else I can do to help solve this would be greatly appreciated. I know there is a similar thread for this, however it is two years old and lacking any updates for this issue that renders Linux useless for the majority of my work and leisure activities.

Because I can consistently and quickly reproduce the crash, hopefully I can be of assistance pinpointing this issue, I am experienced with lowlevel debugging if I can get any dumps etc. that might help?

System info:

Arch Linux kernel 6.7.5, (other 6.6.x kernels also cause issue)
Nvidia Driver v.545.29.06 (other drivers also cause issue)
Plasma 5.27.10 through KWin
i7-12700k,
RTX 3090
MSI Z690A, 32gb DDR5,

cat /proc/cmdline                                                                                                                                                                                                                                                                                       ~
BOOT_IMAGE=/boot/vmlinuz-linux root=UUID=c1c6146b-63dc-46ff-84f3-e7661fed204d rw quiet loglevel=3 ibt=off split_lock_detect=off nvidia_drm.modeset=1

cat /proc/driver/nvidia/params                                                                                                                                                                                                                                                                           ~
ResmanDebugLevel: 4294967295
RmLogonRC: 1
ModifyDeviceFiles: 1
DeviceFileUID: 0
DeviceFileGID: 0
DeviceFileMode: 438
InitializeSystemMemoryAllocations: 1
UsePageAttributeTable: 1
EnableMSI: 1
EnablePCIeGen3: 0
MemoryPoolSize: 0
KMallocHeapMaxSize: 0
VMallocHeapMaxSize: 0
IgnoreMMIOCheck: 0
TCEBypassMode: 0
EnableStreamMemOPs: 0
EnableUserNUMAManagement: 1
NvLinkDisable: 0
RmProfilingAdminOnly: 1
PreserveVideoMemoryAllocations: 0
EnableS0ixPowerManagement: 0
S0ixPowerManagementVideoMemoryThreshold: 256
DynamicPowerManagement: 3
DynamicPowerManagementVideoMemoryThreshold: 200
RegisterPCIDriver: 1
EnablePCIERelaxedOrderingMode: 0
EnableResizableBar: 1
EnableGpuFirmware: 18
EnableGpuFirmwareLogs: 2
EnableDbgBreakpoint: 0
OpenRmEnableUnsupportedGpus: 1
DmaRemapPeerMmio: 1
RegistryDwords: ""
RegistryDwordsPerDevice: ""
RmMsg: ""
GpuBlacklist: ""
TemporaryFilePath: ""
ExcludedGpus: ""

Thank you for any assistance, this is becoming incredibly frustrating.

1 Like

Tried updated driver 545.29.06-20.
Can reproduce issue within 5 minutes of playing CS2.

NVRM: Xid (PCI:0000:01:00): 109, pid=5408, name=cs2, Ch 00000096, errorString CTX SWITCH TIMEOUT, Info 0x56c05f

Bug report from immediately after crash attached.
nvidia-bug-report.log.gz (742.7 KB)

Because of my ability to reproduce this issue I was hoping to hear some potential solutions or versions to try as I am easily able to confirm if they are effective in remedying these XID 109 driver crashes.

And on latest driver, 550.54.14, can reproduce just as easily. Kernel 6.7.6-arch1-1.

Xid (PCI:0000:01:00): 109, pid='<unknown>', name=<unknown>, Ch 0000008e, errorString CTX SWITCH TIMEOUT, Info 0x26c047

This time I ran the bug report tool before killing the offending GPU using app (CS2)
nvidia-bug-report.log.gz (795.3 KB)

I just experienced the same crash here in CS2. Running 550 driver in Ubuntu 23.10.

My card is a brand new 4070 Super, that will be used mostly for OpenCL stuff related to photo editing, but so far all heavy GPU tasks have caused failures.

When OpenCL fails I se errors like this:
[ 266.228441] NVRM: GPU at PCI:0000:0a:00: GPU-617ca489-a0c6-4820-a5d8-bb47f1f232bf
[ 266.228448] NVRM: Xid (PCI:0000:0a:00): 31, pid=8469, name=worker 3, Ch 00000008, intr 00000000. MMU Fault: ENGINE GRAPHICS GPCCLIENT_T1_2 faulted @ 0x500_00233000. Fault is of type FAULT_PDE ACCESS_TYPE_VIRT_READ
[36272.668229] NVRM: Xid (PCI:0000:0a:00): 13, pid=‘’, name=, Graphics SM Warp Exception on (GPC 3, TPC 1, SM 0): Out Of Range Address
[36272.668249] NVRM: Xid (PCI:0000:0a:00): 13, pid=‘’, name=, Graphics Exception: ESR 0x51cf30=0x101000e 0x51cf34=0x20 0x51cf28=0xf81eb60 0x51cf2c=0x1174
[36272.668882] NVRM: Xid (PCI:0000:0a:00): 43, pid=20472, name=test_basic, Ch 00000030
38704.375178] NVRM: Xid (PCI:0000:0a:00): 31, pid=‘’, name=, Ch 00000038, intr 00000000. MMU Fault: ENGINE GRAPHICS GPCCLIENT_T1_2 faulted @ 0x7fba_1cac2000. Fault is of type FAULT_PDE ACCESS_TYPE_VIRT_WRITE

Processing: nvidia-bug-report.log.gz…

I am experiencing the same “errorString CTX SWITCH TIMEOUT” when playing Path Of Exile 3.25

nvidia-bug-report.log.gz (871.7 KB)

I can try DX11, Vulkan and all do the same thing. It might work for hour or 2 and then constant crashes. Reboot might give me 1-2 hours play time again before it starts crashing.

I can also replicate it with some events in the game. Spider lair map boss room is known to crash.

1 Like

I forgot to mention. I can play World of Tanks with no problems and not experience crashes.

Hi @bling1987
Could you please test playing Path of Exile 3.25 with 560 beta driver and share test results.
If issue persists, could you save the map and share the save file with me? That way, I can load the same environment on my end.

4x King of the Mist fights and 3 times it crashed.

nvidia-bug-report.log.gz (1.3 MB)

Can’t find a way to install the 560 Beta on PopOS.

Beta install failed.
nvidia-installer.log (179.5 KB)

Crashed in “The Maven Crucible” fighting the “The Hidden” for bosses.

nvidia-bug-report.log.gz (824.3 KB)

Horizon Zero down and Control Ultimate Edition…
nvidia-bug-report.log.gz (860.6 KB)

I can easily repro issue with UE 5.5.1 and driver version 565.77: UE 5.5.1 Crashes GPU (?) when enabling 'Hit Lighting for Reflections' & 'Max Reflection Bounces'≥2 on Linux - Rendering - Epic Developer Community Forums

The issue is not present on 550.142.

Hi,
I have filed a bug internally 5052028 for tracking purpose.
Shall try for internal repro and get back if required any additional information.

1 Like

Hello,

Do we have some news about this bug ?
It still happens with nvidia latest drivers 570.86.16, opensource or proprietary

OpenSource driver:
[Sat Feb 15 00:04:06 2025] NVRM: GPU at PCI:0000:01:00: GPU-f0671f9c-6790-802e-8a9f-b44f61ae250f
[Sat Feb 15 00:04:06 2025] NVRM: Xid (PCI:0000:01:00): 109, pid=5039, name=exefile.exe, Ch 00000027, errorString CTX SWITCH TIMEOUT, Info 0x16c022

[Sat Feb 15 00:04:06 2025] NVRM: krcCheckBusError_KERNEL: PCE-I Advanced Error Reporting Corelogic Info:
[Sat Feb 15 00:04:06 2025] NVRM: krcCheckBusError_KERNEL: Uncorr Error Status Register : 00000000
[Sat Feb 15 00:04:06 2025] NVRM: krcCheckBusError_KERNEL: Uncorr Error Mask Register : 00010000
[Sat Feb 15 00:04:06 2025] NVRM: krcCheckBusError_KERNEL: Uncorr Error Severity Register : 00060010
[Sat Feb 15 00:04:06 2025] NVRM: krcCheckBusError_KERNEL: Corr Error Status Register : 00000001
[Sat Feb 15 00:04:06 2025] NVRM: krcCheckBusError_KERNEL: Corr Error Mask Register : 00002000
[Sat Feb 15 00:04:06 2025] NVRM: krcCheckBusError_KERNEL: Advanced Err Cap & Ctrl Register: 00000000
[Sat Feb 15 00:04:06 2025] NVRM: krcCheckBusError_KERNEL: Header Log [0-3] : 00000000
[Sat Feb 15 00:04:06 2025] NVRM: krcCheckBusError_KERNEL: Header Log [4-7] : 00000000
[Sat Feb 15 00:04:06 2025] NVRM: krcCheckBusError_KERNEL: Header Log [8-B] : 00000000
[Sat Feb 15 00:04:06 2025] NVRM: krcCheckBusError_KERNEL: Header Log [C-F] : 00000000
[Sat Feb 15 00:04:06 2025] NVRM: krcCheckBusError_KERNEL: Root Error Command Register : 00000000
[Sat Feb 15 00:04:06 2025] NVRM: krcCheckBusError_KERNEL: Root Error Status : 00000003
[Sat Feb 15 00:04:06 2025] NVRM: krcCheckBusError_KERNEL: Error Source ID Register : 00000000

Proprietary driver:
[Sat Feb 15 00:19:41 2025] NVRM: Xid (PCI:0000:01:00): 109, pid=120429, name=exefile.exe, Ch 00000027, errorString CTX SWITCH TIMEOUT, Info 0x16c022

Using the game Eve Online running on Steam through Proton Experimental. Game freezes few seconds after you entered the game and logged in your character.

Still happens to me with the closed source driver or the kernel-open flag enabled with 570.86.16-r1. Elden ring always produces an XiD 109 CTX SWITCH TIMEOUT error. Even when I try on windows (565 and 570 drivers), I get a similar freeze so I’m not sure if there is something wrong with my card or not. It’s definitely either a hardware or driver issue. Very frustrating.

Hello,
I get the same error with Assassin’s Creed Shadows :
Game crashes during the after introduction loading screen with
Xid (PCI:0000:2d:00): 109, pid='<unknown>', name=<unknown>, Ch 00000014, errorString CTX SWITCH TIMEOUT, Info 0x7c011
On 560.35.03 and 570.124.04
Lutris with Proton-GE 9.26, vkd3d 2.14.1, dxvk 2.6

System:
  Kernel: 6.8.0-55-generic arch: x86_64 bits: 64 compiler: gcc v: 13.3.0 clocksource: tsc
  Desktop: Cinnamon v: 6.4.8 tk: GTK v: 3.24.41 wm: Muffin v: 6.4.1 vt: 7 dm: LightDM v: 1.30.0
    Distro: Linux Mint 22.1 Xia base: Ubuntu 24.04 noble
Machine:
  Type: Desktop System: Micro-Star product: MS-7C37 v: 2.0 serial: <superuser required>
  Mobo: Micro-Star model: MPG X570 GAMING PLUS (MS-7C37) v: 2.0 serial: <superuser required>
    uuid: <superuser required> UEFI: American Megatrends LLC. v: A.O0 date: 07/15/2024
Battery:
  Device-1: ps-controller-battery-48:18:8d:0e:32:eb model: N/A serial: N/A charge: N/A status: full
CPU:
  Info: 6-core model: AMD Ryzen 5 3600 bits: 64 type: MT MCP smt: enabled arch: Zen 2 rev: 0 cache:
    L1: 384 KiB L2: 3 MiB L3: 32 MiB
  Speed (MHz): avg: 2340 high: 3893 min/max: 2200/4208 boost: enabled cores: 1: 2200 2: 2200
    3: 2200 4: 2200 5: 2198 6: 2200 7: 2200 8: 2199 9: 2200 10: 2199 11: 3893 12: 2200
    bogomips: 86398
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
Graphics:
  Device-1: NVIDIA TU104 [GeForce RTX 2080] vendor: Micro-Star MSI driver: nvidia v: 560.35.03
    arch: Turing pcie: speed: 2.5 GT/s lanes: 16 ports: active: none off: DP-1,DP-3,HDMI-A-1
    empty: DP-2 bus-ID: 2d:00.0 chip-ID: 10de:1e82 class-ID: 0300
Info:
  Memory: total: 32 GiB available: 31.26 GiB used: 2.92 GiB (9.4%)
  Processes: 343 Power: uptime: 14m states: freeze,mem,disk suspend: deep wakeups: 0
    hibernate: platform Init: systemd v: 255 target: graphical (5) default: graphical
  Compilers: clang: 14 alt: 18 gcc: 13.3.0 alt: 6/7/9/10/11/12 Client: Cinnamon v: 6.4.8
    inxi: 3.3.34

FWIW from HansKristian-Work (vkd3d-proton’s developer):

Known driver bug.

I made a Pyroveil configuration here: pyroveil/hacks/ac-shadows-nvidia-570-stable at master · HansKristian-Work/pyroveil · GitHub

See the FF VII Rebirth issue for how people got it to work if the README doesn’t work.

Another option is to install the latest NVIDIA Vulkan beta driver 570.123.06 which seems to have a fix for it. At least I got past those hangs, but didn’t play the game for extended periods to verify or anything. Note that this is not the normal beta driver. The Vulkan beta drivers have different numbering: Vulkan Driver Support | NVIDIA Developer (Vulkan Developer Beta Driver Release Updates).

Nice.
nvidia-vulkan drivers change nothing. There are even worse performances in Path of Exile 2 (30-40 FPS instead of ~60 FPS on a RTX4070 Mobile).

But
PROTON_HIDE_NVIDIA_GPU=1 PROTON_ENABLE_NVAPI=1 PYROVEIL_CONFIG=/home/user/.local/share/pyroveil/hacks/ac-shadows-nvidia-570-stable/pyroveil.json PYROVEIL=1 %command% makes the game launch and running without crash.

This still happens on 570.133.07 for me, but the 550 drivers work fine. It’s only an issue after I use any driver after 550.

I am having the following error on Path of Exile 2. I use to have it on Poe1 but I lowered by screen size which did resolve it.

NVRM: Xid (PCI:0000:01:00): 109, pid=‘’, name=, Ch 00000030, errorString CTX SWITCH TIMEOUT, Info 0x5c031