Hello there this is a follow up post to my previous Control (The game) issue as I encounter similar issues with Metro Exodus (Linux native game) and Metro Exodus PC Enhanced Edition (Proton + VKD3D)
Nov 26 11:34:25 z004 kernel: NVRM: Xid (PCI:0000:26:00): 109, pid=9664, name=MetroExodus.exe, Ch 00000056, errorString CTX SWITCH TIMEOUT, Info 0x34c027
Nov 26 11:34:21 z004 kernel: NVRM: Xid (PCI:0000:26:00): 13, pid=‘’, name=, Graphics Exception: ESR 0x5147b0=0x17000b 0x5147b4=0x0 0x5147a8=0xf812b60 0x5147ac=0x1104
Nov 26 11:34:21 z004 kernel: NVRM: Xid (PCI:0000:26:00): 13, pid=‘’, name=, Graphics SM Warp Exception on (GPC 2, TPC 0, SM 1): Illegal Instruction Parameter
It consistently freezes my system for about 10-20 seconds whenever I launch a compute process with PyTorch. The last driver version that didn’t have this issue for me was 515.76.
I have filed a bug 3959156 internally for tracking purpose.
Shall try to reproduce issue locally and if needed any additional information, will get back.
Hi All,
I tried playing game Metro Exodus (Linux native game) for around 30 minutes on couple of notebooks which have RTX 3070 Ti and RTX 2060 but could not observed any XID errors.
I would like to know repro frequency at your end and is there any other way also to reproduce issue consistently.
The issue happens with the Windows version of Metro Exodus when it’s ran through Proton (the log says “name=MetroExodus.exe”). The Windows version runs much smoother so it’s better than the native. Before it worked almost fine except you had to disable hairworks (otherwise it freezes in intro), the rest was okay. Now it freezes on the title screen right before showing the main menu, the log reports the Xid errors as stated in posts above.
My game settings are everything to max except Hairwork which I disabled.
What I observed though is that this issue does not happen if you start Metro Exodus in safe-mode (after crash) or the first time post install and then set everything to max (except hairworks) and start playing without rebooting the game.
It happens on the 2nd start after all settings have been turned up and the game was shut-down entirely.
This however happens on both the native Metro Exodus and Metro Exodus PC Enhanced Edition via Proton and VKD3D
PC Enhanced Edition Settings I get the crash with:
Alight … I think I found the issue. For some unknown reason it’s the resolution.
Running the above settings but on 720p all is fine, setting my resolution to 1080p makes the game crash before the main menu on the next game start.
My desktop config is two 1920x1080 (60Hz) displays which makes my primary resolution 1080p and can’t get higher.
So it may be the issue that setting the game resolution to the primary desktop resolution crashes it?
when i try to run diablo2 with median xl patches and GitHub - bolrog/d2dx: D2DX is a complete solution to make Diablo II run well on modern PCs, with high fps and better resolutions. so it in turn is a dx11 title and running it fullscreen on an external monitor. windowed or even just running on the internal it works. but as fast as i try to run it fullscreen on the external monitor this Xid happends. and a reboot is required. this is on kwin 5.27 wayland, and nvidia 525.89.02, tried downgrading various things since i think this was working before. but didnt go as long back as 520.56.06 , it can occur with other various titles when trying to run them fullscreen on the external monitor in wine aswell
yep managed to find an old archive of 520.56.06 and those runs the games just fine aswell. no Xid 56, but at the point of where it usually froze. it prints this to dmesg [drm:nv_drm_fence_context_create_ioctl [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000100] Failed to allocate fence signaling event , if thats anything related or just an other issue that simply was fixed later.
I’m experiencing a similar issue using an RTX 4090. Training runs with pytorch start fine but randomly fail anywhere from 1 to 10 hours into training, with the Xid 109 CTX SWITCH TIMEOUT error.
The difficult part is that I haven’t found a way to quickly reproduce the issue, it only occurs randomly, usually after an hour or so.
Various configurations I’ve tested:
WSL
Native Ubuntu
Power Limiting GPU to 50%
Limiting memory usage to 50%
Thanks @gulafaran for sharing test results, you are no longer experiencing the original issue with driver 520.56.06.
However, you are seeing different error messages, can you please confirm if it’s consistent and you are seeing any performance drop or application crashing or any other functional issue.