I do see the xid 61 hang too. This has different symptoms - Xorg crash instead of pegging a cpu and no mention of Xid 61 in syslog. It would be great if they had the same cause and could be fixed in one go but nothing much to suggest that yet.
Seems something completely different, no XIDs involved, the nvidia DDX seems to just be crashing out of nowhere.
You’re currently running the 440.82 driver, did this also happen with previous drivers?
I have the exact same error! The problem seems to only happen when I plug in a mouse after waking my computer. I’m running a Dell laptop with Quadro T1000 Mobile graphics and the 440 drivers. I’m on Ubuntu 20.04 Gnome.
Noticed that it also happens when closing chrome windows. Will try to switch to firefox and keep everyone updated.
Below are debugging traces in case it helps anyone
…
[ 28571.334] (II) event5 - SynPS/2 Synaptics TouchPad: SYN_DROPPED event - some input events have been lost.
[ 28654.552] () Option “fd” “51”
[ 28654.552] (II) event2 - Power Button: device removed
[ 28654.553] () Option “fd” “91”
[ 28654.553] (II) event4 - Video Bus: device removed
[ 28654.553] () Option “fd” “93”
[ 28654.553] (II) event0 - Sleep Button: device removed
[ 28654.554] () Option “fd” “53”
[ 28654.554] (II) event10 - Integrated IR Camera: Integrate: device removed
[ 28654.554] () Option “fd” “96”
[ 28654.554] (II) event9 - Integrated Camera: Integrated C: device removed
[ 28654.554] () Option “fd” “46”
[ 28654.554] (II) event3 - AT Translated Set 2 keyboard: device removed
[ 28654.555] () Option “fd” “92”
[ 28654.555] (II) event5 - SynPS/2 Synaptics TouchPad: device removed
[ 28654.555] () Option “fd” “94”
[ 28654.555] (II) event8 - TPPS/2 Elan TrackPoint: device removed
[ 28654.555] () Option “fd” “90”
[ 28654.555] (II) event13 - ThinkPad Extra Buttons: device removed
[ 28654.556] () Option “fd” “97”
[ 28654.556] (II) event17 - WH-XB900N (AVRCP): device removed
[ 28657.914] (II) systemd-logind: got pause for 13:71
[ 28657.914] (II) systemd-logind: got pause for 13:73
[ 28657.914] (II) systemd-logind: got pause for 13:67
[ 28657.914] (II) systemd-logind: got pause for 13:81
[ 28657.914] (II) systemd-logind: got pause for 13:66
[ 28657.914] (II) systemd-logind: got pause for 13:70
[ 28657.914] (II) systemd-logind: got pause for 13:74
[ 28657.914] (II) systemd-logind: got pause for 13:77
[ 28657.914] (II) systemd-logind: got pause for 13:68
[ 28657.914] (II) systemd-logind: got pause for 13:69
[ 28657.914] (II) systemd-logind: got pause for 13:64
[ 28657.914] (II) systemd-logind: got pause for 13:72
[ 28664.771] (II) systemd-logind: got resume for 13:71
[ 28664.771] (–) NVIDIA(GPU-0): DFP-0: disconnected
[ 28664.771] (–) NVIDIA(GPU-0): DFP-0: Internal DisplayPort
[ 28664.771] (–) NVIDIA(GPU-0): DFP-0: 1440.0 MHz maximum pixel clock
[ 28664.771] (–) NVIDIA(GPU-0):
[ 28664.771] (–) NVIDIA(GPU-0): DFP-1: disconnected
[ 28664.772] (–) NVIDIA(GPU-0): DFP-1: Internal DisplayPort
[ 28664.772] (–) NVIDIA(GPU-0): DFP-1: 1440.0 MHz maximum pixel clock
[ 28664.772] (–) NVIDIA(GPU-0):
[ 28664.772] (–) NVIDIA(GPU-0): DFP-2: disconnected
[ 28664.772] (–) NVIDIA(GPU-0): DFP-2: Internal TMDS
[ 28664.772] (–) NVIDIA(GPU-0): DFP-2: 165.0 MHz maximum pixel clock
[ 28664.772] (–) NVIDIA(GPU-0):
[ 28664.773] (–) NVIDIA(GPU-0): AU Optronics Corporation (DFP-3): connected
[ 28664.773] (–) NVIDIA(GPU-0): AU Optronics Corporation (DFP-3): Internal DisplayPort
[ 28664.773] (–) NVIDIA(GPU-0): AU Optronics Corporation (DFP-3): 1440.0 MHz maximum pixel clock
[ 28664.773] (–) NVIDIA(GPU-0):
[ 28664.838] (II) NVIDIA(0): Setting mode “DFP-3:nvidia-auto-select”
[ 28668.134] (II) config/udev: removing device WH-XB900N (AVRCP)
[ 28668.135] (II) UnloadModule: “libinput”
[ 28668.135] (II) systemd-logind: releasing fd for 13:81
[ 28668.135] (EE) systemd-logind: failed to release device: Device not taken
[ 28668.680] (II) libinput: TPPS/2 Elan TrackPoint: SetProperty on 340 called but device is disabled.
This driver cannot change properties on a disabled device
[ 28668.680] (II) libinput: TPPS/2 Elan TrackPoint: SetProperty on 328 called but device is disabled.
This driver cannot change properties on a disabled device
[ 28668.681] (II) libinput: TPPS/2 Elan TrackPoint: SetProperty on 342 called but device is disabled.
This driver cannot change properties on a disabled device
[ 28672.804] (II) systemd-logind: got resume for 13:73
[ 28672.805] (II) event9 - Integrated Camera: Integrated C: is tagged by udev as: Keyboard
[ 28672.805] (II) event9 - Integrated Camera: Integrated C: device is a keyboard
[ 28672.806] (II) systemd-logind: got resume for 13:67
[ 28672.807] (II) event3 - AT Translated Set 2 keyboard: is tagged by udev as: Keyboard
[ 28672.807] (II) event3 - AT Translated Set 2 keyboard: device is a keyboard
[ 28672.808] (II) systemd-logind: got resume for 13:66
[ 28672.808] (II) event2 - Power Button: is tagged by udev as: Keyboard
[ 28672.808] (II) event2 - Power Button: device is a keyboard
[ 28672.809] (II) systemd-logind: got resume for 13:70
[ 28672.810] (II) systemd-logind: got resume for 13:74
[ 28672.814] (II) event10 - Integrated IR Camera: Integrate: is tagged by udev as: Keyboard
[ 28672.814] (II) event10 - Integrated IR Camera: Integrate: device is a keyboard
[ 28672.815] (II) systemd-logind: got resume for 13:77
[ 28672.818] (II) event13 - ThinkPad Extra Buttons: is tagged by udev as: Keyboard Switch
[ 28672.818] (II) event13 - ThinkPad Extra Buttons: device is a keyboard
[ 28672.819] (II) systemd-logind: got resume for 13:68
[ 28672.822] (II) event4 - Video Bus: is tagged by udev as: Keyboard
[ 28672.822] (II) event4 - Video Bus: device is a keyboard
[ 28672.823] (II) systemd-logind: got resume for 13:69
[ 28672.825] (II) event5 - SynPS/2 Synaptics TouchPad: is tagged by udev as: Touchpad
[ 28672.826] (II) event5 - SynPS/2 Synaptics TouchPad: device is a touchpad
[ 28672.827] (II) systemd-logind: got resume for 13:64
[ 28672.828] (II) event0 - Sleep Button: is tagged by udev as: Keyboard
[ 28672.828] (II) event0 - Sleep Button: device is a keyboard
[ 28672.829] (II) systemd-logind: got resume for 13:72
[ 28672.830] (II) event8 - TPPS/2 Elan TrackPoint: is tagged by udev as: Mouse Pointingstick
[ 28672.830] (II) event8 - TPPS/2 Elan TrackPoint: device is a pointer
[ 28725.819] (EE)
[ 28725.819] (EE) Backtrace:
[ 28725.821] (EE) 0: /usr/lib/xorg/Xorg (OsLookupColor+0x13c) [0x55e5e36e1e0c]
[ 28725.822] (EE) 1: /lib/x86_64-linux-gnu/libpthread.so.0 (funlockfile+0x60) [0x7f99e8d0641f]
[ 28725.823] (EE) 2: /usr/lib/x86_64-linux-gnu/nvidia-440/xorg/nvidia_drv.so (?+0x0) [0x7f99e805e82c]
[ 28725.823] (EE)
[ 28725.823] (EE) Segmentation fault at address 0xab
[ 28725.823] (EE)
Fatal server error:
[ 28725.823] (EE) Caught signal 11 (Segmentation fault). Server aborting
[ 28725.823] (EE)
[ 28725.823] (EE)
Sadly it’s pretty random but I’ve noticed it often seems coincident with an interaction Electron based applications (which use Chromium for rendering) e.g. closing VS Code.
I’ll try scripting some application churn and see if that provokes it.
Hello.
I can indeed confirm this is also happening for me with apps using electron. (eg. Discord chat client)
App freezes for a bit and whole X crashes. (can relogin and will work).
@amrits I just tried driver release 450.80.02 and still observe frequent crashes with the same stack trace. Here are my steps to reproduce. From the other descriptions, it seems like these could be simpler but these steps seem to reproduce the bug repeatedly.
Use multiple display devices. I have a Quadro P2000 driving three displays. I can reproduce the driver bug using a bare bones Xorg configuration containing only a device section that specifies the nvidia driver as the driver to use. Only one “X screen” is created.
Launch a multi-window OpenGL accelerated application.
Make a window on each display expand to full-screen.
Interact with each window for a while and try to close the application.
Repeat 2-4.
I believe the most important point is that multiple GLX contexts exist and I think it matters that their associated drawables are on different displays. In my case, I see the bug almost every time after I extend the windows to full-screen. I have seen this bug across machines CentOS 7, Ubuntu 20.04 with Xorg 1.19.5 and Xorg 1.20.8 and many different driver versions (390.87 - 450.80.02).
FWIW this bug has been reported multiple other times (in case there is valuable information there):