I have a RTX 4060 Ti, with Driver version 570.124.06 on Ubuntu 22.04 and have been encountering complete system freezes. I have not been able to reproduce them, but they seem to mostly happen when using Firefox often when creating new tabs or scrolling. I have had it happen without Firefox open, but not nearly as much as when I am using it. Usually the issue occurs once every ~30 minutes.
It’s completely unresponsive, so I need to use the power button to shut it off.
journalctl gives me:
lines 4945-4968/4968 (END)
Mär 08 13:12:12 max kernel: ? srso_return_thunk+0x5/0x5f
Mär 08 13:12:12 max kernel: ? irqentry_exit+0x43/0x50
Mär 08 13:12:12 max kernel: ? srso_return_thunk+0x5/0x5f
Mär 08 13:12:12 max kernel: ? entry_SYSCALL_64_after_hwframe+0x78/0x80
Mär 08 13:12:12 max kernel: </TASK>
Mär 08 13:12:12 max kernel: NVRM: Xid (PCI:0000:08:00): 154, GPU recovery action changed from 0x0 (None) to 0>
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: (WW) NVIDIA: Wait for channel idle timed out.
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: (EE) NVIDIA(GPU-0): Push buffer DMA allocation failed
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: (EE) NVIDIA(0): Failed to allocate push buffer
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: (EE) NVIDIA(0): Error recovery failed.
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: (EE) NVIDIA(0): *** Aborting ***
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: (EE)
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: Fatal server error:
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: (EE) Failed to recover from error!
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: (EE)
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: (EE)
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: Please consult the The X.Org Foundation support
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: at http://wiki.x.org
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: for help.
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: (EE) Please also check the log file at "/home/max/.loca>
Mär 08 13:12:17 max /usr/libexec/gdm-x-session[2602]: (EE)
Mär 08 13:12:17 max kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x000>
Mär 08 13:12:17 max kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x000>
Mär 08 13:12:17 max kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x000>
Mär 08 13:12:17 max kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x000>
~
and in the xorg.1.log I have
47 [ 19.926] (--) NVIDIA(GPU-0): DFP-4: Internal DisplayPort
46 [ 19.926] (--) NVIDIA(GPU-0): DFP-4: 2670.0 MHz maximum pixel clock
45 [ 19.926] (--) NVIDIA(GPU-0):·
44 [ 19.926] (--) NVIDIA(GPU-0): DFP-5: disconnected
43 [ 19.926] (--) NVIDIA(GPU-0): DFP-5: Internal TMDS
42 [ 19.926] (--) NVIDIA(GPU-0): DFP-5: 165.0 MHz maximum pixel clock
41 [ 19.926] (--) NVIDIA(GPU-0):·
40 [ 285.676] (--) NVIDIA(GPU-0): Lenovo Group Limited T24i-2L (DFP-0): connected
39 [ 285.676] (--) NVIDIA(GPU-0): Lenovo Group Limited T24i-2L (DFP-0): Internal TMDS
38 [ 285.676] (--) NVIDIA(GPU-0): Lenovo Group Limited T24i-2L (DFP-0): 600.0 MHz maximum pixel clock
37 [ 285.676] (--) NVIDIA(GPU-0):·
36 [ 285.676] (--) NVIDIA(GPU-0): DFP-1: disconnected
35 [ 285.676] (--) NVIDIA(GPU-0): DFP-1: Internal DisplayPort
34 [ 285.676] (--) NVIDIA(GPU-0): DFP-1: 2670.0 MHz maximum pixel clock
33 [ 285.676] (--) NVIDIA(GPU-0):·
32 [ 285.676] (--) NVIDIA(GPU-0): DFP-2: disconnected
31 [ 285.676] (--) NVIDIA(GPU-0): DFP-2: Internal TMDS
30 [ 285.676] (--) NVIDIA(GPU-0): DFP-2: 165.0 MHz maximum pixel clock
29 [ 285.676] (--) NVIDIA(GPU-0):·
28 [ 285.721] (--) NVIDIA(GPU-0): Lenovo Group Limited LEN T24d-10 (DFP-3): connected
27 [ 285.722] (--) NVIDIA(GPU-0): Lenovo Group Limited LEN T24d-10 (DFP-3): Internal TMDS
26 [ 285.722] (--) NVIDIA(GPU-0): Lenovo Group Limited LEN T24d-10 (DFP-3): 600.0 MHz maximum pixel clock
25 [ 285.722] (--) NVIDIA(GPU-0):·
24 [ 285.722] (--) NVIDIA(GPU-0): DFP-4: disconnected
23 [ 285.722] (--) NVIDIA(GPU-0): DFP-4: Internal DisplayPort
22 [ 285.722] (--) NVIDIA(GPU-0): DFP-4: 2670.0 MHz maximum pixel clock
21 [ 285.722] (--) NVIDIA(GPU-0):·
20 [ 285.722] (--) NVIDIA(GPU-0): DFP-5: disconnected
19 [ 285.722] (--) NVIDIA(GPU-0): DFP-5: Internal TMDS
18 [ 285.722] (--) NVIDIA(GPU-0): DFP-5: 165.0 MHz maximum pixel clock
17 [ 285.722] (--) NVIDIA(GPU-0):·
16 [ 768.274] (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; attempting to
15 [ 768.274] (EE) NVIDIA(0): recover...
14 [ 773.274] (WW) NVIDIA: Wait for channel idle timed out.
13 [ 773.291] (EE) NVIDIA(GPU-0): Push buffer DMA allocation failed
12 [ 773.291] (EE) NVIDIA(0): Failed to allocate push buffer
11 [ 773.291] (EE) NVIDIA(0): Error recovery failed.
10 [ 773.291] (EE) NVIDIA(0): *** Aborting ***
9 [ 773.291] (EE)·
8 Fatal server error:
7 [ 773.291] (EE) Failed to recover from error!
6 [ 773.291] (EE)·
5 [ 773.291] (EE)·
4 Please consult the The X.Org Foundation support·
3 » at http://wiki.x.org
2 for help.·
1 [ 773.291] (EE) Please also check the log file at "/home/max/.local/share/xorg/Xorg.1.log" for additio nal information.
0 [ 773.291] (EE)·
I have tried a different driver version but it did not resolve the issue for me.