Ubuntu 22.10 (5.19.0-29-generic) 525.78.01 crash after some time, after wakeup from hibernate

Hi.

I have Thinkpad P16 and it freezes after some time, or some action after wakeup from hibernate (external displays and mouse&keyboard pluged in thunderbolt). I mean, mouse still moves and I can ssh to the machine and this is the stack trace I found through dmesg (nothing more in syslog). If I run nvidia-smi, the command freezes - it doesn’t output anything.

[28880.111144] INFO: task xfwm4:9834 blocked for more than 120 seconds.
[28880.111150]       Tainted: P           OE     5.19.0-29-generic #30-Ubuntu
[28880.111151] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[28880.111152] task:xfwm4           state:D stack:    0 pid: 9834 ppid:  9603 flags:0x00000002
[28880.111155] Call Trace:
[28880.111157]  <TASK>
[28880.111160]  __schedule+0x24b/0x5f0
[28880.111166]  schedule+0x63/0x110
[28880.111167]  rwsem_down_write_slowpath+0x2f7/0x5b0
[28880.111170]  down_write+0x4c/0x60
[28880.111173]  os_acquire_rwlock_write+0x3c/0x70 [nvidia]
[28880.111430]  _nv038381rm+0xc/0x30 [nvidia]
[28880.111678]  ? _nv039329rm+0x18d/0x1d0 [nvidia]
[28880.111863]  ? _nv039248rm+0x21/0xd0 [nvidia]
[28880.112059]  ? _nv000694rm+0x27d/0xc80 [nvidia]
[28880.112255]  ? rm_ioctl+0x54/0xb0 [nvidia]
[28880.112449]  ? nvidia_ioctl+0x6cc/0x890 [nvidia]
[28880.112575]  ? nvidia_frontend_unlocked_ioctl+0x55/0xa0 [nvidia]
[28880.112704]  ? __x64_sys_ioctl+0x9d/0xe0
[28880.112707]  ? do_syscall_64+0x58/0x90
[28880.112710]  ? do_syscall_64+0x67/0x90
[28880.112711]  ? __x64_sys_poll+0xc7/0x150
[28880.112712]  ? exit_to_user_mode_prepare+0x30/0xb0
[28880.112715]  ? syscall_exit_to_user_mode+0x29/0x50
[28880.112717]  ? do_syscall_64+0x67/0x90
[28880.112718]  ? irqentry_exit_to_user_mode+0x9/0x20
[28880.112720]  ? irqentry_exit+0x43/0x50
[28880.112721]  ? exc_page_fault+0x91/0x1b0
[28880.112723]  ? entry_SYSCALL_6

system information:

  • xubuntu 22.10
  • kernel: 5.19.0-29-generic
$ nvidia-smi 
Mon Jan 23 22:10:13 2023       
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 525.78.01    Driver Version: 525.78.01    CUDA Version: 12.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA RTX A100...  Off  | 00000000:01:00.0  On |                  N/A |
| N/A   39C    P5     8W /  60W |    694MiB /  4096MiB |     32%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A      2907      G   /usr/lib/xorg/Xorg                669MiB |
|    0   N/A  N/A      9668      G   xfwm4                               2MiB |
+-----------------------------------------------------------------------------+

Let me know, if additional information is needed.

Kind regards.