Hi.
I have Thinkpad P16 and it freezes after some time, or some action after wakeup from hibernate (external displays and mouse&keyboard pluged in thunderbolt). I mean, mouse still moves and I can ssh to the machine and this is the stack trace I found through dmesg (nothing more in syslog). If I run nvidia-smi, the command freezes - it doesn’t output anything.
[28880.111144] INFO: task xfwm4:9834 blocked for more than 120 seconds.
[28880.111150] Tainted: P OE 5.19.0-29-generic #30-Ubuntu
[28880.111151] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[28880.111152] task:xfwm4 state:D stack: 0 pid: 9834 ppid: 9603 flags:0x00000002
[28880.111155] Call Trace:
[28880.111157] <TASK>
[28880.111160] __schedule+0x24b/0x5f0
[28880.111166] schedule+0x63/0x110
[28880.111167] rwsem_down_write_slowpath+0x2f7/0x5b0
[28880.111170] down_write+0x4c/0x60
[28880.111173] os_acquire_rwlock_write+0x3c/0x70 [nvidia]
[28880.111430] _nv038381rm+0xc/0x30 [nvidia]
[28880.111678] ? _nv039329rm+0x18d/0x1d0 [nvidia]
[28880.111863] ? _nv039248rm+0x21/0xd0 [nvidia]
[28880.112059] ? _nv000694rm+0x27d/0xc80 [nvidia]
[28880.112255] ? rm_ioctl+0x54/0xb0 [nvidia]
[28880.112449] ? nvidia_ioctl+0x6cc/0x890 [nvidia]
[28880.112575] ? nvidia_frontend_unlocked_ioctl+0x55/0xa0 [nvidia]
[28880.112704] ? __x64_sys_ioctl+0x9d/0xe0
[28880.112707] ? do_syscall_64+0x58/0x90
[28880.112710] ? do_syscall_64+0x67/0x90
[28880.112711] ? __x64_sys_poll+0xc7/0x150
[28880.112712] ? exit_to_user_mode_prepare+0x30/0xb0
[28880.112715] ? syscall_exit_to_user_mode+0x29/0x50
[28880.112717] ? do_syscall_64+0x67/0x90
[28880.112718] ? irqentry_exit_to_user_mode+0x9/0x20
[28880.112720] ? irqentry_exit+0x43/0x50
[28880.112721] ? exc_page_fault+0x91/0x1b0
[28880.112723] ? entry_SYSCALL_6
system information:
- xubuntu 22.10
- kernel: 5.19.0-29-generic
$ nvidia-smi
Mon Jan 23 22:10:13 2023
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 525.78.01 Driver Version: 525.78.01 CUDA Version: 12.0 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA RTX A100... Off | 00000000:01:00.0 On | N/A |
| N/A 39C P5 8W / 60W | 694MiB / 4096MiB | 32% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| 0 N/A N/A 2907 G /usr/lib/xorg/Xorg 669MiB |
| 0 N/A N/A 9668 G xfwm4 2MiB |
+-----------------------------------------------------------------------------+
Let me know, if additional information is needed.
Kind regards.