Distribution: Opensuse Tumbleweed
DE: Plasma 5.25.5
CPU: AMD Ryzen 5 3600 (12) @ 3.600GHz
GPU: NVIDIA GeForce GTX 1060 6GB
Kernel: 6.0.0-1-default.
Nvidia Driver: 515.76
This random freeze happens randomly, at leas from three drivers versions ago from official repos. I was trying the drivers from the Nvidia website, and they were working okay, today I installed the ones from the Suse repos, and happened again. Here are some logs from when this occurred.
logerror.txt (11.1 KB)
That log doesn’t really help, it only tells that plasma reports the gpu is gone. The kernel logs are important to see what the nvidia driver reports.
Thanks for the help, I get this:
oct 11 14:50:38 localhost.localdomain kernel: NVRM: GPU at PCI:0000:07:00: GPU-f186d740-9b0a-8de5-0b62-0c8e43797cf2
oct 11 14:50:38 localhost.localdomain kernel: NVRM: Xid (PCI:0000:07:00): 79, pid=‘’, name=, GPU has fallen off the bus.
oct 11 14:50:38 localhost.localdomain kernel: NVRM: GPU 0000:07:00.0: GPU has fallen off the bus.
oct 11 14:50:38 localhost.localdomain kernel: NVRM: A GPU crash dump has been created. If possible, please run
NVRM: nvidia-bug-report.sh as root to collect this data before
NVRM: the NVIDIA kernel module is unloaded.
oct 11 14:50:44 localhost.localdomain systemd[1]: Starting Cleanup of Temporary Directories…
oct 11 14:50:44 localhost.localdomain systemd[1]: systemd-tmpfiles-clean.service: Deactivated successfully.
oct 11 14:50:44 localhost.localdomain systemd[1]: Finished Cleanup of Temporary Directories.
oct 11 14:50:51 localhost.localdomain kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:0:0:0x0000000f
oct 11 14:50:52 localhost.localdomain kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:0:0:0x0000000f
oct 11 14:50:52 localhost.localdomain kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:0:0:0x0000000f
oct 11 14:50:52 localhost.localdomain kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:0:0:0x0000000f
oct 11 14:50:52 localhost.localdomain kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:0:0:0x0000000f
oct 11 14:50:52 localhost.localdomain kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:0:0:0x0000000f
oct 11 14:50:52 localhost.localdomain kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:0:0:0x0000000f
Which generate the following:
nvidia-bug-report.log.gz (447.7 KB)
Xid 79, fallen off the bus.
Usually on high-load situations due to overheating or lack of power (PSU).
Since in your case this happens randomly, this rather points to a dying gpu (or maybe just the psu). Please try reseating it, check/replace power connectors, check if it reliably works in another system.