FWIW, another datapoint:
HP ZBook Studio G5
Quadro P1000 Mobile
NVIDIA-Linux-x86_64-430.50
Laptop does not hang when plugged into docking station (which connects two external monitors).
Laptop will ultimately hang when not on docking station even when plugged into a regular external power source. There appears to be some difference in when exactly, but really it’s random. Working with an external monitor or power supply seems to help. Keep working on the laptop really seems to help (but not always)
Usually it will hang after leaving the laptop idle for a few minutes.
Afaict there is no suspending when on plugged into a power source and it hangs before even blanking the screen, so I doubt this is a suspension problem. Also I’ve tried running with
xset s off
xset -dpms
xset s noblank
with no noticeable difference.
okt 21 18:34:29 ltcmc2019-2 kernel: NVRM: GPU at PCI:0000:01:00: GPU-34790a52-7e95-3466-3b05-0861e2979698
okt 21 18:34:29 ltcmc2019-2 kernel: NVRM: Xid (PCI:0000:01:00): 79, pid=1473, GPU has fallen off the bus.
okt 21 18:34:29 ltcmc2019-2 kernel: NVRM: GPU 0000:01:00.0: GPU has fallen off the bus.
okt 21 18:34:29 ltcmc2019-2 kernel: NVRM: A GPU crash dump has been created. If possible, please run
NVRM: nvidia-bug-report.sh as root to collect this data before
NVRM: the NVIDIA kernel module is unloaded.
okt 21 18:34:40 ltcmc2019-2 kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:0:0:0x0000000f
okt 21 18:34:40 ltcmc2019-2 kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:1:0:0x0000000f
okt 21 18:34:40 ltcmc2019-2 kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:0:0:0x0000000f
okt 21 18:34:40 ltcmc2019-2 kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:1:0:0x0000000f
okt 21 18:34:40 ltcmc2019-2 kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:0:0:0x0000000f
okt 21 18:34:40 ltcmc2019-2 kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:1:0:0x0000000f
okt 21 18:34:40 ltcmc2019-2 kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:0:0:0x0000000f
okt 21 18:34:40 ltcmc2019-2 kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000927c:1:0:0x0000000f
okt 21 18:34:43 ltcmc2019-2 /usr/libexec/gdm-x-session[1973]: (WW) NVIDIA(0): WAIT (2-S, 17, 0x02c7, 0x0000f894, 0x0000018c)
okt 21 18:34:50 ltcmc2019-2 /usr/libexec/gdm-x-session[1973]: (WW) NVIDIA(0): WAIT (1-S, 17, 0x02c7, 0x0000f894, 0x0000018c)
okt 21 18:34:53 ltcmc2019-2 /usr/libexec/gdm-x-session[1973]: (WW) NVIDIA(0): WAIT (2-S, 17, 0x02c4, 0x0000f894, 0x0000018c)
$ lspci | grep -Ei "vga|3d"
01:00.0 VGA compatible controller: NVIDIA Corporation GP107GLM [Quadro P1000 Mobile] (rev ff)
nvidia-smi showed no sign of overheating.
System was still usable over ssh, so I ran nvidia-bug-report.sh before rebooting.
Output here: [sl]https://…/nvidia-bug-report.log.gz[/s] no longer online
Rebooting did not work. SSH connection was dropped, but the display didn’t change, so the shutdown process hung too.