$ nvidia-smi
Unable to determine the device handle for GPU 0000:02:00.0: Unknown Error
This happens suddenly.
It is assumed to be caused by the boinc-application.
$ nvidia-smi
Unable to determine the device handle for GPU 0000:02:00.0: Unknown Error
This happens suddenly.
It is assumed to be caused by the boinc-application.
Also the nvidia-smi -l 60 (as shown below), stops with “Unknown Error”:
Thu May 19 12:13:56 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 510.47.03 Driver Version: 510.47.03 CUDA Version: 11.6 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA GeForce ... Off | 00000000:02:00.0 Off | N/A |
| N/A 80C P0 N/A / N/A | 2291MiB / 4096MiB | 97% E. Process |
| | | N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| 0 N/A N/A 1412 G /usr/lib/xorg/Xorg 4MiB |
| 0 N/A N/A 3491 C bin/python 2285MiB |
+-----------------------------------------------------------------------------+
Unexpected NVML event
Error occurred while processing the event: Unknown Error