I have the issue without any high temperatures or extensive use of the card. The issue also comes with poor graphics performance. Simply rebooting “solves” the problem, but it would be really nice to actually solve it. The current output of nvidia-smi that I get is:
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 450.57 Driver Version: 450.57 CUDA Version: 11.0 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 GeForce RTX 2060 Off | 00000000:08:00.0 On | N/A |
|ERR! 63C P5 ERR! / 170W | 908MiB / 5931MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+