This happened last week too and I ignored it. This week it happened again and I have run the nvidia-bug-report.sh and have attached it too.
The GPU stops responding and when nvidia-smi is run I get the error “Unable to determine the device handle for GPU 0000:03:00.0: GPU is lost. Reboot the system to recover this GPU”.
What is this error about?
nvidia-bug-report.log (2.19 MB)