OS: CentOS Linux (Core)
Driver Version:
CUDA Version 10.1.243
NVRM version: NVIDIA UNIX Open Kernel Module for x86_64 515.65.01
When type nvidia_smi
in the terminal, there is an error “Unable to determine the device handle GPU 0000:3B:00.0: Not Found”
And this is the output of nvidia-debugdump --list
Found 2 NVIDIA devices
Error: nvmlDEviceGetHandleByINdex(): Not Found
FAILED to get details on GPU (0x0): Not Found
This happens after an attempt to update from CPU 10. to CPU 11, and after many failed attempt to reboot. The computer is stuck to reboot. I get this error by doing Ctrl +Alt+F2
.
Thank you!