We installed 8 Telsa P4 cards on our server. But yesterday night something was wrong with our software with the error log：
nnvidia-container-cli: initialization error: driver error: timed out\\n\\"\"": unknown
then we reboot the server. But when we use the “nvidia-smi” command to check GPU status, we find that the command only show 7 cards. We checked the PCIEs with command “lspci | grep -i nvidia”. It showed 8 nvidia GPU cards.
So here I wonder what’s wrong with the disappeared GPU card? How can I solve this problem?