I am trying to install a Tesla M60 in a HP Proliant DL580 gen 9 server.
The card is recognized by the lspci command but, although I have followed the entire procedure of driver installation, still I am unable to see the GPU running nvidia-smi command. Here is the output of the bug report
nvidia-bug-report.log.gz (201.8 KB)
Any help is much appreciated
It seems I made some progress.
[irace@xeon-server ~]$ cat /proc/driver/nvidia/version
NVRM version: NVIDIA UNIX x86_64 Kernel Module 535.129.03 Thu Oct 19 18:56:32 UTC 2023
GCC version: gcc version 4.8.5 20150623 (Red Hat 4.8.5-44) (GCC)
And from the nvidia-bug-report
Nov 17 19:44:40 xeon-server kernel: NVRM: GPU 0000:c6:00.0: GPU does not have the necessary power cables connected.
Nov 17 19:44:40 xeon-server kernel: NVRM: GPU 0000:c6:00.0: RmInitAdapter failed! (0x24:0x1c:1436)
Nov 17 19:44:40 xeon-server kernel: NVRM: GPU 0000:c6:00.0: rm_init_adapter failed, device minor number 0
This is because we realized we need a cable for the reverse flow cooling connected to the M60.
Any other thing to check?