I am having issues after installing cuda drivers on a RHEL 7 VM with P40 GPU
[root@qaulcdcdvm01 ~]# lspci | grep -i --color ‘vga|3d|2d’
00:0f.0 VGA compatible controller: VMware SVGA II Adapter
13:00.0 3D controller: NVIDIA Corporation GP102GL [Tesla P40] (rev a1)
[root@qaulcdcdvm01 ~]# uname -a
Linux qaulcdcdvm01.nyumc.org 3.10.0-1062.12.1.el7.x86_64 #1 SMP Thu Dec 12 06:44:49 EST 2019 x86_64 x86_64 x86_64 GNU/Linux
[root@qaulcdcdvm01 ~]#
[root@qaulcdcdvm01 ~]# rpm -qa|grep -i nvidia
nvidia-driver-latest-dkms-cuda-440.64.00-1.el7.x86_64
nvidia-driver-latest-dkms-NVML-440.64.00-1.el7.x86_64
nvidia-driver-latest-dkms-440.64.00-1.el7.x86_64
nvidia-driver-local-repo-rhel7-440.64.00-1.0-1.x86_64
nvidia-xconfig-latest-dkms-440.64.00-1.el7.x86_64
nvidia-driver-latest-dkms-devel-440.64.00-1.el7.x86_64
nvidia-modprobe-latest-dkms-440.64.00-1.el7.x86_64
yum-plugin-nvidia-0.5-1.el7.noarch
nvidia-persistenced-latest-dkms-440.64.00-1.el7.x86_64
nvidia-driver-latest-dkms-libs-440.64.00-1.el7.x86_64
kmod-nvidia-latest-dkms-440.64.00-1.el7.x86_64
nvidia-driver-latest-dkms-cuda-libs-440.64.00-1.el7.x86_64
nvidia-driver-latest-dkms-NvFBCOpenGL-440.64.00-1.el7.x86_64
[root@qaulcdcdvm01 ~]#
[root@qaulcdcdvm01 ~]# nvidia-smi
NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
[root@qaulcdcdvm01 ~]#
Dmesg
NVRM: BAR1 is 0M @ 0x0 (PCI:0000:13:00.0)
[ 690.683150] NVRM: The system BIOS may have misconfigured your GPU.
[ 690.683153] nvidia: probe of 0000:13:00.0 failed with error -1
[ 690.683164] NVRM: The NVIDIA probe routine failed for 1 device(s).
[ 690.683165] NVRM: None of the NVIDIA devices were initialized.
[ 690.683304] nvidia-nvlink: Unregistered the Nvlink Core, major device number 241
[root@qaulcdcdvm01 ~]#
Do you know what could be the issue?