Problem Description: Unable to Detect Nvidia Tesla V100S GPU
Request for Assistance:
I’m encountering difficulties with detecting and utilizing the Nvidia Tesla V100S GPU on my Ubuntu system. Despite having the Nvidia driver installed and Nvidia modules loaded, the GPU is not being recognized by the system. I would greatly appreciate your assistance in diagnosing and resolving this issue.
Thank you for your expertise and help in advance.
GPU: NVIDIA Corporation GV100GL [Tesla V100S PCIe 32GB]
OS: Ubuntu 20.04
ubuntu-drivers devicescommand shows a list of Nvidia driver versions including recommended and third-party options.
gpu2@avc1:~$ ubuntu-drivers devices WARNING:root:_pkg_get_support nvidia-driver-525-server: package has invalid Support PBheader, cannot determine support level WARNING:root:_pkg_get_support nvidia-driver-535-server: package has invalid Support PBheader, cannot determine support level WARNING:root:_pkg_get_support nvidia-driver-525: package has invalid Support PBheader, cannot determine support level == /sys/devices/pci0000:00/0000:00:16.0/0000:0b:00.0 == modalias : pci:v000010DEd00001DF6sv000010DEsd000013D6bc03sc02i00 vendor : NVIDIA Corporation model : GV100GL [Tesla V100S PCIe 32GB] driver : nvidia-driver-535 - third-party non-free driver : nvidia-driver-525-server - distro non-free driver : nvidia-driver-470 - distro non-free recommended driver : nvidia-driver-535-server - distro non-free driver : nvidia-driver-470-server - distro non-free driver : nvidia-driver-525 - distro non-free driver : nvidia-driver-450-server - distro non-free driver : xserver-xorg-video-nouveau - distro free builtin== /sys/devices/pci0000:00/0000:00:0f.0 == modalias : pci:v000015ADd00000405sv000015ADsd00000405bc03sc00i00 vendor : VMware model : SVGA II Adapter driver : open-vm-tools-desktop - distro free
- The GPU’s modalias is recognized correctly, and the vendor and model information are accurate.
lspci | grep -i nvidiaand
lsmod | grep nvidiaindicate that Nvidia modules are loaded.
gpu2@avc1:~$lsmod | grep nvidia nvidia_uvm 1200128 0 nvidia_drm 65536 0 nvidia_modeset 1200128 1 nvidia_drm nvidia 35512320 5 nvidia_uvm,nvidia_modeset drm_kms_helper 307200 2 vmwgfx,nvidia_drm drm 618496 8 vmwgfx,drm_kms_helper,nvidia,nvidia_drm,ttm