Okay, @routenull I finally got access to a RHEL8 machine with a Quadro P2000 GPU. The installation was successful on bare-metal. Is it possible that there is a configuration issue with the GPU pass-through to VM?
Anyway, I am providing step-by-step with output so you can follow along and let me know where there is divergence on your machine. Note: for precompiled, the following are optional: gcc
, EPEL repo (dkms
), kernel-devel and kernel-headers packages.
Pre-installation actions
$ lspci | grep -i nvidia
65:00.0 VGA compatible controller: NVIDIA Corporation GP106GL [Quadro P2000] (rev a1)
65:00.1 Audio device: NVIDIA Corporation GP106 High Definition Audio Controller (rev a1)
$ uname -m && cat /etc/*release
x86_64
NAME="Red Hat Enterprise Linux"
VERSION="8.5 (Ootpa)"
...
$ gcc --version
gcc (GCC) 8.5.0 20210514 (Red Hat 8.5.0-4)
...
$ uname -r
4.18.0-348.20.1.el8_5.x86_64
Verify matching versions
$ sudo dnf install kernel-devel-$(uname -r) kernel-headers-$(uname -r)
$ rpm -qa | grep kernel | sort | grep $(uname -r)
kernel-4.18.0-348.20.1.el8_5.x86_64
kernel-core-4.18.0-348.20.1.el8_5.x86_64
kernel-devel-4.18.0-348.20.1.el8_5.x86_64
kernel-headers-4.18.0-348.20.1.el8_5.x86_64
kernel-modules-4.18.0-348.20.1.el8_5.x86_64
kernel-tools-4.18.0-348.20.1.el8_5.x86_64
kernel-tools-libs-4.18.0-348.20.1.el8_5.x86_64
CUDA Download Page instructions
$ sudo dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/cuda-rhel8.repo
$ sudo dnf clean all
$ sudo dnf repolist
repo id repo name
cuda-rhel8-x86_64 cuda-rhel8-x86_64
epel Extra Packages for Enterprise Linux 8 - x86_64
epel-modular Extra Packages for Enterprise Linux Modular 8 - x86_64
rhel-8-for-x86_64-appstream-rpms Red Hat Enterprise Linux 8 for x86_64 - AppStream (RPMs)
rhel-8-for-x86_64-baseos-rpms Red Hat Enterprise Linux 8 for x86_64 - BaseOS (RPMs)
Install the latest
precompiled kernel module stream
$ sudo dnf module install nvidia-driver:latest
Updating Subscription Management repositories.
Last metadata expiration check: 0:00:24 ago on Thu 31 Mar 2022 02:15:56 PM PDT.
Dependencies resolved.
=====================================================================================================================================
Package Architecture Version Repository Size
=====================================================================================================================================
Installing group/module packages:
cuda-drivers x86_64 510.47.03-1 cuda-rhel8-x86_64 7.0 k
nvidia-driver x86_64 3:510.47.03-1.el8 cuda-rhel8-x86_64 22 M
nvidia-driver-NVML x86_64 3:510.47.03-1.el8 cuda-rhel8-x86_64 516 k
nvidia-driver-NvFBCOpenGL x86_64 3:510.47.03-1.el8 cuda-rhel8-x86_64 52 k
nvidia-driver-cuda x86_64 3:510.47.03-1.el8 cuda-rhel8-x86_64 591 k
nvidia-driver-cuda-libs x86_64 3:510.47.03-1.el8 cuda-rhel8-x86_64 63 M
nvidia-driver-devel x86_64 3:510.47.03-1.el8 cuda-rhel8-x86_64 12 k
nvidia-driver-libs x86_64 3:510.47.03-1.el8 cuda-rhel8-x86_64 168 M
nvidia-kmod-common noarch 3:510.47.03-1.el8 cuda-rhel8-x86_64 12 k
nvidia-libXNVCtrl x86_64 3:510.47.03-1.el8 cuda-rhel8-x86_64 25 k
nvidia-libXNVCtrl-devel x86_64 3:510.47.03-1.el8 cuda-rhel8-x86_64 55 k
nvidia-modprobe x86_64 3:510.47.03-1.el8 cuda-rhel8-x86_64 36 k
nvidia-persistenced x86_64 3:510.47.03-1.el8 cuda-rhel8-x86_64 42 k
nvidia-settings x86_64 3:510.47.03-1.el8 cuda-rhel8-x86_64 832 k
nvidia-xconfig x86_64 3:510.47.03-1.el8 cuda-rhel8-x86_64 105 k
Installing dependencies:
dnf-plugin-nvidia noarch 2.0-1.el8 cuda-rhel8-x86_64 12 k
egl-wayland x86_64 1.1.7-1.el8 rhel-8-for-x86_64-appstream-rpms 34 k
kmod-nvidia-510.47.03-4.18.0-348.20.1 x86_64 3:510.47.03-3.el8_5 cuda-rhel8-x86_64 29 M
libX11-devel x86_64 1.6.8-5.el8 rhel-8-for-x86_64-appstream-rpms 976 k
libXau-devel x86_64 1.0.9-3.el8 rhel-8-for-x86_64-appstream-rpms 21 k
libglvnd-opengl x86_64 1:1.3.2-1.el8 rhel-8-for-x86_64-appstream-rpms 47 k
libvdpau x86_64 1.4-2.el8 rhel-8-for-x86_64-appstream-rpms 41 k
libxcb-devel x86_64 1.13.1-1.el8 rhel-8-for-x86_64-appstream-rpms 1.1 M
mesa-vulkan-drivers x86_64 21.1.5-1.el8 rhel-8-for-x86_64-appstream-rpms 6.1 M
ocl-icd x86_64 2.2.12-1.el8 rhel-8-for-x86_64-appstream-rpms 51 k
opencl-filesystem noarch 1.0-6.el8 rhel-8-for-x86_64-appstream-rpms 8.5 k
vulkan-loader x86_64 1.2.198.0-2.el8_5 rhel-8-for-x86_64-appstream-rpms 123 k
xorg-x11-proto-devel noarch 2020.1-3.el8 rhel-8-for-x86_64-appstream-rpms 280 k
Installing module profiles:
nvidia-driver/default
Enabling module streams:
nvidia-driver latest
Transaction Summary
=====================================================================================================================================
Install 28 Packages
Total download size: 292 M
Installed size: 697 M
Is this ok [y/N]: y
$ rpm -qa | grep nvidia | sort
dnf-plugin-nvidia-2.0-1.el8.noarch
kmod-nvidia-510.47.03-4.18.0-348.20.1-510.47.03-3.el8_5.x86_64
nvidia-driver-510.47.03-1.el8.x86_64
nvidia-driver-cuda-510.47.03-1.el8.x86_64
nvidia-driver-cuda-libs-510.47.03-1.el8.x86_64
nvidia-driver-devel-510.47.03-1.el8.x86_64
nvidia-driver-libs-510.47.03-1.el8.x86_64
nvidia-driver-NvFBCOpenGL-510.47.03-1.el8.x86_64
nvidia-driver-NVML-510.47.03-1.el8.x86_64
nvidia-kmod-common-510.47.03-1.el8.noarch
nvidia-libXNVCtrl-510.47.03-1.el8.x86_64
nvidia-libXNVCtrl-devel-510.47.03-1.el8.x86_64
nvidia-modprobe-510.47.03-1.el8.x86_64
nvidia-persistenced-510.47.03-1.el8.x86_64
nvidia-settings-510.47.03-1.el8.x86_64
nvidia-xconfig-510.47.03-1.el8.x86_64
$ find /lib/modules -name "nvidia*ko*" | sort
/lib/modules/4.18.0-348.20.1.el8_5.x86_64/extra/drivers/video/nvidia/nvidia-drm.ko
/lib/modules/4.18.0-348.20.1.el8_5.x86_64/extra/drivers/video/nvidia/nvidia.ko
/lib/modules/4.18.0-348.20.1.el8_5.x86_64/extra/drivers/video/nvidia/nvidia-modeset.ko
/lib/modules/4.18.0-348.20.1.el8_5.x86_64/extra/drivers/video/nvidia/nvidia-peermem.ko
/lib/modules/4.18.0-348.20.1.el8_5.x86_64/extra/drivers/video/nvidia/nvidia-uvm.ko
$ sudo reboot
After rebooting
$ lsmod | grep nvidia | sort
drm 573440 12 drm_kms_helper,drm_vram_helper,ast,nvidia,drm_ttm_helper,nvidia_drm,ttm
drm_kms_helper 253952 5 drm_vram_helper,ast,nvidia_drm
nvidia 38502400 321 nvidia_uvm,nvidia_modeset
nvidia_drm 61440 3
nvidia_modeset 1118208 6 nvidia_drm
nvidia_uvm 1085440 0
$ nvidia-smi
Thu Mar 31 14:24:17 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 510.47.03 Driver Version: 510.47.03 CUDA Version: 11.6 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 Quadro P2000 Off | 00000000:65:00.0 On | N/A |
| 45% 33C P8 5W / 75W | 65MiB / 5120MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| 0 N/A N/A 2278 G /usr/libexec/Xorg 40MiB |
| 0 N/A N/A 3393 G /usr/bin/gnome-shell 22MiB |
+-----------------------------------------------------------------------------+
$ cat /proc/driver/nvidia/version
NVRM version: NVIDIA UNIX x86_64 Kernel Module 510.47.03 Mon Jan 24 22:58:54 UTC 2022
GCC version: gcc version 8.5.0 20210514 (Red Hat 8.5.0-4) (GCC)