"We have an issue of implementing NVIDIA driver 450 on RHEL 7.9 servers.
The nvidia card information:
[root@ai-hpcirfprd1 ~]# lspci | grep VGA
05:00.0 VGA compatible controller: NVIDIA Corporation GM200 [GeForce GTX TITAN X] (rev a1)
A nvidia driver was installed:
[root@ai-hpcirfprd1 ~]# lsmod | grep nvidia
nvidia_drm 48606 0
nvidia_modeset 1176938 2 nvidia_drm
nvidia 19658222 30 nvidia_modeset
drm_kms_helper 186531 1 nvidia_drm
drm 456166 3 drm_kms_helper,nvidia_drm
[root@ai-hpcirfprd1 ~]# nvidia-smi
Mon Apr 18 13:04:13 2022
±----------------------------------------------------------------------------+
| NVIDIA-SMI 450.66 Driver Version: 450.66 CUDA Version: 11.0 |
|-------------------------------±---------------------±---------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 GeForce GTX TIT… On | 00000000:05:00.0 Off | N/A |
| 22% 43C P8 29W / 250W | 1MiB / 12212MiB | 0% Default |
| | | N/A |
±------------------------------±---------------------±---------------------+
| 1 Tesla K40c On | 00000000:22:00.0 Off | 0 |
| 23% 32C P8 22W / 235W | 0MiB / 11441MiB | 0% Default |
| | | N/A |
±------------------------------±---------------------±---------------------+
±----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| No running processes found |
±----------------------------------------------------------------------------+
However, it does not show any graphical driver associated with nvidia card:
[root@ai-hpcirfprd1 ~]# xrandr --listproviders
Providers: number : 0
And dmesg showed nvidia module caused kernel taint:
[root@ai-hpcirfprd1 ~]# dmesg | grep taint
[ 15.196362] nvidia: loading out-of-tree module taints kernel.
[ 15.196374] nvidia: module license ‘NVIDIA’ taints kernel.
[ 15.196376] Disabling lock debugging due to kernel taint
[ 15.300715] nvidia: module verification failed: signature and/or required key missing - tainting kernel
Could you please let me know what is wrong here? Or I was missing something. And let me know if you need more information."