Ubuntu Tesla P40 NVRM: GPU 0000:03:00.0: RmInitAdapter

On Ubuntu 20.04, nvidia-smi reports "No devices were found".
dmesg shows the following errors:

nvidia-bug-report.log.gz (124.8 KB)

root@nas:~# dmesg | grep nvidia
[ 17.292492] nvidia: loading out-of-tree module taints kernel.
[ 17.292514] nvidia: module license 'NVIDIA' taints kernel.
[ 17.318934] nvidia: module verification failed: signature and/or required key missing - tainting kernel
[ 17.340980] nvidia-nvlink: Nvlink Core is being initialized, major device number 234
[ 17.359051] nvidia 0000:03:00.0: enabling device (0000 -> 0002)
[ 17.475010] nvidia 0000:08:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=io+mem
[ 17.832200] nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms 470.161.03 Wed Oct 19 00:05:15 UTC 2022
[ 17.906412] [drm] [nvidia-drm] [GPU ID 0x00000300] Loading driver
[ 17.906418] [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:03:00.0 on minor 0
[ 17.906560] [drm] [nvidia-drm] [GPU ID 0x00000800] Loading driver
[ 17.906564] [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:08:00.0 on minor 1
[ 18.033302] nvidia_uvm: module uses symbols from proprietary module nvidia, inheriting taint.
[ 18.039743] nvidia-uvm: Loaded the UVM driver, major device number 510.
[ 22.855118] audit: type=1400 audit(1680238721.288:8): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe" pid=733 comm="apparmor_parser"
[ 22.855124] audit: type=1400 audit(1680238721.288:9): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe//kmod" pid=733 comm="apparmor_parser"
[ 43.938339] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
[ 44.454095] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
[ 91.140726] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
[ 91.438657] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
[ 293.604316] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
[ 293.943099] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
[ 294.956695] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
[ 295.296927] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
[ 296.454594] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
[ 296.741489] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
[ 993.445791] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
[ 993.792544] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs

root@nas:~# dmesg | grep NVRM
[ 17.590950] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 470.161.03 Wed Oct 19 00:10:36 UTC 2022
[ 44.170753] NVRM: GPU 0000:08:00.0: RmInitAdapter failed! (0x24:0xffff:1211)
[ 44.170841] NVRM: GPU 0000:08:00.0: rm_init_adapter failed, device minor number 1
[ 44.504231] NVRM: GPU 0000:08:00.0: RmInitAdapter failed! (0x24:0xffff:1211)
[ 44.504299] NVRM: GPU 0000:08:00.0: rm_init_adapter failed, device minor number 1
[ 45.177117] NVRM: GPU 0000:03:00.0: RmInitAdapter failed! (0x24:0xffff:1211)
[ 45.177199] NVRM: GPU 0000:03:00.0: rm_init_adapter failed, device minor number 0
[ 45.529481] NVRM: GPU 0000:03:00.0: RmInitAdapter failed! (0x24:0xffff:1211)
[ 45.529531] NVRM: GPU 0000:03:00.0: rm_init_adapter failed, device minor number 0
[ 90.540201] NVRM: GPU 0000:03:00.0: RmInitAdapter failed! (0x24:0xffff:1211)
[ 90.540247] NVRM: GPU 0000:03:00.0: rm_init_adapter failed, device minor number 0
[ 90.900817] NVRM: GPU 0000:03:00.0: RmInitAdapter failed! (0x24:0xffff:1211)
[ 90.900862] NVRM: GPU 0000:03:00.0: rm_init_adapter failed, device minor number 0
[ 91.197393] NVRM: GPU 0000:08:00.0: RmInitAdapter failed! (0x24:0xffff:1211)
[ 91.197444] NVRM: GPU 0000:08:00.0: rm_init_adapter failed, device minor number 1
[ 91.490199] NVRM: GPU 0000:08:00.0: RmInitAdapter failed! (0x24:0xffff:1211)
[ 91.490273] NVRM: GPU 0000:08:00.0: rm_init_adapter failed, device minor number 1
[ 292.998991] NVRM: GPU 0000:03:00.0: RmInitAdapter failed! (0x24:0xffff:1211)

Hello and welcome to the NVIDIA developer forums @1946831552.

Looking at the log output, my first suggestion would be to blacklist the nouveau driver. It looks like it gets loaded before the NVIDIA driver and thereby blocks the NVIDIA driver from initializing properly.

For example like this:

$ sudo bash -c "echo blacklist nouveau > /etc/modprobe.d/blacklist-nvidia-nouveau.conf"
$ sudo bash -c "echo options nouveau modeset=0 >> /etc/modprobe.d/blacklist-nvidia-nouveau.conf"
$ sudo update-initramfs -u
$ sudo reboot

For more information on the Nouveau driver, please refer to the Linux driver README, “8.1. Interaction with the Nouveau Driver”.
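
After the reboot it is worth confirming that the blacklist actually took effect. The following is a minimal sketch, not something from the README: `nouveau_status` is a hypothetical helper name, and it only assumes that `lsmod` lists one module name per line in its first column.

```shell
#!/usr/bin/env bash
# nouveau_status: hypothetical helper, not part of any NVIDIA tool.
# Reads `lsmod`-style output on stdin and reports whether a module
# named exactly "nouveau" appears in the first column.
nouveau_status() {
    if awk '$1 == "nouveau" { found = 1 } END { exit !found }'; then
        echo "nouveau is loaded"
    else
        echo "nouveau is not loaded"
    fi
}

# Typical use after the reboot:
#   lsmod | nouveau_status
```

If it still reports that nouveau is loaded, the initramfs was probably not regenerated, so rerunning `sudo update-initramfs -u` and rebooting would be the first thing to try.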

I hope that helps!

I tried this method, but it still didn’t work.

root@nas:~# dmesg | grep nvidia
[ 16.441562] nvidia: loading out-of-tree module taints kernel.
[ 16.441585] nvidia: module license 'NVIDIA' taints kernel.
[ 16.464741] nvidia: module verification failed: signature and/or required key missing - tainting kernel
[ 16.489689] nvidia-nvlink: Nvlink Core is being initialized, major device number 234
[ 16.507659] nvidia 0000:03:00.0: enabling device (0000 -> 0002)
[ 16.623684] nvidia 0000:08:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=io+mem
[ 16.822452] nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms 470.161.03 Wed Oct 19 00:05:15 UTC 2022
[ 16.874452] [drm] [nvidia-drm] [GPU ID 0x00000300] Loading driver
[ 16.874456] [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:03:00.0 on minor 0
[ 16.874555] [drm] [nvidia-drm] [GPU ID 0x00000800] Loading driver
[ 16.874558] [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:08:00.0 on minor 1
[ 17.001883] nvidia_uvm: module uses symbols from proprietary module nvidia, inheriting taint.
[ 17.008278] nvidia-uvm: Loaded the UVM driver, major device number 510.
[ 20.725198] audit: type=1400 audit(1680262862.156:7): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe" pid=720 comm="apparmor_parser"
[ 20.725211] audit: type=1400 audit(1680262862.156:8): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe//kmod" pid=720 comm="apparmor_parser"
[ 41.337553] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
[ 41.852388] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
[ 61.714964] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
[ 62.043728] caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs

root@nas:~# dmesg | grep NVRM
[ 16.739699] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 470.161.03 Wed Oct 19 00:10:36 UTC 2022
[ 41.571792] NVRM: GPU 0000:08:00.0: RmInitAdapter failed! (0x24:0xffff:1211)
[ 41.571900] NVRM: GPU 0000:08:00.0: rm_init_adapter failed, device minor number 1
[ 41.897384] NVRM: GPU 0000:08:00.0: RmInitAdapter failed! (0x24:0xffff:1211)
[ 41.897439] NVRM: GPU 0000:08:00.0: rm_init_adapter failed, device minor number 1
[ 42.554724] NVRM: GPU 0000:03:00.0: RmInitAdapter failed! (0x24:0xffff:1211)
[ 42.554787] NVRM: GPU 0000:03:00.0: rm_init_adapter failed, device minor number 0
[ 42.872728] NVRM: GPU 0000:03:00.0: RmInitAdapter failed! (0x24:0xffff:1211)
[ 42.872773] NVRM: GPU 0000:03:00.0: rm_init_adapter failed, device minor number 0

nvidia-bug-report.log.gz (126.6 KB)

Difficult to say what could cause this now. Do you have a way of verifying the P40 is working correctly?

Sometimes the message
caller os_map_kernel_space+0xa0/0xf0 [nvidia] mapping multiple BARs
can indicate a HW defect on the card.
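
Since that message is about BAR mappings, one thing that could be checked before concluding the card is defective is whether the system assigned addresses to all of the card's memory regions. This is only a sketch under assumptions: `unassigned_bars` is a hypothetical helper, and it relies on `lspci` from pciutils printing `<unassigned>` for a region that has no address.

```shell
#!/usr/bin/env bash
# unassigned_bars: hypothetical helper, not an NVIDIA or pciutils tool.
# Reads `lspci -vv` output on stdin and prints any memory regions (BARs)
# that lspci reports as "<unassigned>". Prints nothing if all are assigned.
unassigned_bars() {
    grep -E 'Region [0-9]+: Memory at <unassigned>' || true
}

# Typical use for the first GPU from the log:
#   sudo lspci -vv -s 03:00.0 | unassigned_bars
```

No output would suggest the regions were assigned; any output would point at the platform's address assignment rather than proving a defect in the card.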

What server are you running this on, and how many P40 cards are there? Does the server support this setup?
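
The logs posted so far mention two PCI addresses, 0000:03:00.0 and 0000:08:00.0. As a rough sketch for seeing which devices fail and how often, the failures can be tallied per address; `failures_by_gpu` is a hypothetical helper name, and it assumes the `NVRM: GPU <address>: RmInitAdapter failed!` line format shown in the output above.

```shell
#!/usr/bin/env bash
# failures_by_gpu: hypothetical helper, not part of the NVIDIA driver.
# Reads dmesg output on stdin and prints "<pci-address> <count>" for each
# GPU that logged "RmInitAdapter failed", sorted by address.
failures_by_gpu() {
    awk '/RmInitAdapter failed/ {
        for (i = 1; i < NF; i++) {
            if ($i == "GPU") {
                addr = $(i + 1)
                sub(/:$/, "", addr)   # drop the trailing colon
                count[addr]++
                break
            }
        }
    }
    END { for (a in count) print a, count[a] }' | sort
}

# Typical use:
#   dmesg | failures_by_gpu
```

Comparing those addresses against `lspci -nn | grep -i nvidia` would show which physical devices they correspond to.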

I can start it under Windows on my friend’s motherboard.
I am running it on an X79 motherboard.
I suspect that the motherboard is incompatible with it.
It may also be a fault in the P40.
I plan to run Windows on this motherboard tomorrow and install the P40 driver, to verify whether it is a motherboard issue.
Thank you for your help!!
Thank you!