Nvidia driver installed, but nvidia-smi says no devices found

I’m trying to install on Nvidia GTX 2080 on Ubuntu version 20.04. I installed the drivers using
sudo ubuntu-drivers autoinstall
and it worked. But
nvidia-smi
says “no devices were found’”

I’m not sure what the problem might be, the bug report it attached below.
Thank you!
nvidia-bug-report.log.gz (113.8 KB)

[ 4.004065] kernel: [drm] [nvidia-drm] [GPU ID 0x00000100] Loading driver
[ 6.109084] kernel: NVRM: Open nvidia.ko is only ready for use on Data Center GPUs.
[ 6.109087] kernel: NVRM: To force use of Open nvidia.ko on other GPUs, see the
[ 6.109088] kernel: NVRM: ‘OpenRmEnableUnsupportedGpus’ kernel module parameter described
[ 6.109089] kernel: NVRM: in the README.
[ 6.463452] kernel: NVRM: GPU 0000:01:00.0: RmInitAdapter failed! (0x62:0x0:1849)

Looks like you installed the -open version of the driver, which is not really compatible with your computer.
Try to install the proprietary version, which does not say open kernel using Software & Updates.

1 Like

Thank you Mart! I have deleted the open driver and installed the one you suggested. Unfortunately, the same issue remains. I attached the new bug report below.
nvidia-bug-report.log.gz (133.9 KB)

When you created the bug report, the nouveau driver was loaded.
I see you installed the driver, and 20 seconds later you created the bug report. Did you even reboot?

Anyways, the nouveau driver needs to be blacklisted in order for the nvidia driver to load.
Search the web for instructions how to.

1 Like

That’s true; the report is before the reboot. I tried getting another one after rebooting, but the computer didn’t start after the installation. I tried to create the bug report in safe mode after rebooting, but it got stuck. I will attempt to blacklist the Nouveau driver and see if it helps.

@Mart, I have blacklisted the driver and run the debug report during reboot. It got stuck and didn’t reboot correctly until I purged the nvidia driver.

Note: The error when invoking nvidia-smi is now different: " NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running."

Here’s the bug report:
nvidia-bug-report.log.gz (134.9 KB)

Your log says, for a time you hadn’t blacklisted nouveau correctly, so the nvidia driver could not load. But looks like you solved it.

Now the driver crashes, when trying to load:

[ 5.513547] NVRM: GPU 0000:01:00.0: RmInitAdapter failed! (0x26:0x56:1474)
[ 5.513606] BUG: unable to handle page fault for address: 0000000000004628
[ 5.513642] NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device minor number 0
[ 5.514046] #PF: supervisor read access in kernel mode
[ 5.514827] #PF: error_code(0x0000) - not-present page
[ 5.515194] PGD 0 P4D 0
[ 5.515588] Oops: 0000 [#1] SMP PTI
[ 5.515952] CPU: 4 PID: 527 Comm: nv_queue Tainted: P O 5.15.0-72-generic #79~20.04.1-Ubuntu
[ 5.516335] Hardware name: Neousys Technology Inc. Nuvo-6108GC/NVS-6108, BIOS Build180829 08/29/2018
[ 5.516724] RIP: 0010:_nv010655rm+0x3b/0xb0 [nvidia]
[ 5.517715] Code: 33 9d dd 02 48 8b bb 68 01 00 00 e8 7f cc 5a 00 85 c0 74 0b 48 83 c4 08 5b 41 5c c3 0f 1f 00 44 89 e7 e8 98 67 b6 ff 48 89 c7 <8b> 80 28 46 00 00 83 f8 01 74 38 80 bf 71 07 00 00 00 74 49 80 bf
[ 5.518560] RSP: 0018:ffffb62800bdfdc8 EFLAGS: 00010246
[ 5.518977] RAX: 0000000000000000 RBX: ffff8d69d2bfb408 RCX: 0000000000000000
[ 5.519404] RDX: ffffb62800ec3008 RSI: 0000000000000000 RDI: 0000000000000000
[ 5.519833] RBP: ffff8d69c61b3000 R08: 0000000000000000 R09: ffffb62800bdfe70
[ 5.520317] R10: 0000000000000001 R11: 000000006472a1ee R12: 0000000000000000
[ 5.520744] R13: ffffb62800bdfec8 R14: ffff8d69cd060b28 R15: ffff8d69cd40b9c0
[ 5.521210] FS: 0000000000000000(0000) GS:ffff8d70fdd00000(0000) knlGS:0000000000000000
[ 5.521657] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 5.522135] CR2: 0000000000004628 CR3: 00000005f5410006 CR4: 00000000003706e0
[ 5.522594] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 5.523049] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 5.523509] Call Trace:
[ 5.523961]
[ 5.524414] ? rm_execute_work_item+0xed/0x130 [nvidia]
[ 5.525144] ? os_execute_work_item+0x69/0x90 [nvidia]
[ 5.525737] ? _main_loop+0x89/0x140 [nvidia]
[ 5.526327] ? _raw_q_schedule+0x80/0x80 [nvidia]
[ 5.526979] ? kthread+0x127/0x150
[ 5.527432] ? set_kthread_struct+0x50/0x50
[ 5.527890] ? ret_from_fork+0x1f/0x30
[ 5.528375]
[ 5.528826] Modules linked in: nvidia_drm(PO+) intel_rapl_msr intel_rapl_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp nvidia_modeset(PO) coretemp kvm_intel kvm crct10dif_pclmul ghash_clmulni_intel nvidia(PO) aesni_intel crypto_simd cryptd rapl snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi snd_hda_codec snd_hda_core snd_hwdep rt2800usb rt2x00usb snd_pcm joydev rt2800lib input_leds snd_seq_midi rt2x00lib snd_seq_midi_event intel_cstate hid_generic intel_wmi_thunderbolt binfmt_misc snd_rawmidi mac80211 serio_raw drm_kms_helper snd_seq cfg80211 cec libarc4 ee1004 rc_core snd_seq_device snd_timer fb_sys_fops ucsi_ccg mei_me snd syscopyarea typec_ucsi kvaser_pciefd sysfillrect typec sysimgblt soundcore mei can_dev ie31200_edac acpi_pad mac_hid sch_fq_codel msr parport_pc ppdev lp parport ramoops reed_solomon drm pstore_blk pstore_zone efi_pstore ip_tables x_tables autofs4 usbhid hid uas
[ 5.528878] usb_storage i2c_i801 crc32_pclmul e1000e i2c_smbus igb ahci i2c_algo_bit dca libahci i2c_nvidia_gpu xhci_pci xhci_pci_renesas wmi video
[ 5.534320] CR2: 0000000000004628
[ 5.534918] —[ end trace f072984c670e759f ]—

Those things I would try:
First look for a BIOS update.

Try again with the nvidia driver. You can look at the dmesg output in the bug report, to see if it still crashes.
If yes, I’d try a different driver version. 525 series, or maybe even 515 series.