Nvidia-smi "No devices were found"

What I’ve tried:

  • adding pci=nocrs,realloc,rom to /etc/default/grub
    • this helped get me from “ERROR: could not insert ‘nvidia’: No such device” to “No devices were found”.
  • echo 1 > /sys/bus/pci/devices/0000\:00\:01.0/remove ; echo 1 > /sys/bus/pci/rescan
    • Thread 197768

Hardware I have:

  • 2013 Mac Pro “Trashcan”
  • Ubuntu 20.04 with all packages updated
  • eGPU bridge 5e:00.0 PCI bridge: Intel Corporation JHL7540 Thunderbolt 3 Bridge [Titan Ridge DD 2018] (rev 06)
  • NVIDIA RTX A5000

Why can’t nvidia-smi see my GPU even though /dev/nvidia0 exists? :(

I’ve attached the nvidia-bug-report.log.gz to this post.
nvidia-bug-report.log.gz (128.5 KB)

Also, these messages keep repeating in my dmesg log, regardless of which driver I install (470, 515, 525):

[  709.843811] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  470.161.03  Wed Oct 19 00:10:36 UTC 2022
[  709.862647] nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms  470.161.03  Wed Oct 19 00:05:15 UTC 2022
[  709.869723] [drm] [nvidia-drm] [GPU ID 0x00001900] Loading driver
[  709.869727] [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:19:00.0 on minor 2
[  709.896517] nvidia_uvm: module uses symbols from proprietary module nvidia, inheriting taint.
[  709.900037] nvidia-uvm: Loaded the UVM driver, major device number 504.
[  709.919576] [drm] [nvidia-drm] [GPU ID 0x00001900] Unloading driver
[  709.960842] nvidia-modeset: Unloading
[  709.989640] nvidia-uvm: Unloaded the UVM driver.
[  710.026059] nvidia-nvlink: Unregistered the Nvlink Core, major device number 507
[  710.653570] nvidia-nvlink: Nvlink Core is being initialized, major device number 507

[  710.713327] nvidia 0000:19:00.0: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[  712.220813] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  470.161.03  Wed Oct 19 00:10:36 UTC 2022
[  712.252962] nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms  470.161.03  Wed Oct 19 00:05:15 UTC 2022
[  712.261056] [drm] [nvidia-drm] [GPU ID 0x00001900] Loading driver
[  712.261060] [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:19:00.0 on minor 2
[  712.290075] nvidia_uvm: module uses symbols from proprietary module nvidia, inheriting taint.
[  712.293589] nvidia-uvm: Loaded the UVM driver, major device number 504.
[  712.315195] [drm] [nvidia-drm] [GPU ID 0x00001900] Unloading driver
[  712.344952] nvidia-modeset: Unloading
[  712.383611] nvidia-uvm: Unloaded the UVM driver.
[  712.406006] nvidia-nvlink: Unregistered the Nvlink Core, major device number 507

Please check if this helps:
https://forums.developer.nvidia.com/t/k-ubuntu-22-10-not-booting-kernel-oops-for-driver-450-with-egpu/235008/3?u=generix

Thanks for the reply! I’ll try a version of the 470 driver in the range you mentioned. Would it work with an ampere/A series card?

Support for the RTX 5000 (desktop) was added in 460.73.01

1 Like

Awesome, thanks! I’ll try that and report back.