Nvidia-smi 'no devices found' on Ubuntu 22.04

Hi,

I’ve been banging my head against the wall to try to install the nvidia drivers necessary for (eventually) Plex hardware transcoding.

I have a fresh install of Ubuntu 22.04 on a VM, and passthrough seems to be working (I can see the device in Linux). I’ve installed the drivers like so:

sudo apt install nvidia-headless-550-server
sudo apt install nvidia-utils-550-server
sudo apt install libnvidia-encode-550-server

I’ve created the file /etc/modprobe.d/blacklist-nouveau.conf and added the following lines:

blacklist nouveau
options nouveau modeset=0

and ran sudo update-initramfs -u afterwards

After rebooting, nvidia-smi returns ‘No devices were found’
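One thing worth checking at this point is whether the blacklist actually took effect, i.e. that nouveau is not loaded and the nvidia kernel module is. The snippet below is only a sketch: check_gpu_modules is a made-up helper name, and it simply reads lsmod output on stdin and reports on the two module names.

```shell
# Hypothetical helper (not part of any NVIDIA tooling): reads `lsmod`
# output on stdin and reports whether the nouveau and nvidia kernel
# modules appear in it.
check_gpu_modules() {
    awk '
        $1 == "nouveau" { nouveau = 1 }
        $1 == "nvidia"  { nvidia = 1 }
        END {
            print "nouveau: " (nouveau ? "loaded" : "not loaded")
            print "nvidia: "  (nvidia  ? "loaded" : "not loaded")
        }
    '
}

# Usage on the guest:
#   lsmod | check_gpu_modules
```

If nvidia shows as loaded and nouveau does not, the blacklist is doing its job and the problem lies elsewhere, which matches the driver=nvidia line in the lshw output.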

output of lspci | grep NVIDIA:

0b:00.0 VGA compatible controller: NVIDIA Corporation GP106 [GeForce GTX 1060 6GB] (rev a1)

output of sudo lshw -c video:

  *-display
       description: VGA compatible controller
       product: SVGA II Adapter
       vendor: VMware
       physical id: f
       bus info: pci@0000:00:0f.0
       logical name: /dev/fb0
       version: 00
       width: 32 bits
       clock: 33MHz
       capabilities: vga_controller bus_master cap_list rom fb
       configuration: depth=32 driver=vmwgfx latency=64 resolution=1176,885
       resources: irq:16 ioport:2140(size=16) memory:e8000000-efffffff memory:f9000000-f97fffff memory:c0000-dffff
  *-display
       description: VGA compatible controller
       product: GP106 [GeForce GTX 1060 6GB]
       vendor: NVIDIA Corporation
       physical id: 0
       bus info: pci@0000:0b:00.0
       version: a1
       width: 64 bits
       clock: 33MHz
       capabilities: pm msi pciexpress vga_controller bus_master cap_list
       configuration: driver=nvidia latency=64
       resources: irq:19 memory:fb000000-fbffffff memory:c0000000-cfffffff memory:d0000000-d1ffffff ioport:3000(size=128

Can someone please help me get these damned drivers working??? Thank you in advance!

I’ve attached the bug report:
nvidia-bug-report.log (875.2 KB)

RmInitAdapter failed! (0x25:0x65:1589)
The GPU might be broken. Have you already tried reseating it in its slot? Does it work in another system?

The card works (I tested it in a Windows system), and I’ve already reseated it. I also tried installing these drivers with an older NVIDIA card (GT 710) and had very similar issues (‘no devices found’).

I only now noticed you’re using passthrough on an ESXi host, so this might be related to NVIDIA driver restrictions. Please try installing a 470 driver and attach a new nvidia-bug-report.log.

sudo apt install nvidia-headless-470-server
sudo apt install nvidia-utils-470-server
sudo apt install libnvidia-encode-470-server

Still no go, here’s the log:
nvidia-bug-report.log.gz (75.5 KB)

The error changed to
RmInitAdapter failed! (0x23:0xffff:1193)
which should be something like “general PCIe resource issue”, but that doesn’t make things any clearer.
Have you already tried updating the system BIOS of the host board?
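To compare the failure codes between driver versions without reading the whole bug report, the RmInitAdapter lines can be filtered out of the kernel log. This is a rough sketch: rm_init_errors is a made-up helper name, and it only assumes the message text looks like the lines quoted in this thread.

```shell
# Hypothetical helper: reads log text on stdin and prints each distinct
# "RmInitAdapter failed! (...)" message once, sorted.
rm_init_errors() {
    grep -o 'RmInitAdapter failed! ([^)]*)' | sort -u
}

# Usage on the guest:
#   sudo dmesg | rm_init_errors
#   zcat nvidia-bug-report.log.gz | rm_init_errors
```

An empty result just means no such message was found in the text that was piped in.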

I have now updated the BIOS to the latest available version. I also had a bit of a software snafu and will be retrying this using Proxmox instead of ESXi.

I’ll post my updated logs if and when I get to that point :)

I was able to get it working! I switched from ESXi to Proxmox and followed this tutorial:

I had a hard crash upon VM start after attempting the passthru (Proxmox node hung), so I had to alter grub:

GRUB_CMDLINE_LINUX_DEFAULT="quiet intel_iommu=on pcie_acs_override=downstream,multifunction initcall_blacklist=sysfb_init"

I think the video card was in an IOMMU group with other devices, so that line split them apart. After a reboot, I installed the 550 drivers as in the OP and nvidia-smi worked.
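For anyone wanting to confirm the grouping guess on their own host, the IOMMU groups can be listed from sysfs. The following is a minimal sketch, assuming the usual /sys/kernel/iommu_groups layout; list_iommu_groups is a made-up helper name, and the optional argument is there only so it can be pointed at a different directory.

```shell
# Hypothetical helper: prints "group <n>: <pci-address>" for every device
# found under the IOMMU groups directory (default: /sys/kernel/iommu_groups).
list_iommu_groups() {
    root=${1:-/sys/kernel/iommu_groups}
    for dev in "$root"/*/devices/*; do
        [ -e "$dev" ] || continue     # glob matched nothing: no groups exposed
        group=${dev#"$root"/}         # strip the root prefix
        group=${group%%/*}            # keep only the group number
        echo "group $group: ${dev##*/}"
    done
}

# Usage on the Proxmox host:
#   list_iommu_groups | sort -V
#   cat /proc/cmdline    # shows whether the GRUB parameters are active
```

If the GPU shares its group number with unrelated devices, that would be consistent with the hang described above. No output at all suggests the IOMMU is not enabled.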

Thanks!
