Hello,
I am having trouble getting an NVIDIA A100 to work on a SuperMicro M11SDV-8C±LN4F server board with little success. I am getting the “NVRM: This PCI I/O region assigned to your NVIDIA device is invalid: NVRM: BAR0 is 0M @ 0x0 (PCI:0000:05:00.0)” error whenever I boot or attempt to use NVIDIA-SMI, which reports that the driver is not active due to being unable to bind to any devices. The card can be found via lspci:
05:00.0 3D controller: NVIDIA Corporation GA100 [A100 PCIe 80GB] (rev ff)
After many hours of Google I have tried the following remedies with no success:
- Ensuring that Above 4G Encoding is enabled (it’s enabled by default)
- pci=realloc
- pci=realloc=off
- pci=nocrs
- Trying the above kernel paramters with a pci-e rescan
- Ensuring that the OS is installed in EFI mode
I have also tried every solution above with RHEL 9, Rocky 9, Rocky 8, Alma 9, and Ubuntu Server 22.04.2.
Here’s the bug report dump: nvidia-bug-report.log.gz (51.8 KB)
Any guidance would be much appreciated. Alternatively, any recommendations for mini-itx form factor server boards that are known to work with the A100 would be very welcome.
Thanks in advance,
James