PowerEdge XE8545 - A100 - VMware 7.0.3 - Debian 10.11


I am using a PowerEdge XE8545 with 4x A100 in passthrough mode (in VMware 7.0.3).
We have assigned one A100 to one VM (with Debian 10.11) and added the following parameters to the VM config.

  • pciPassthru.use64bitMMIO="TRUE"
  • pciPassthru.64bitMMIOSizeGB=160
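
One thing that may be worth checking is the MMIO size itself. A rule of thumb often quoted for this setting is to take the total GPU memory assigned to the VM (in GB) and round it up to the next power of two. Treat that rule as an assumption here and verify it against the current VMware documentation. The helper name `next_pow2` is hypothetical; this is only a sketch of the arithmetic:

```shell
# Round a positive integer up to the next power of two.
# Assumption: the MMIO size is expected to be a power of two (not confirmed here).
next_pow2() {
    n=$1
    p=1
    while [ "$p" -lt "$n" ]; do
        p=$((p * 2))
    done
    echo "$p"
}
```

Under that assumption, `next_pow2 160` prints 256, so the value 160 used above would not be a power of two.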

After the VM has booted, we can see the A100 with the command "lspci -v", which gives the following output:
13:00.0 3D controller: NVIDIA Corporation Device 20b2 (rev a1)
Subsystem: NVIDIA Corporation Device 147f
Physical Slot: 224
Flags: fast devsel, IRQ 16
Memory at fc000000 (32-bit, non-prefetchable) [size=16M]
Memory at (64-bit, prefetchable)
Memory at e4000000 (64-bit, prefetchable) [size=32M]
Capabilities: [60] Power Management version 3
Capabilities: [68] #00 [0080]
Capabilities: [78] Express Endpoint, MSI 00
Capabilities: [c8] MSI-X: Enable- Count=6 Masked-
Capabilities: [100] Virtual Channel
Capabilities: [250] Latency Tolerance Reporting
Capabilities: [258] L1 PM Substates
Capabilities: [128] Power Budgeting <?>
Capabilities: [420] Advanced Error Reporting
Capabilities: [600] Vendor Specific Information: ID=0001 Rev=1 Len=024 <?>
Capabilities: [900] #19
Capabilities: [bb0] #15
Capabilities: [bcc] Single Root I/O Virtualization (SR-IOV)
Capabilities: [c14] Alternative Routing-ID Interpretation (ARI)
Capabilities: [c1c] #26
Capabilities: [d00] #27
Capabilities: [e00] #25
Kernel modules: nouveau
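
In the output above, the second memory region is listed without an address or size, which appears to correspond to the BAR that the driver later reports as invalid. A small sketch for spotting such regions quickly follows. The function name `unassigned_bars` is hypothetical, and it only assumes that an assigned region carries a `[size=...]` field in `lspci -v` output:

```shell
# Read `lspci -v` output on stdin and print the "Memory at" lines that
# have no "[size=...]" field, i.e. regions that look unassigned.
unassigned_bars() {
    grep 'Memory at' | grep -v '\[size='
}

# Example usage inside the VM (bus address taken from the output above):
#   lspci -v -s 13:00.0 | unassigned_bars
```

If the function prints nothing, every memory region of the device has a size assigned.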

We downloaded the driver "NVIDIA-Linux-x86_64-515.76.run" and installed the following packages before installing the driver.

  • apt -y install linux-headers-$(uname -r) build-essential libglvnd-dev pkg-config

At the end of the driver installation we got the following error in "/var/log/nvidia-installer.log":
NVRM: This PCI I/O region assigned to your NVIDIA device is invalid:
NVRM: BAR1 is 0M @ 0x0 (PCI:0000:13:00.0)

I have tested both of the following kernel parameters, but the problem is the same:
pci=realloc=off or pci=realloc
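
To rule out the case where a parameter never reached the running kernel, the active command line can be checked after a reboot. On Debian the parameter is normally added to GRUB_CMDLINE_LINUX_DEFAULT in /etc/default/grub, followed by `update-grub`. The sketch below uses a hypothetical helper name, `cmdline_has`, and matches the parameter as a whole word:

```shell
# Succeed if the kernel command line read on stdin contains the given
# parameter as an exact, whitespace-separated word.
cmdline_has() {
    tr ' ' '\n' | grep -qxF -- "$1"
}

# Example usage inside the VM:
#   cmdline_has pci=realloc=off < /proc/cmdline && echo "parameter is active"
```

Matching whole words keeps `pci=realloc` from being reported as present when only `pci=realloc=off` was set.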

Any help fixing this would be appreciated.