VFIO Passthrough for HGX B200 system

dmitry46 · July 22, 2025, 5:59pm

We’re configuring the virtualization software for the customer for HGX B200 8-GPU. Our stack utilizes VFIO / KVM / Qemu for direct GPU passthrough. We run into the following issue with this system:

The VM is created, and the GPU is successfully passed through and visible with lspci.
nvidia-smi: Shows Driver Version: 570.172.08 for the B200. Persistence mode is “On”.
deviceQuery: Reports CUDA Error: initialization error (code 3).
We consistently see the following error in the dmesg

NVRM: kbifCacheVFInfo_GB100: Unable to read NV_PF0_INITIAL_AND_TOTAL_VFS
NVRM: calculatePCIELinkRateMBps: Unknown PCIe speed
NVRM: getPCIELinkRateMBps: Generic Error: Invalid state [NV_ERR_INVALID_STATE]
[drm] [nvidia-drm] [GPU ID 0x00000010] Failed to allocate NvKmsKapiDevice

Is this a known issue? What would be the fastest way to resolve the problem, i.e., minimizing the amount of changes necessary in our software?

Topic		Replies	Views
ENOMEM when running CUDA sample on host GPU where another GPU is passed through via IOMMU/vfio-pci Linux	1	781	May 19, 2019
Error running cuda on VM with GPU passthrough. cuda.get_device_name() returns 802, not initialized CUDA Setup and Installation	5	766	June 19, 2025
Vfio passthrough to ubuntu20.04 guest GeForce RTX 2070 - 440.100 - RmInitAdapter failed! (0x23:0x56:515) Linux	2	1334	October 12, 2021
Issue with pci passthrough in ESXi and with ubuntu Linux	4	2151	January 11, 2022
This PCI I/O region assigned to your NVIDIA device is invalid: Linux cuda	5	5884	October 12, 2021
Nvidia 2x H100s passthrough failed with insufficient memory General Discussion linux , gpu , virtualization-solutions	0	898	April 17, 2024
Problem initializing second Nvidia GPU CUDA Setup and Installation	0	159	November 15, 2024
Hyper-V P2000 GPU Passthrough to Ubuntu - nvidia-smi returns 'device not found' Drivers - Linux, Windows, MacOS ubuntu , nvidia-smi	0	559	March 16, 2024
Problem installing Nvidia driver with VM and GPU passthrough. Linux	1	2145	December 6, 2017
H100 PCIe, NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver Linux kernel , ubuntu , gpu , driver , nvidia-smi	17	4761	April 12, 2024

VFIO Passthrough for HGX B200 system

Related topics