vGPU boot error

roodabigman · April 4, 2014, 8:47pm

Having an issue getting vGPU working. If I try to assign any machine a shared GPU - I get the following error when I attempt to boot the machine:

internal error: xenopsd internal error: Unix.Unix_error (20, "open", "/ sys / bus / pci / drivers / nvidia / bind")

there are no other machines using the GPU. passthrough works fine.

XenServer 6.2 SP1, Cisco UCS C240M3.

any suggestions?

RTS · April 7, 2014, 5:55pm

Hi roodabigman,

Could you please try to update this latest patch Hotfix XS62ESP1004 - For XenServer 6.2.0 Service Pack 1 and check

roodabigman · April 7, 2014, 9:36pm

Hi Raja - thanks for the suggestion.

unfortunately installing the patch did not have an effect, still have the same error when booting.

RTS · April 8, 2014, 5:58am

Thanks roodabigman for the quick check.

Could you please help to confirm the below queries?

May I know the VM OS version?
Are you trying to assign the vGPU to VM via XenServer [using command] or XenCenter [using GUI]?
Please provide nvidia bug report by running nvidia-bug-report.sh script

roodabigman · April 24, 2014, 5:51pm

Hi Raja,

Windows 7 SP1 64-bit
assigned via XenCenter GUI
found here that the Nvidia module is not loading correctly - returns fatal error - module not loaded

lsmod | grep nvidia - returns nothing
Memory Mapped I/O above 4 GB already disabled
dmesg | grep NVIDIA - returns nothing

I will have to troubleshoot why the xenserver is not loading the module.

lwignall · April 26, 2014, 4:21pm

Try uninstalling and reinstalling the vGPU driver (be sure that is the driver you downloaded from our site), rebooting after each step.

RTS · April 28, 2014, 6:12am

Hi roodabigman,

Any update after re-installing vGPU driver [on HOST]?

ucsguy · April 28, 2014, 10:29am

Issue fixed with How to Resolve GPU Memory Mapping Issues in XenServer

Change Memory Mapped I/O above 4GB to Disabled. It works.

bp_vardhaman · April 28, 2014, 10:29am

How to Resolve GPU Memory Mapping Issues in XenServer
CTX139834 Created onMar 26, 2014 Updated onApr 02, 2014
Article Topic : Storage, Other
See Applicable Products
Objective
This article is for customers running XenServer 6.2.0 who are using the 3D Graphics Pack (3DGP) with NVIDIA GRID GPUs, and have problems starting Virtual Machines (VMs) with a virtual GPU (vGPU) created. Customers may find that virtual machines fail to start with a message similar to the following:
Unix.Unix_error(20, "open", "/sys/bus/pci/drivers/nvidia/bind")
This can be caused by the NVIDIA driver not loading in the host’s control domain. To check this, run the following command on the host console:
lsmod | grep ^NVidia
This will return no results if the driver is not loaded.
To find out whether this is caused by the memory mapping issue, run the following command on the host console:
dmesg | grep NVIDIA
Check for messages containing:
"This PCI I/O region assigned to your NVIDIA device is invalid"
If you see this message, it confirms that the GPU has been mapped into memory inaccessible to the host’s control domain. This can be resolved with a change to the BIOS settings.
Instructions
The following sample procedure is for a Dell R720 server. For other server types, refer to the vendor documentation.
Reboot the server and enter System Setup (press F2).
Navigate to System BIOS, and then Integrated Devices.
Change Memory Mapped I/O above 4GB to Disabled.
Save the settings and reboot the host. It should now be possible to start VMs with vGPUs.

RTS · April 29, 2014, 6:03am

Thanks All, I hope roodabigman already mentioned he already disable the "Memory Mapped I/O above 4GB" option in SBIOS. Refer comment # 5.

Hi roodabigman,

Could you please double confirm whether "Memory Mapped I/O above 4GB" option is disabled or not?

roodabigman · November 5, 2014, 5:03pm

Hi Raja,

sorry for the extended delay - environment was migrated to a different datacenter and we have had other projects that took priority.

Memory Mapped I/O is and has always been disabled.

A bit of success - we updated the firmware on the hardware (C240M3) to the latest version from Cisco. Now 1 of the 2 server will boot vGPU :), the other still gives the same errors for some reason.

We’re going to re-flash the firmware and open them to be sure the hardware config is identical between the two of them, will let you know if we get it working.

Topic		Replies	Views
vGPU not showing up when creating VM's NVIDIA Virtual GPU Technology	2	9132	May 1, 2014
One K260Q vGPU working -> vmiop_log: error: /usr/lib/libnvidia-vgx.so NVIDIA Virtual GPU Technology	0	6668	May 1, 2014
Cant Start Xenserver 8.2 VM with vGPU General Discussion	2	734	July 17, 2024
VM's locked up on XenServer 6.5 NVIDIA Virtual GPU Technology	2	6860	October 13, 2015
Nvidia VMware vSphere-6.7 NVIDIA Virtual GPU Technology	14	10220	August 19, 2019
Could not initialize plugin 'libnvidia-vgx.so' for vGPU 'nvidia_a16-4q' XenDesktop	16	5298	April 21, 2025
Problem with Nvidia K2 and vGPU Profiles General Discussion	3	7563	October 4, 2015
Issue with Tesla M6 on Cisco B200 M4 after reboot NVIDIA Virtual GPU Drivers	4	5371	October 11, 2017
GPU hardware detected but unable to start (error code 10) NVIDIA Virtual GPU Technology	2	28184	June 1, 2015
Emulator Failed to Start NVIDIA Virtual GPU Technology	5	11486	June 29, 2016

vGPU boot error

Related topics