Startx not working with quadro and hp z4 g5

Hi,

This card works fine on a gen 4, but today I tried to get it working on a gen 5 Z4 but startx won’t start.

The OS is redhat 7.6 and the driver is nvidia-linux-410.93-6.x86-64. I have blacklisted the nou driver in grub. Also, this machine does not have an onboard gpu, even though it appears later to find one by intel.

According to lspci my nvidia is on bus: 47:00:0

I currently don’t have an xorg.conf file. When I try to run startx it fails with Fatal server error: (ee) no screens found.

If I check the x11 log I see failed to initialize the NVIDIA GPU at PCI:71:0:0. If you remember, my nvidia is at 47:00:0 according to lspci. At 71:0:0 is System peripheral: Intel Corporation Device 09a2 (rev 20).

So to fix this, I create an xorg.conf file with X -configure, which is still trying to use 71:0:0 for the nvidia card.

To correct this I add BusID: PCI:47:0:0 to the bottom of devices in xorg.conf. Next I tired startx again but it fails, no screens found.

Log file now:

[ 1806.903]
X.Org X Server 1.20.1
X Protocol Version 11, Revision 0
[ 1806.909] Build Operating System: 3.10.0-862.2.3.el7.x86_64
[ 1806.912] Current Operating System: Linux okc-exp2-cwp-05.frq.local 3.10.0-957.27.2.el7.x86_64 #1 SMP Tue Jul 9 16:53:14 UTC 2019 x86_64
[ 1806.912] Kernel command line: BOOT_IMAGE=/vmlinuz-3.10.0-957.27.2.el7.x86_64 root=/dev/mapper/frq01-root ro crashkernel=auto rd.lvm.lv=frq01/root rd.lvm.lv=frq01/swap net.ifnames=0 rd.drvier.blacklist=nouveau modprobe.blacklist=nouveau rhgb quiet
[ 1806.918] Build Date: 24 September 2018 06:30:46PM
[ 1806.920] Build ID: xorg-x11-server 1.20.1-3.el7
[ 1806.922] Current version of pixman: 0.34.0
[ 1806.927] Before reporting problems, check http://wiki.x.org
to make sure that you have the latest version.
[ 1806.927] Markers: (–) probed, () from config file, (==) default setting,
(++) from command line, (!!) notice, (II) informational,
(WW) warning, (EE) error, (NI) not implemented, (??) unknown.
[ 1806.938] (==) Log file: “/var/log/Xorg.1.log”, Time: Mon Nov 20 19:39:58 2023
[ 1806.940] (==) Using config file: “/etc/X11/xorg.conf”
[ 1806.942] (==) Using system config directory “/usr/share/X11/xorg.conf.d”
[ 1806.942] (==) ServerLayout “X.org Configured”
[ 1806.942] (
) |–>Screen “Screen0” (0)
[ 1806.942] () | |–>Monitor “Monitor0”
[ 1806.943] (
) | |–>Device “Card0”
[ 1806.943] () |–>Input Device “Mouse0”
[ 1806.943] (
) |–>Input Device “Keyboard0”
[ 1806.943] (==) Automatically adding devices
[ 1806.943] (==) Automatically enabling devices
[ 1806.943] (==) Automatically adding GPU devices
[ 1806.943] (==) Automatically binding GPU devices
[ 1806.943] (==) Max clients allowed: 256, resource mask: 0x1fffff
[ 1806.943] () FontPath set to:
catalogue:/etc/X11/fontpath.d,
built-ins,
catalogue:/etc/X11/fontpath.d,
built-ins
[ 1806.943] (
) ModulePath set to “/usr/lib64/xorg/modules”
[ 1806.943] (WW) Hotplugging is on, devices using drivers ‘kbd’, ‘mouse’ or ‘vmmouse’ will be disabled.
[ 1806.943] (WW) Disabling Mouse0
[ 1806.943] (WW) Disabling Keyboard0
[ 1806.943] (II) Loader magic: 0x560fe2053020
[ 1806.943] (II) Module ABI versions:
[ 1806.943] X.Org ANSI C Emulation: 0.4
[ 1806.943] X.Org Video Driver: 24.0
[ 1806.943] X.Org XInput driver : 24.1
[ 1806.943] X.Org Server Extension : 10.0
[ 1806.943] (II) xfree86: Adding drm device (/dev/dri/card0)
[ 1806.943] (II) Platform probe for /sys/devices/pci0000:46/0000:46:01.0/0000:47:00.0/drm/card0
[ 1806.943] (II) Platform PCI device at pci:0000:46:01.0
[ 1806.947] (–) PCI:*(71@0:0:0) 10de:1ff2:103c:1613 rev 161, Mem @ 0x98000000/16777216, 0x201fe0000000/268435456, 0x201ff0000000/33554432, I/O @ 0x00004000/128, BIOS @ 0x???/524288
[ 1806.947] (II) “glx” will be loaded. This was enabled by default and also specified in the config file.
[ 1806.947] (II) LoadModule: “glxservernvidia”
[ 1806.947] (WW) Warning, couldn’t open module glxservernvidia
[ 1806.947] (EE) Failed to load module “glxservernvidia” (module does not exist, 0)
[ 1806.947] (II) LoadModule: “glx”
[ 1806.947] (II) Loading /usr/lib64/xorg/modules/extensions/libglx.so
[ 1806.948] (II) Module glx: vendor=“X.Org Foundation”
[ 1806.948] compiled for 1.20.1, module version = 1.0.0
[ 1806.948] ABI class: X.Org Server Extension, version 10.0
[ 1806.948] (II) LoadModule: “nvidia”
[ 1806.948] (II) Loading /usr/lib64/xorg/modules/drivers/nvidia_drv.so
[ 1806.948] (II) Module nvidia: vendor=“NVIDIA Corporation”
[ 1806.948] compiled for 4.0.2, module version = 1.0.0
[ 1806.948] Module class: X.Org Video Driver
[ 1806.948] (II) NVIDIA dlloader X Driver 410.93 Thu Dec 20 17:07:57 CST 2018
[ 1806.948] (II) NVIDIA Unified Driver for all Supported NVIDIA GPUs
[ 1806.948] (++) using VT number 1

[ 1806.948] (EE) No devices detected.
[ 1806.948] (EE)
Fatal server error:
[ 1806.948] (EE) no screens found(EE)
[ 1806.948] (EE)
Please consult the The X.Org Foundation support
at http://wiki.x.org
for help.
[ 1806.948] (EE) Please also check the log file at “/var/log/Xorg.1.log” for additional information.
[ 1806.948] (EE)
[ 1806.965] (EE) Server terminated with error (1). Closing log file.

And this is my current xorg.conf file:

Section “ServerLayout”
Identifier “X.org Configured”
Screen 0 “Screen0” 0 0
InputDevice “Mouse0” “CorePointer”
InputDevice “Keyboard0” “CoreKeyboard”
EndSection

Section “Files”
ModulePath “/usr/lib64/xorg/modules”
FontPath “catalogue:/etc/X11/fontpath.d”
FontPath “built-ins”
EndSection

Section “Module”
Load “glxserver_nvidia”
Load “glx”
EndSection

Section “InputDevice”
Identifier “Keyboard0”
Driver “kbd”
EndSection

Section “InputDevice”
Identifier “Mouse0”
Driver “mouse”
Option “Protocol” “auto”
Option “Device” “/dev/input/mice”
Option “ZAxisMapping” “4 5 6 7”
EndSection

Section “Monitor”
Identifier “Monitor0”
VendorName “Monitor Vendor”
ModelName “Monitor Model”
EndSection

Section “Device”
Identifier “Card0”
Driver “nvidia”
BusID “PCI:47:0:0”
EndSection

Section “Screen”
Identifier “Screen0”
Device “Card0”
Monitor “Monitor0”
SubSection “Display”
Viewport 0 0
Depth 1
EndSubSection
SubSection “Display”
Viewport 0 0
Depth 4
EndSubSection
SubSection “Display”
Viewport 0 0
Depth 8
EndSubSection
SubSection “Display”
Viewport 0 0
Depth 15
EndSubSection
SubSection “Display”
Viewport 0 0
Depth 16
EndSubSection
SubSection “Display”
Viewport 0 0
Depth 24
EndSubSection
EndSection

And this is the output of my lspci:

00:00.0 System peripheral: Intel Corporation Device 09a2 (rev 20)
00:00.1 System peripheral: Intel Corporation Device 09a4 (rev 20)
00:00.2 System peripheral: Intel Corporation Device 09a3 (rev 20)
00:00.4 Generic system peripheral [0807]: Intel Corporation Device 0b23
00:14.0 USB controller: Intel Corporation Device 7ae0 (rev 11)
00:14.2 RAM memory: Intel Corporation Device 7aa7 (rev 11)
00:16.0 Communication controller: Intel Corporation Device 7ae8 (rev 11)
00:17.0 RAID bus controller: Intel Corporation C600/X79 series chipset SATA RAID Controller (rev 11)
00:1f.0 ISA bridge: Intel Corporation Device 7a8a (rev 11)
00:1f.3 Audio device: Intel Corporation Device 7ad0 (rev 11)
00:1f.4 SMBus: Intel Corporation Device 7aa3 (rev 11)
00:1f.5 Serial bus controller [0c80]: Intel Corporation Device 7aa4 (rev 11)
00:1f.6 Ethernet controller: Intel Corporation Device 1a1c (rev 11)
46:00.0 System peripheral: Intel Corporation Device 09a2 (rev 20)
46:00.1 System peripheral: Intel Corporation Device 09a4 (rev 20)
46:00.2 System peripheral: Intel Corporation Device 09a3 (rev 20)
46:00.4 Generic system peripheral [0807]: Intel Corporation Device 0b23
46:01.0 PCI bridge: Intel Corporation Device 352a (rev 04)
47:00.0 VGA compatible controller: NVIDIA Corporation Device 1ff2 (rev a1)
47:00.1 Audio device: NVIDIA Corporation Device 10fa (rev a1)
71:00.0 System peripheral: Intel Corporation Device 09a2 (rev 20)
71:00.1 System peripheral: Intel Corporation Device 09a4 (rev 20)
71:00.2 System peripheral: Intel Corporation Device 09a3 (rev 20)
71:00.4 Generic system peripheral [0807]: Intel Corporation Device 0b23
71:01.0 PCI bridge: Intel Corporation Device 352a (rev 04)
9c:00.0 System peripheral: Intel Corporation Device 09a2 (rev 20)
9c:00.1 System peripheral: Intel Corporation Device 09a4 (rev 20)
9c:00.2 System peripheral: Intel Corporation Device 09a3 (rev 20)
9c:00.4 Generic system peripheral [0807]: Intel Corporation Device 0b23
9c:01.0 PCI bridge: Intel Corporation Device 352a (rev 04)
c7:00.0 System peripheral: Intel Corporation Device 09a2 (rev 20)
c7:00.1 System peripheral: Intel Corporation Device 09a4 (rev 20)
c7:00.2 System peripheral: Intel Corporation Device 09a3 (rev 20)
c7:00.4 Generic system peripheral [0807]: Intel Corporation Device 0b23
c7:01.0 PCI bridge: Intel Corporation Device 352a (rev 04)
c7:03.0 PCI bridge: Intel Corporation Device 352b (rev 04)
c7:05.0 PCI bridge: Intel Corporation Device 352c (rev 04)
c7:07.0 PCI bridge: Intel Corporation Device 352d (rev 04)
c8:00.0 Non-Volatile memory controller: Micron Technology Inc Device 5407
f2:00.0 System peripheral: Intel Corporation Device 09a2 (rev 20)
f2:00.1 System peripheral: Intel Corporation Device 09a4 (rev 20)
f2:00.2 System peripheral: Intel Corporation Device 09a3 (rev 20)
f2:00.4 Generic system peripheral [0807]: Intel Corporation Device 0b23
f2:01.0 System peripheral: Intel Corporation Device 0b25
f2:03.0 System peripheral: Intel Corporation Device 09a6
f2:03.1 System peripheral: Intel Corporation Device 09a7
fe:00.0 System peripheral: Intel Corporation Device 3250
fe:00.1 System peripheral: Intel Corporation Device 3251
fe:00.2 System peripheral: Intel Corporation Device 3252
fe:00.3 Host bridge: Intel Corporation Device 0998
fe:00.5 System peripheral: Intel Corporation Device 3255
fe:05.0 System peripheral: Intel Corporation Device 3245
fe:05.1 System peripheral: Intel Corporation Device 3246
fe:05.2 System peripheral: Intel Corporation Device 3247
fe:06.0 System peripheral: Intel Corporation Device 3245
fe:06.1 System peripheral: Intel Corporation Device 3246
fe:06.2 System peripheral: Intel Corporation Device 3247
fe:07.0 System peripheral: Intel Corporation Device 3245
fe:07.1 System peripheral: Intel Corporation Device 3246
fe:07.2 System peripheral: Intel Corporation Device 3247
fe:0c.0 Performance counters: Intel Corporation Device 324a
fe:0d.0 Performance counters: Intel Corporation Device 324a
fe:0e.0 Performance counters: Intel Corporation Device 324a
fe:0f.0 Performance counters: Intel Corporation Device 324a
fe:1a.0 Performance counters: Intel Corporation Device 2880
fe:1c.0 Performance counters: Intel Corporation Device 2880
ff:00.0 System peripheral: Intel Corporation Device 324c
ff:00.1 System peripheral: Intel Corporation Device 324c
ff:00.2 System peripheral: Intel Corporation Device 324c
ff:00.3 System peripheral: Intel Corporation Device 324c
ff:00.4 System peripheral: Intel Corporation Device 324c
ff:00.5 System peripheral: Intel Corporation Device 324c
ff:00.6 System peripheral: Intel Corporation Device 324c
ff:00.7 System peripheral: Intel Corporation Device 324c
ff:01.0 System peripheral: Intel Corporation Device 324c
ff:01.1 System peripheral: Intel Corporation Device 324c
ff:01.2 System peripheral: Intel Corporation Device 324c
ff:01.3 System peripheral: Intel Corporation Device 324c
ff:01.4 System peripheral: Intel Corporation Device 324c
ff:01.5 System peripheral: Intel Corporation Device 324c
ff:01.6 System peripheral: Intel Corporation Device 324c
ff:01.7 System peripheral: Intel Corporation Device 324c
ff:0a.0 System peripheral: Intel Corporation Device 324d
ff:0a.1 System peripheral: Intel Corporation Device 324d
ff:0a.2 System peripheral: Intel Corporation Device 324d
ff:0a.3 System peripheral: Intel Corporation Device 324d
ff:0a.4 System peripheral: Intel Corporation Device 324d
ff:0a.5 System peripheral: Intel Corporation Device 324d
ff:0a.6 System peripheral: Intel Corporation Device 324d
ff:0a.7 System peripheral: Intel Corporation Device 324d
ff:0b.0 System peripheral: Intel Corporation Device 324d
ff:0b.1 System peripheral: Intel Corporation Device 324d
ff:0b.2 System peripheral: Intel Corporation Device 324d
ff:0b.3 System peripheral: Intel Corporation Device 324d
ff:0b.4 System peripheral: Intel Corporation Device 324d
ff:0b.5 System peripheral: Intel Corporation Device 324d
ff:0b.6 System peripheral: Intel Corporation Device 324d
ff:0b.7 System peripheral: Intel Corporation Device 324d
ff:1d.0 System peripheral: Intel Corporation Device 344f
ff:1d.1 System peripheral: Intel Corporation Device 3457
ff:1e.0 System peripheral: Intel Corporation Device 3258 (rev 08)
ff:1e.1 System peripheral: Intel Corporation Device 3259 (rev 08)
ff:1e.2 System peripheral: Intel Corporation Device 325a (rev 08)
ff:1e.3 System peripheral: Intel Corporation Device 325b (rev 08)
ff:1e.4 System peripheral: Intel Corporation Device 325c (rev 08)
ff:1e.5 System peripheral: Intel Corporation Device 325d (rev 08)
ff:1e.6 System peripheral: Intel Corporation Device 325e (rev 08)
ff:1e.7 System peripheral: Intel Corporation Device 325f (rev 08)

I’m not sure where to go from here.

Xorg uses decimal values, 47hex=71dec, please remove the busid entry you set. If you don’t have a monitor attached, please add
Option "AllowEmptyInitialConfiguration" "true"
instead. If that doesn’t help,
Please run nvidia-bug-report.sh as root and attach the resulting nvidia-bug-report.log.gz file to your post.

Hi,

Thanks for the reply, I didn’t know about the hex thing.

I removed the BusID statement, rebooted and tried again. It failed with can’t find Nvidia card. I’m attaching the following:

  1. Bug report
  2. Current xorg.conf
  3. Current output of lspci
  4. Xorg.0.log from my last startx attempt.
    nvidia-bug-report.log.gz (539.5 KB)
    Xorg.0.log (6.1 KB)
    lspci.txt (6.5 KB)
    xorg.conf.txt (1.3 KB)

I forgot to include the output of lsmod in case you need it. I have attached it.
lsmod.txt (3.4 KB)

Also, I do have a monitor attached.

I just noticed if I tail /var/log/messages at the same time I attempt to run startx I get these two messages:

NVRM: RmInitAdapter failed! (0x26:0xffff:1127)
NVRM: rm_init_adapter failed for device bearring minor number 0

The gpu might be broken. Please try reseating it in its slot, check power cables, check if it works in another system.

So I went to swap the GPU for another one (we have a few p400 cards) and I noticed that HP sent these workstations with T400 cards instead of the P400 cards we tested and had working. The P400 card seems to work in this machine. Does that mean the T400 isn’t compatible with the driver we are using? I guess it could also still be a bad card. I should retest with another T400 card.

Oops, yeah, the error message is misleading, the driver you installed is outdated and incompatible with the T400. You need to use at least 470, better use the latest driver.

Thank you for helping me!