Upgraded from 17.04 to 17.10 and now X comes up and eventually stops. This was working on 17.04 with the nvidia-384 driver but on 17.10 it no longer does.
This is an AMD x399 system with a Titan Xp card.
I’m grabbing the drivers from:
500 Index of /graphics-drivers/ppa/ubuntu artful/main i386 Packages
release v=17.10,o=LP-PPA-graphics-drivers,a=artful,n=artful,l=Proprietary GPU Drivers,c=main,b=i386
origin ppa.launchpad.net
500 Index of /graphics-drivers/ppa/ubuntu artful/main amd64 Packages
release v=17.10,o=LP-PPA-graphics-drivers,a=artful,n=artful,l=Proprietary GPU Drivers,c=main,b=amd64
origin ppa.launchpad.net
I tried:
apt-get purge nvidia-*
apt-get install -y nvidia-*
shutdown -r now
$ dpkg -l | grep nvidia
ii nvidia-384 384.90-0ubuntu3 amd64 NVIDIA binary driver - version 384.90
ii nvidia-opencl-icd-384 384.90-0ubuntu3 amd64 NVIDIA OpenCL ICD
ii nvidia-prime 0.8.5 amd64 Tools to enable NVIDIA’s Prime
ii nvidia-settings 384.90-0ubuntu0~gpu17.10.1 amd64 Tool for configuring the NVIDIA graphics driver
$ sudo modinfo nvidia-384
filename: /lib/modules/4.13.0-16-lowlatency/updates/dkms/nvidia_384.ko
alias: char-major-195-*
version: 384.90
supported: external
license: NVIDIA
srcversion: 9D546A76FA9D9523F03995D
alias: pci:v000010DEd00000E00svsdbc04sc80i00*
alias: pci:v000010DEdsvsdbc03sc02i00
alias: pci:v000010DEdsvsdbc03sc00i00
depends:
name: nvidia
vermagic: 4.13.0-16-lowlatency SMP preempt mod_unload
parm: NVreg_Mobile:int
parm: NVreg_ResmanDebugLevel:int
parm: NVreg_RmLogonRC:int
parm: NVreg_ModifyDeviceFiles:int
parm: NVreg_DeviceFileUID:int
parm: NVreg_DeviceFileGID:int
…
X will come up (I see it in nvidia-smi) and then come down, then up, then down. I see a lot of these;
888.017] (–) NVIDIA(GPU-0):
[ 888.017] (–) NVIDIA(GPU-0): DFP-3: disconnected
[ 888.017] (–) NVIDIA(GPU-0): DFP-3: Internal DisplayPort
[ 888.017] (–) NVIDIA(GPU-0): DFP-3: 1440.0 MHz maximum pixel clock
[ 888.017] (–) NVIDIA(GPU-0):
[ 888.017] (–) NVIDIA(GPU-0): DFP-4: disconnected
[ 888.017] (–) NVIDIA(GPU-0): DFP-4: Internal TMDS
[ 888.017] (–) NVIDIA(GPU-0): DFP-4: 165.0 MHz maximum pixel clock
[ 888.017] (–) NVIDIA(GPU-0):
[ 888.017] (–) NVIDIA(GPU-0): DFP-5: disconnected
[ 888.017] (–) NVIDIA(GPU-0): DFP-5: Internal DisplayPort
[ 888.018] (–) NVIDIA(GPU-0): DFP-5: 1440.0 MHz maximum pixel clock
[ 888.018] (–) NVIDIA(GPU-0):
[ 888.018] (–) NVIDIA(GPU-0): DFP-6: disconnected
[ 888.018] (–) NVIDIA(GPU-0): DFP-6: Internal TMDS
[ 888.018] (–) NVIDIA(GPU-0): DFP-6: 165.0 MHz maximum pixel clock
[ 888.018] (–) NVIDIA(GPU-0):
[ 888.355] () Option “fd” “38”
[ 888.355] (II) event1 - (II) Power Button: (II) device removed
[ 888.356] () Option “fd” “41”
[ 888.356] (II) event0 - (II) Power Button: (II) device removed
[ 888.356] (**) Option “fd” “42”
[ 888.356] (II) event9 - (II) Eee PC WMI hotkeys: (II) device removed
[ 888.357] (II) UnloadModule: “libinput”
[ 888.357] (II) systemd-logind: releasing fd for 13:73
[ 888.370] (II) UnloadModule: “libinput”
[ 888.370] (II) systemd-logind: releasing fd for 13:64
[ 888.387] (II) UnloadModule: “libinput”
[ 888.387] (II) systemd-logind: releasing fd for 13:65
[ 888.439] (II) NVIDIA(GPU-0): Deleting GPU-0
[ 888.443] (II) Server terminated successfully (0). Closing log file.
And nvidia-settings -l gives me:
$ nvidia-settings -V -l
WARNING: NV-CONTROL extension not found on this Display.
ERROR: Error querying enabled displays on GPU 0 (Missing Extension).
ERROR: Error querying connected displays on GPU 0 (Missing Extension).
WARNING: NV-CONTROL extension not found on this Display.
WARNING: Unable to determine number of NVIDIA GPUs on ‘kitt:10.0’.
WARNING: Unable to determine number of NVIDIA Frame Lock Devices on ‘kitt:10.0’.
WARNING: Unable to determine number of NVIDIA VCSs on ‘kitt:10.0’.
WARNING: Unable to determine number of NVIDIA SDI Input Devices on ‘kitt:10.0’.
WARNING: Unable to determine number of NVIDIA Fans on ‘kitt:10.0’.
WARNING: Unable to determine number of NVIDIA Thermal Sensors on ‘kitt:10.0’.
WARNING: Unable to determine number of NVIDIA 3D Vision Pro Transceivers on ‘kitt:10.0’.
WARNING: Unable to determine number of NVIDIA Display Devices on ‘kitt:10.0’.
WARNING: Unable to determine number of NVIDIA X Screens on ‘kitt:10.0’.
My xorg.conf file:
$ cat /etc/X11/xorg.conf
nvidia-xconfig: X configuration file generated by nvidia-xconfig
nvidia-xconfig: version 384.90 (buildmeister@swio-display-x86-rhel47-05) Tue Sep 19 18:13:03 PDT 2017
Section “ServerLayout”
Identifier “Layout0”
Screen 0 “Screen0” 0 0
InputDevice “Keyboard0” “CoreKeyboard”
InputDevice “Mouse0” “CorePointer”
EndSection
Section “Files”
EndSection
Section “InputDevice”
# generated from default
Identifier "Mouse0"
Driver "mouse"
Option "Protocol" "auto"
Option "Device" "/dev/psaux"
Option "Emulate3Buttons" "no"
Option "ZAxisMapping" "4 5"
EndSection
Section “InputDevice”
# generated from default
Identifier "Keyboard0"
Driver "kbd"
EndSection
Section “Monitor”
Identifier “Monitor0”
VendorName “Unknown”
ModelName “Unknown”
HorizSync 28.0 - 33.0
VertRefresh 43.0 - 72.0
Option “DPMS”
EndSection
Section “Device”
Identifier “Device0”
Driver “nvidia”
VendorName “NVIDIA Corporation”
EndSection
Section “Screen”
Identifier “Screen0”
Device “Device0”
Monitor “Monitor0”
DefaultDepth 24
Option “AllowEmptyInitialConfiguration” “True”
SubSection “Display”
Depth 24
EndSubSection
EndSection
What am I doing wrong?
nvidia-bug-report.log.gz (167 KB)