Driver installed but won't load: Ubuntu 20.04 NVIDIA TU104GL [Quadro RTX 4000]

Trying to get Quadro RTX 4000 to work. Drivers installed but won’t load.

I installed Windows 10 on a separate partition. It finds and installs NVIDIA drivers. The problem is not faulty hardware.

Topics read:




nvidia-bug-report.log (140.6 KB)

Program outputs:
nvidia-detector:
nvidia-driver-440

nvidia-settings:
ERROR: NVIDIA driver is not loaded

ERROR: Unable to load info from any available system

inxi -G:
Graphics: Device-1: NVIDIA TU104GL [Quadro RTX 4000] driver: N/A
Display: x11 server: X.Org 1.20.8 driver: fbdev,nouveau unloaded: modesetting,vesa resolution: 1920x1080~77Hz
OpenGL: renderer: llvmpipe (LLVM 9.0.1 256 bits) v: 3.3 Mesa 20.0.4

mokutil --sb-state:
SecureBoot disabled

lsmod | grep nvidia:
i2c_nvidia_gpu 16384 0

lshw -c video:
*-display UNCLAIMED
description: VGA compatible controller
product: TU104GL [Quadro RTX 4000]
vendor: NVIDIA Corporation
physical id: 0
bus info: pci@0000:18:00.0
version: a1
width: 64 bits
clock: 33MHz
capabilities: pm msi pciexpress vga_controller cap_list
configuration: latency=0
resources: memory:b3000000-b3ffffff memory:a0000000-afffffff memory:b0000000-b1ffffff ioport:5000(size=128) memory:b4000000-b407ffff

nvidia-smi:
NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

more /usr/share/X11/xorg.conf.d/10-amdgpu.conf:
Section “OutputClass”
Identifier “AMDgpu”
MatchDriver “amdgpu”
Driver “modesetting”
Option “PrimaryGPU” “Yes”
EndSection

ls /lib/modprobe.d/blacklist-nvidia.conf:
ls: cannot access ‘/lib/modprobe.d/blacklist-nvidia.conf’: No such file or directory

dpkg -l | grep nvidia:
ii libnvidia-cfg1-440:amd64 440.82-0ubuntu0~0.19.10.1 amd64 NVIDIA binary OpenGL/GLX configuration library
ii libnvidia-common-440 440.82-0ubuntu0~0.19.10.1 all Shared files used by the NVIDIA libraries
rc libnvidia-compute-418:amd64 430.50-0ubuntu2 amd64 Transitional package for libnvidia-compute-430
rc libnvidia-compute-430:amd64 430.64-0ubuntu0~gpu19.10.1 amd64 NVIDIA libcompute package
rc libnvidia-compute-435:amd64 435.21-0ubuntu2 amd64 NVIDIA libcompute package
ii libnvidia-compute-440:amd64 440.82-0ubuntu0~0.19.10.1 amd64 NVIDIA libcompute package
ii libnvidia-compute-440:i386 440.82-0ubuntu0~0.19.10.1 i386 NVIDIA libcompute package
ii libnvidia-decode-440:amd64 440.82-0ubuntu0~0.19.10.1 amd64 NVIDIA Video Decoding runtime libraries
ii libnvidia-decode-440:i386 440.82-0ubuntu0~0.19.10.1 i386 NVIDIA Video Decoding runtime libraries
ii libnvidia-encode-440:amd64 440.82-0ubuntu0~0.19.10.1 amd64 NVENC Video Encoding runtime library
ii libnvidia-encode-440:i386 440.82-0ubuntu0~0.19.10.1 i386 NVENC Video Encoding runtime library
ii libnvidia-extra-440:amd64 440.82-0ubuntu0~0.19.10.1 amd64 Extra libraries for the NVIDIA driver
ii libnvidia-fbc1-440:amd64 440.82-0ubuntu0~0.19.10.1 amd64 NVIDIA OpenGL-based Framebuffer Capture runtime library
ii libnvidia-fbc1-440:i386 440.82-0ubuntu0~0.19.10.1 i386 NVIDIA OpenGL-based Framebuffer Capture runtime library
ii libnvidia-gl-440:amd64 440.82-0ubuntu0~0.19.10.1 amd64 NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
ii libnvidia-gl-440:i386 440.82-0ubuntu0~0.19.10.1 i386 NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
ii libnvidia-ifr1-440:amd64 440.82-0ubuntu0~0.19.10.1 amd64 NVIDIA OpenGL-based Inband Frame Readback runtime library
ii libnvidia-ifr1-440:i386 440.82-0ubuntu0~0.19.10.1 i386 NVIDIA OpenGL-based Inband Frame Readback runtime library
rc nvidia-compute-utils-430 430.64-0ubuntu0~gpu19.10.1 amd64 NVIDIA compute utilities
rc nvidia-compute-utils-435 435.21-0ubuntu2 amd64 NVIDIA compute utilities
ii nvidia-compute-utils-440 440.82-0ubuntu0~0.19.10.1 amd64 NVIDIA compute utilities
rc nvidia-cuda-toolkit 10.1.168-1build1 amd64 NVIDIA CUDA development toolkit
rc nvidia-dkms-430 430.64-0ubuntu0~gpu19.10.1 amd64 NVIDIA DKMS package
rc nvidia-dkms-435 435.21-0ubuntu2 amd64 NVIDIA DKMS package
ii nvidia-dkms-440 440.82-0ubuntu0~0.19.10.1 amd64 NVIDIA DKMS package
ii nvidia-driver-440 440.82-0ubuntu0~0.19.10.1 amd64 NVIDIA driver metapackage
rc nvidia-kernel-common-430 430.64-0ubuntu0~gpu19.10.1 amd64 Shared files used with the kernel module
rc nvidia-kernel-common-435 435.21-0ubuntu2 amd64 Shared files used with the kernel module
ii nvidia-kernel-common-440 440.82-0ubuntu0~0.19.10.1 amd64 Shared files used with the kernel module
ii nvidia-kernel-source-440 440.82-0ubuntu0~0.19.10.1 amd64 NVIDIA kernel source package
ii nvidia-prime 0.8.14 all Tools to enable NVIDIA’s Prime
ii nvidia-settings 440.64-0ubuntu1 amd64 Tool for configuring the NVIDIA graphics driver
ii nvidia-utils-440 440.82-0ubuntu0~0.19.10.1 amd64 NVIDIA driver support binaries
ii screen-resolution-extra 0.18build1 all Extension for the nvidia-settings control panel
ii xserver-xorg-video-nvidia-440 440.82-0ubuntu0~0.19.10.1 amd64 NVIDIA binary Xorg driver

Looks like you forcibly installed a v410 .run installer driver over the already installed 440 repo driver. Please uninstall the .run driver using the --uninstall option, then reinstall the repo driver
sudo apt remove nvidia*
sudo apt install nvidia-driver-440
After reboot, please post the output of
dkms status
and
grep nvidia /etc/modprobe.d/* /lib/modprobe.d/*

dkms status:

nvidia, 440.64, 5.4.0-31-generic, x86_64: installed (WARNING! Diff between built and installed module!) (WARNING! Diff between built and installed module!) (WARNING! Diff between built and installed module!)

grep nvidia /etc/modprobe.d/* /lib/modprobe.d/*:

/etc/modprobe.d/blacklist-framebuffer.conf:blacklist nvidiafb

Associated question: I wish to use the MakieGL library in julia. It requires OpenGL 3.3. I have

glxinfo>grep “OpenGL version”:

OpenGL version string: 3.1 Mesa 20.0.4

Is there a way to upgrade OpenGL to 3.3? I can’t seem to find a way by web-search.

Thanks

Mesa OpenGL 3.1 is displayed because you’re currently on VGA fallback graphics with software rendering. Once the nvidia driver is fully functional, this will automatically change into Nvidia OpenGL 4.6
The nvidia driver is loading now but too late so Xorg falls back to VGA. Please try embedding it into the initrd:
sudo dracut -f
also create /etc/X11/xorg.conf only containing

Section "Device"
    Identifier     "nvidia"
    Driver         "nvidia"
    VendorName     "NVIDIA Corporation"
    BusID          "PCI:1:0:0"
    Option         "AllowEmptyInitialConfiguration"
EndSection

After reboot, please create a new nvidia-bug-report.log
Edit: fixed wrong BusID

Ooops, sorry, please ignore the post before, it was meant for a different thread.

Please post the output of
cc --version
and create and attach a new nvidia-bug-report.log

Thank you for the OpenGL explanation.

cc --version

cc (Ubuntu 9.3.0-10ubuntu2) 9.3.0

Copyright © 2019 Free Software Foundation, Inc.

cat /etc/X11/xorg.conf:

Section “Device”

Identifier “nvidia”

Driver “nvidia”

VendorName “NVIDIA Corporation”

BusID “PCI:1:0:0”

Option “AllowEmptyInitialConfiguration”

EndSection

dkms status:

nvidia, 440.64, 5.4.0-31-generic, x86_64: installed (WARNING! Diff between built and installed module!) (WARNING! Diff between built and installed module!) (WARNING! Diff between built and installed module!)

nvidia-settings:

ERROR: Unable to load info from any available system

(nvidia-settings:36809): GLib-GObject-CRITICAL **: 12:08:51.660: g_object_unref: assertion ‘G_IS_OBJECT (object)’ failed

** Message: 12:08:51.665: PRIME: No offloading required. Abort

** Message: 12:08:51.665: PRIME: is it supported? no

nvidia-bug-report.log (373 KB)

Please change the BusID to

BusID “PCI:24:0:0”

like said, that post was meant for a different thread with similar symptoms.
The problem in your case now is that you have a mismatch of kernel modules (440.82) and rest of the driver (440.64). Probably a left-over from a failed .run install.
please download the .run installer if you don’t have it:
https://http.download.nvidia.com/XFree86/Linux-x86_64/440.82/NVIDIA-Linux-x86_64-440.82.run
Make it executable, then run it with --uninstall option.
Afterwards, please purge and reinstall the repo driver again
sudo apt remove nvidia*
sudo apt install nvidia-driver-440
reboot, then post the output of
dkms status

That fixed it–thank you–awesome.