I am unable to make my GPU work on Ubuntu 20.04 LTS.
The GPU is an RTX A4000.
Here are my bug report and kern.log.
The latter says:
Feb 8 07:35:47 loicus-DA kernel: [ 288.919473] NVRM: GPU 0000:01:00.0: RmInitAdapter failed! (0x23:0xffff:1401)
Feb 8 07:35:47 loicus-DA kernel: [ 288.919576] NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device minor number 0
Feb 8 07:35:54 loicus-DA kernel: [ 296.096457] NVRM: Xid (PCI:0000:01:00): 79, pid=5156, GPU has fallen off the bus.
Feb 8 07:35:54 loicus-DA kernel: [ 296.096508] NVRM: GPU 0000:01:00.0: GPU has fallen off the bus.
Feb 8 07:35:54 loicus-DA kernel: [ 296.097573] NVRM: A GPU crash dump has been created. If possible, please run
Feb 8 07:35:54 loicus-DA kernel: [ 296.097573] NVRM: nvidia-bug-report.sh as root to collect this data before
Feb 8 07:35:54 loicus-DA kernel: [ 296.097573] NVRM: the NVIDIA kernel module is unloaded.
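To see at a glance which Xid codes the driver has logged, something like the following could help. It is only a sketch: the function names xid_codes and xid_summary are made up for illustration, and it assumes the log lines look like the excerpt above and live in Ubuntu's default /var/log/kern.log.

```shell
#!/bin/sh
# Print the Xid codes reported by the NVIDIA kernel module (NVRM) in a kernel log.
# Hypothetical helper; $1 is the log file, defaulting to Ubuntu's /var/log/kern.log.
xid_codes() {
    # Match lines such as "NVRM: Xid (PCI:0000:01:00): 79, pid=5156, ..."
    # and print only the numeric code that follows the PCI address.
    sed -n 's/.*NVRM: Xid (PCI:[^)]*): \([0-9][0-9]*\),.*/\1/p' "${1:-/var/log/kern.log}"
}

# Count how often each code occurs, most frequent first.
xid_summary() {
    xid_codes "$1" | sort -n | uniq -c | sort -rn
}
```

Running xid_summary with no argument reads /var/log/kern.log (which may need sudo or membership in the adm group). For the excerpt above it would report code 79, the "GPU has fallen off the bus" event.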
I tried reinstalling everything from scratch and installing the drivers in several different ways, but nothing works.
I suspect the GPU is dead, but I’d be thankful for a confirmation.
Since this is a laptop, the GPU is not necessarily broken. It falls off the bus first, which would point to a power management, bus, or kernel problem. Please try:
updating the BIOS
setting the kernel parameter intel_idle.max_cstate=1
using a different kernel
The Ubuntu 5.13 kernel was released with a lot of bugs. Please check whether you have a 5.11 kernel available in the GRUB menu, or try the Liquorix kernel PPA: https://launchpad.net/~damentz/+archive/ubuntu/liquorix
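For the kernel parameter, one possible way to add it on Ubuntu is to append it to GRUB_CMDLINE_LINUX_DEFAULT in /etc/default/grub and regenerate the GRUB config. The helper below is a rough sketch, not an official tool: add_kernel_param is a made-up name, and it assumes the variable is on a single line and uses double quotes, as in the stock Ubuntu file.

```shell
#!/bin/sh
# Append a kernel parameter to GRUB_CMDLINE_LINUX_DEFAULT in a GRUB defaults file.
# Hypothetical helper: $1 = parameter, $2 = file to edit (default /etc/default/grub).
add_kernel_param() {
    param=$1
    file=${2:-/etc/default/grub}
    # Do nothing if the parameter is already on that line.
    if grep '^GRUB_CMDLINE_LINUX_DEFAULT=' "$file" | grep -qF -- "$param"; then
        return 0
    fi
    # Keep a backup, then rewrite the line with the parameter appended.
    cp "$file" "$file.bak" || return 1
    sed "s|^GRUB_CMDLINE_LINUX_DEFAULT=\"\(.*\)\"|GRUB_CMDLINE_LINUX_DEFAULT=\"\1 $param\"|" \
        "$file.bak" > "$file"
}
```

A cautious way to use it is to work on a copy first: cp /etc/default/grub /tmp/grub, run add_kernel_param intel_idle.max_cstate=1 /tmp/grub, check the result with diff, then sudo cp /tmp/grub /etc/default/grub and sudo update-grub, and reboot. After the reboot, cat /proc/cmdline should show the parameter.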
It seems the kernel modules didn’t compile. Please reinstall the kernel headers:
sudo apt install linux-headers-$(uname -r)
Then post the output of
dkms status
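What to look for in that output is a line showing the nvidia module as installed for the kernel you are actually running. A small check along these lines might make that easier to read off. The function name is made up, and it assumes the comma-separated format that the dkms shipped with Ubuntu 20.04 prints; other dkms versions may format the line differently.

```shell
#!/bin/sh
# Succeed if `dkms status` output on stdin lists an nvidia module as "installed"
# for the kernel release given in $1 (typically "$(uname -r)").
# Hypothetical helper; expects lines such as:
#   nvidia, 510.47.03, 5.16.0-7.2-liquorix-amd64, x86_64: installed
nvidia_dkms_installed_for() {
    kernel=$1
    awk -F', ' -v k="$kernel" '
        # Field 1 is the module name, field 3 the kernel release.
        $1 == "nvidia" && $3 == k && /: installed/ { found = 1 }
        END { exit !found }
    '
}
```

Usage would be: dkms status | nvidia_dkms_installed_for "$(uname -r)" && echo "module built for running kernel". If it prints nothing, the module was either not built or was built for a different kernel than the one booted.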
loicus@loicus-DA:~$ dpkg -l |grep nvidia
ii libnvidia-cfg1-510:amd64 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA binary OpenGL/GLX configuration library
ii libnvidia-common-510 510.47.03-0ubuntu0.20.04.1 all Shared files used by the NVIDIA libraries
rc libnvidia-compute-470:amd64 470.103.01-0ubuntu0.20.04.1 amd64 NVIDIA libcompute package
rc libnvidia-compute-470-server:amd64 470.103.01-0ubuntu0.20.04.1 amd64 NVIDIA libcompute package
ii libnvidia-compute-510:amd64 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA libcompute package
ii libnvidia-compute-510:i386 510.47.03-0ubuntu0.20.04.1 i386 NVIDIA libcompute package
ii libnvidia-decode-510:amd64 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA Video Decoding runtime libraries
ii libnvidia-decode-510:i386 510.47.03-0ubuntu0.20.04.1 i386 NVIDIA Video Decoding runtime libraries
ii libnvidia-encode-510:amd64 510.47.03-0ubuntu0.20.04.1 amd64 NVENC Video Encoding runtime library
ii libnvidia-encode-510:i386 510.47.03-0ubuntu0.20.04.1 i386 NVENC Video Encoding runtime library
ii libnvidia-extra-510:amd64 510.47.03-0ubuntu0.20.04.1 amd64 Extra libraries for the NVIDIA driver
ii libnvidia-fbc1-510:amd64 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA OpenGL-based Framebuffer Capture runtime library
ii libnvidia-fbc1-510:i386 510.47.03-0ubuntu0.20.04.1 i386 NVIDIA OpenGL-based Framebuffer Capture runtime library
ii libnvidia-gl-510:amd64 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
ii libnvidia-gl-510:i386 510.47.03-0ubuntu0.20.04.1 i386 NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
ii linux-modules-nvidia-510-5.13.0-1029-oem 5.13.0-1029.36+1 amd64 Linux kernel nvidia modules for version 5.13.0-1029
ii linux-modules-nvidia-510-oem-20.04c 5.13.0-1029.36+1 amd64 Extra drivers for nvidia-510 for the oem-20.04c flavour
rc linux-objects-nvidia-470-5.11.0-1028-aws 5.11.0-1028.31~20.04.1+1 amd64 Linux kernel nvidia modules for version 5.11.0-1028 (objects)
rc linux-objects-nvidia-470-5.11.0-1028-azure 5.11.0-1028.31~20.04.2+1 amd64 Linux kernel nvidia modules for version 5.11.0-1028 (objects)
rc linux-objects-nvidia-470-5.11.0-1028-oracle 5.11.0-1028.31~20.04.1+1 amd64 Linux kernel nvidia modules for version 5.11.0-1028 (objects)
rc linux-objects-nvidia-470-5.11.0-1029-gcp 5.11.0-1029.33~20.04.3+1 amd64 Linux kernel nvidia modules for version 5.11.0-1029 (objects)
rc linux-objects-nvidia-470-5.13.0-1012-aws 5.13.0-1012.13~20.04.1+1 amd64 Linux kernel nvidia modules for version 5.13.0-1012 (objects)
rc linux-objects-nvidia-470-5.13.0-1013-azure 5.13.0-1013.15~20.04.1+1 amd64 Linux kernel nvidia modules for version 5.13.0-1013 (objects)
rc linux-objects-nvidia-470-5.13.0-1013-gcp 5.13.0-1013.16~20.04.1+1 amd64 Linux kernel nvidia modules for version 5.13.0-1013 (objects)
rc linux-objects-nvidia-470-5.13.0-1016-oracle 5.13.0-1016.20~20.04.1+1 amd64 Linux kernel nvidia modules for version 5.13.0-1016 (objects)
rc linux-objects-nvidia-470-5.13.0-1029-oem 5.13.0-1029.36+1 amd64 Linux kernel nvidia modules for version 5.13.0-1029 (objects)
rc linux-objects-nvidia-470-5.13.0-28-generic 5.13.0-28.31~20.04.1+2 amd64 Linux kernel nvidia modules for version 5.13.0-28 (objects)
rc linux-objects-nvidia-470-5.13.0-28-lowlatency 5.13.0-28.31~20.04.1+2 amd64 Linux kernel nvidia modules for version 5.13.0-28 (objects)
rc linux-objects-nvidia-470-5.4.0-1062-oracle 5.4.0-1062.66+1 amd64 Linux kernel nvidia modules for version 5.4.0-1062 (objects)
rc linux-objects-nvidia-470-5.4.0-1063-gcp 5.4.0-1063.67+1 amd64 Linux kernel nvidia modules for version 5.4.0-1063 (objects)
rc linux-objects-nvidia-470-5.4.0-1064-aws 5.4.0-1064.67+1 amd64 Linux kernel nvidia modules for version 5.4.0-1064 (objects)
rc linux-objects-nvidia-470-5.4.0-1068-azure 5.4.0-1068.71+1 amd64 Linux kernel nvidia modules for version 5.4.0-1068 (objects)
rc linux-objects-nvidia-470-5.4.0-99-generic 5.4.0-99.112+1 amd64 Linux kernel nvidia modules for version 5.4.0-99 (objects)
rc linux-objects-nvidia-470-5.4.0-99-lowlatency 5.4.0-99.112+1 amd64 Linux kernel nvidia modules for version 5.4.0-99 (objects)
ii linux-objects-nvidia-510-5.13.0-1029-oem 5.13.0-1029.36+1 amd64 Linux kernel nvidia modules for version 5.13.0-1029 (objects)
ii linux-signatures-nvidia-5.13.0-1029-oem 5.13.0-1029.36+1 amd64 Linux kernel signatures for nvidia modules for version 5.13.0-1029-oem
rc nvidia-compute-utils-470-server 470.103.01-0ubuntu0.20.04.1 amd64 NVIDIA compute utilities
ii nvidia-compute-utils-510 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA compute utilities
rc nvidia-dkms-470-server 470.103.01-0ubuntu0.20.04.1 amd64 NVIDIA DKMS package
ii nvidia-driver-510 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA driver metapackage
rc nvidia-kernel-common-470-server 470.103.01-0ubuntu0.20.04.1 amd64 Shared files used with the kernel module
ii nvidia-kernel-common-510 510.47.03-0ubuntu0.20.04.1 amd64 Shared files used with the kernel module
ii nvidia-kernel-source-510 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA kernel source package
ii nvidia-prime 0.8.16~0.20.04.1 all Tools to enable NVIDIA's Prime
ii nvidia-settings 470.57.01-0ubuntu0.20.04.2 amd64 Tool for configuring the NVIDIA graphics driver
ii nvidia-utils-510 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA driver support binaries
ii screen-resolution-extra 0.18build1 all Extension for the nvidia-settings control panel
ii xserver-xorg-video-nvidia-510 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA binary Xorg driver
Yes, it’s a wild mix of 470-server and 510, with neither driver complete. Better to remove everything *nvidia* and reinstall using the Software & Updates application.
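The lines starting with "rc" in that listing are packages that were removed but still have configuration files left behind. To list just those leftovers before cleaning up, a filter like this could be used. It is a sketch and nvidia_rc_packages is a made-up name.

```shell
#!/bin/sh
# Read `dpkg -l` output on stdin and print the nvidia packages whose status is
# "rc" (removed, configuration files still present). Hypothetical helper.
nvidia_rc_packages() {
    # Column 1 is the status flag, column 2 the package name.
    awk '$1 == "rc" && $2 ~ /nvidia/ { print $2 }'
}
```

For example, dpkg -l | nvidia_rc_packages shows the leftovers, and dpkg -l | nvidia_rc_packages | xargs -r sudo apt-get purge -y would purge them. To remove the installed packages as well, a commonly used form is sudo apt-get purge '*nvidia*', but please review the list apt prints before confirming, since the pattern can match more than you expect.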
loicus@loicus-DA:~$ dpkg -l |grep nvidia
ii libnvidia-cfg1-510:amd64 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA binary OpenGL/GLX configuration library
ii libnvidia-common-510 510.47.03-0ubuntu0.20.04.1 all Shared files used by the NVIDIA libraries
ii libnvidia-compute-510:amd64 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA libcompute package
ii libnvidia-compute-510:i386 510.47.03-0ubuntu0.20.04.1 i386 NVIDIA libcompute package
ii libnvidia-decode-510:amd64 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA Video Decoding runtime libraries
ii libnvidia-decode-510:i386 510.47.03-0ubuntu0.20.04.1 i386 NVIDIA Video Decoding runtime libraries
ii libnvidia-encode-510:amd64 510.47.03-0ubuntu0.20.04.1 amd64 NVENC Video Encoding runtime library
ii libnvidia-encode-510:i386 510.47.03-0ubuntu0.20.04.1 i386 NVENC Video Encoding runtime library
ii libnvidia-extra-510:amd64 510.47.03-0ubuntu0.20.04.1 amd64 Extra libraries for the NVIDIA driver
ii libnvidia-fbc1-510:amd64 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA OpenGL-based Framebuffer Capture runtime library
ii libnvidia-fbc1-510:i386 510.47.03-0ubuntu0.20.04.1 i386 NVIDIA OpenGL-based Framebuffer Capture runtime library
ii libnvidia-gl-510:amd64 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
ii libnvidia-gl-510:i386 510.47.03-0ubuntu0.20.04.1 i386 NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
ii nvidia-compute-utils-510 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA compute utilities
ii nvidia-dkms-510 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA DKMS package
ii nvidia-driver-510 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA driver metapackage
ii nvidia-kernel-common-510 510.47.03-0ubuntu0.20.04.1 amd64 Shared files used with the kernel module
ii nvidia-kernel-source-510 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA kernel source package
ii nvidia-prime 0.8.16~0.20.04.1 all Tools to enable NVIDIA's Prime
ii nvidia-settings 470.57.01-0ubuntu0.20.04.2 amd64 Tool for configuring the NVIDIA graphics driver
ii nvidia-utils-510 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA driver support binaries
ii screen-resolution-extra 0.18build1 all Extension for the nvidia-settings control panel
ii xserver-xorg-video-nvidia-510 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA binary Xorg driver
loicus@loicus-DA:~$ dkms status
nvidia, 510.47.03, 5.16.0-7.2-liquorix-amd64, x86_64: installed
I rebooted at this point
loicus@loicus-DA:~$ sudo nvidia-smi
No devices were found
I do indeed get a Code 43 after installing the latest driver and rebooting.
If I try to open the NVIDIA Control Panel, nothing happens, and when I try to open the NVIDIA RTX Desktop Manager I get an error saying that I should at least have an RTX GPU.