Desktop computer won't turn on after an update when the GPU is connected

In short:
Desktop computer, running Ubuntu 22.04, GPU RTX 2080 Super
After an update, I turned on my computer, and it went straight to a black screen showing the following three lines (also attached as an image):
[ 0.158596] x86/cpu SGX disabled by BIOS
/dev/nvme1n1p1: recovering journal
/dev/nvme1n1p1: clean, 1672564/61054976 files 68378402/244190208 blocks

(The SGX warning is usually there; it’s nothing special.)
After this, the computer hangs forever; repeated restarts didn’t change anything.
This black screen appears before anything else (the Ubuntu logo, the motherboard splash screen, etc.).

If I disconnect the GPU from the motherboard and connect an HDMI cable directly to the motherboard, the computer boots “as normal.”
When I connect the HDMI cable to the motherboard while the GPU is still connected, I get a long list of lines of the form [ STATUS ] MESSAGE; for example, “[ FAILED ] Failed to start NVIDIA persistence daemon.”
There are many of these lines; most of them are “OK,” but some are “FAILED.” I attached a short video of that, since it’s too much for a single image.
After about 30-60 seconds of showing this screen with the errors, I get a completely black screen with nothing on it.

A few more details/speculations:
This is my personal desktop and I use it daily. I didn’t install anything special before this happened.
On Friday, 05/10/24, the computer did a large update. I didn’t restart it immediately.
On Sunday, when I tried to use the computer for the first time since the update, I ended up in this situation. I have since done another “sudo apt update,” but I get the same issue. I can only use the computer when the GPU is disconnected (using only the motherboard’s basic integrated graphics).

I couldn’t find any command that would tell me which NVIDIA driver I’m using (nvidia-smi, etc. don’t work).

I attach the image and video I have and the result from running nvidia-bug-report.sh

Any help or advice on how to resolve this would be much appreciated.
Thanks



nvidia-bug-report.log.gz (113.9 KB)

There don’t seem to be any NVIDIA kernel modules installed. Please post the output of
dpkg -l |grep nvidia
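
For anyone who wants to check the "no kernel modules" part directly, here is a rough sketch (not from the original posts). It looks for any nvidia*.ko file under the module tree of the running kernel. The function name has_nvidia_module is made up for illustration, and the default path assumes the usual /lib/modules/<kernel version> layout.

```shell
# Succeeds if any nvidia*.ko* file exists under the given module tree.
# With no argument, it checks the tree of the currently running kernel.
# (has_nvidia_module is a hypothetical helper name, not a standard tool.)
has_nvidia_module() {
    dir="${1:-/lib/modules/$(uname -r)}"
    [ -n "$(find "$dir" -name 'nvidia*.ko*' 2>/dev/null | head -n 1)" ]
}

# Usage on the affected machine:
#   if has_nvidia_module; then
#       echo "nvidia module found for $(uname -r)"
#   else
#       echo "no nvidia module for $(uname -r)"
#   fi
```

If this reports no module for the running kernel even though a driver package is installed, that would be consistent with the module not having been built for the new kernel.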

Hi @generix , thanks for the reply(!)
Below is the output I get from running dpkg -l |grep nvidia

ii  libnvidia-cfg1-545:amd64                          545.23.08-0ubuntu1                      amd64        NVIDIA binary OpenGL/GLX configuration library
ii  libnvidia-common-545                              545.23.08-0ubuntu1                      all          Shared files used by the NVIDIA libraries
rc  libnvidia-compute-515:amd64                       525.125.06-0ubuntu0.22.04.1             amd64        Transitional package for libnvidia-compute-525
rc  libnvidia-compute-520:amd64                       520.56.06-0ubuntu0.22.04.1              amd64        NVIDIA libcompute package
rc  libnvidia-compute-525:amd64                       525.147.05-0ubuntu0.22.04.1             amd64        NVIDIA libcompute package
rc  libnvidia-compute-535:amd64                       535.129.03-0ubuntu0.22.04.1             amd64        NVIDIA libcompute package
rc  libnvidia-compute-535-server:amd64                535.129.03-0ubuntu0.22.04.1             amd64        NVIDIA libcompute package
ii  libnvidia-compute-545:amd64                       545.23.08-0ubuntu1                      amd64        NVIDIA libcompute package
ii  libnvidia-compute-545:i386                        545.23.08-0ubuntu1                      i386         NVIDIA libcompute package
ii  libnvidia-decode-545:amd64                        545.23.08-0ubuntu1                      amd64        NVIDIA Video Decoding runtime libraries
ii  libnvidia-decode-545:i386                         545.23.08-0ubuntu1                      i386         NVIDIA Video Decoding runtime libraries
ii  libnvidia-encode-545:amd64                        545.23.08-0ubuntu1                      amd64        NVENC Video Encoding runtime library
ii  libnvidia-encode-545:i386                         545.23.08-0ubuntu1                      i386         NVENC Video Encoding runtime library
ii  libnvidia-extra-545:amd64                         545.23.08-0ubuntu1                      amd64        Extra libraries for the NVIDIA driver
ii  libnvidia-fbc1-545:amd64                          545.23.08-0ubuntu1                      amd64        NVIDIA OpenGL-based Framebuffer Capture runtime library
ii  libnvidia-fbc1-545:i386                           545.23.08-0ubuntu1                      i386         NVIDIA OpenGL-based Framebuffer Capture runtime library
ii  libnvidia-gl-545:amd64                            545.23.08-0ubuntu1                      amd64        NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
ii  libnvidia-gl-545:i386                             545.23.08-0ubuntu1                      i386         NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
rc  linux-modules-nvidia-535-5.15.0-78-generic        5.15.0-78.85+1                          amd64        Linux kernel nvidia modules for version 5.15.0-78
rc  linux-modules-nvidia-535-5.15.0-91-generic        5.15.0-91.101                           amd64        Linux kernel nvidia modules for version 5.15.0-91
rc  linux-modules-nvidia-535-server-5.15.0-78-generic 5.15.0-78.85+1                          amd64        Linux kernel nvidia modules for version 5.15.0-78
rc  linux-modules-nvidia-535-server-5.15.0-79-generic 5.15.0-79.86+2                          amd64        Linux kernel nvidia modules for version 5.15.0-79
rc  linux-modules-nvidia-535-server-5.15.0-82-generic 5.15.0-82.91                            amd64        Linux kernel nvidia modules for version 5.15.0-82
rc  linux-modules-nvidia-535-server-5.15.0-83-generic 5.15.0-83.92                            amd64        Linux kernel nvidia modules for version 5.15.0-83
rc  linux-modules-nvidia-535-server-5.15.0-84-generic 5.15.0-84.93+2                          amd64        Linux kernel nvidia modules for version 5.15.0-84
rc  linux-modules-nvidia-535-server-5.15.0-86-generic 5.15.0-86.96+1                          amd64        Linux kernel nvidia modules for version 5.15.0-86
rc  linux-modules-nvidia-535-server-5.15.0-87-generic 5.15.0-87.97                            amd64        Linux kernel nvidia modules for version 5.15.0-87
rc  linux-modules-nvidia-535-server-5.15.0-88-generic 5.15.0-88.98+1                          amd64        Linux kernel nvidia modules for version 5.15.0-88
rc  linux-modules-nvidia-535-server-5.15.0-89-generic 5.15.0-89.99+1                          amd64        Linux kernel nvidia modules for version 5.15.0-89
rc  linux-modules-nvidia-535-server-5.15.0-91-generic 5.15.0-91.101                           amd64        Linux kernel nvidia modules for version 5.15.0-91
rc  linux-objects-nvidia-535-5.15.0-78-generic        5.15.0-78.85+1                          amd64        Linux kernel nvidia modules for version 5.15.0-78 (objects)
rc  linux-objects-nvidia-535-5.15.0-91-generic        5.15.0-91.101+2                         amd64        Linux kernel nvidia modules for version 5.15.0-91 (objects)
rc  linux-objects-nvidia-535-server-5.15.0-78-generic 5.15.0-78.85+1                          amd64        Linux kernel nvidia modules for version 5.15.0-78 (objects)
rc  linux-objects-nvidia-535-server-5.15.0-79-generic 5.15.0-79.86+2                          amd64        Linux kernel nvidia modules for version 5.15.0-79 (objects)
rc  linux-objects-nvidia-535-server-5.15.0-82-generic 5.15.0-82.91                            amd64        Linux kernel nvidia modules for version 5.15.0-82 (objects)
rc  linux-objects-nvidia-535-server-5.15.0-83-generic 5.15.0-83.92                            amd64        Linux kernel nvidia modules for version 5.15.0-83 (objects)
rc  linux-objects-nvidia-535-server-5.15.0-84-generic 5.15.0-84.93+2                          amd64        Linux kernel nvidia modules for version 5.15.0-84 (objects)
rc  linux-objects-nvidia-535-server-5.15.0-86-generic 5.15.0-86.96+1                          amd64        Linux kernel nvidia modules for version 5.15.0-86 (objects)
rc  linux-objects-nvidia-535-server-5.15.0-87-generic 5.15.0-87.97                            amd64        Linux kernel nvidia modules for version 5.15.0-87 (objects)
rc  linux-objects-nvidia-535-server-5.15.0-88-generic 5.15.0-88.98+1                          amd64        Linux kernel nvidia modules for version 5.15.0-88 (objects)
rc  linux-objects-nvidia-535-server-5.15.0-89-generic 5.15.0-89.99+1                          amd64        Linux kernel nvidia modules for version 5.15.0-89 (objects)
rc  linux-objects-nvidia-535-server-5.15.0-91-generic 5.15.0-91.101+2                         amd64        Linux kernel nvidia modules for version 5.15.0-91 (objects)
rc  nvidia-compute-utils-525                          525.125.06-0ubuntu0.22.04.1             amd64        NVIDIA compute utilities
rc  nvidia-compute-utils-535                          535.129.03-0ubuntu0.22.04.1             amd64        NVIDIA compute utilities
rc  nvidia-compute-utils-535-server                   535.129.03-0ubuntu0.22.04.1             amd64        NVIDIA compute utilities
ii  nvidia-compute-utils-545                          545.23.08-0ubuntu1                      amd64        NVIDIA compute utilities
rc  nvidia-cuda-toolkit                               11.5.1-1ubuntu1                         amd64        NVIDIA CUDA development toolkit
rc  nvidia-dkms-525                                   525.125.06-0ubuntu0.22.04.1             amd64        NVIDIA DKMS package
ii  nvidia-dkms-545                                   545.23.08-0ubuntu1                      amd64        NVIDIA DKMS package
ii  nvidia-driver-545                                 545.23.08-0ubuntu1                      amd64        NVIDIA driver metapackage
rc  nvidia-kernel-common-515:amd64                    525.125.06-0ubuntu0.22.04.1             amd64        Transitional package for nvidia-kernel-common-525
rc  nvidia-kernel-common-525                          525.125.06-0ubuntu0.22.04.1             amd64        Shared files used with the kernel module
rc  nvidia-kernel-common-535                          535.129.03-0ubuntu0.22.04.1             amd64        Shared files used with the kernel module
rc  nvidia-kernel-common-535-server                   535.129.03-0ubuntu0.22.04.1             amd64        Shared files used with the kernel module
ii  nvidia-kernel-common-545                          545.23.08-0ubuntu1                      amd64        Shared files used with the kernel module
ii  nvidia-kernel-source-545                          545.23.08-0ubuntu1                      amd64        NVIDIA kernel source package
ii  nvidia-modprobe                                   545.23.08-0ubuntu1                      amd64        Load the NVIDIA kernel driver and create device files
ii  nvidia-prime                                      0.8.17.1                                all          Tools to enable NVIDIA's Prime
ii  nvidia-settings                                   545.23.08-0ubuntu1                      amd64        Tool for configuring the NVIDIA graphics driver
ii  nvidia-utils-545                                  545.23.08-0ubuntu1                      amd64        NVIDIA driver support binaries
ii  screen-resolution-extra                           0.18.2                                  all          Extension for the nvidia-settings control panel
ii  xserver-xorg-video-nvidia-545                     545.23.08-0ubuntu1                      amd64        NVIDIA binary Xorg driver
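
A note on reading a listing like this: the first column is the package state. "ii" means the package is installed, while "rc" means it was removed and only its configuration files remain. A small sketch for splitting the two apart is below; the function name nvidia_pkgs_by_state is made up for illustration, and it matches on the package name only (so it is slightly stricter than grep nvidia, which also matches descriptions).

```shell
# Print "name version" for NVIDIA packages in the given dpkg state.
# Reads `dpkg -l` style lines on stdin; the state is the first column
# ("ii" = installed, "rc" = removed, config files remain).
# (nvidia_pkgs_by_state is a hypothetical helper name.)
nvidia_pkgs_by_state() {
    state="$1"
    awk -v state="$state" '$1 == state && $2 ~ /nvidia/ { print $2, $3 }'
}

# Usage:
#   dpkg -l | nvidia_pkgs_by_state ii   # what is actually installed
#   dpkg -l | nvidia_pkgs_by_state rc   # leftovers from older drivers
```

Run against the listing here, the "ii" filter would leave only the 545 packages (plus nvidia-prime, nvidia-modprobe, and nvidia-settings), so everything from 515 through 535 is just leftover configuration.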

I don’t think it matters, but in case it does: I ran this command with the GPU disconnected from the computer (the computer doesn’t boot when the GPU is connected). Also, everything worked until the last time I tried to use the computer. Further, in the past (1+ year ago), I trained a few models, so I know the GPU was working correctly.

I’d appreciate any advice on what to do. Should I uninstall everything and download only the latest version? Can I do it without the GPU being connected?

Yes.
I suspect you got a kernel update and the 545 driver became incompatible. Please either downgrade to the 535 driver, which got updated for newer kernels, or add the graphics-drivers PPA to install the 550 driver.
Afterwards, please create a new nvidia-bug-report.log.

@generix, you’re a true lifesaver!
I added the graphics-drivers PPA and installed the 550 driver. After a reboot, I connected the GPU back, and the computer is working(!!).
For completeness, I attached the bug report generated.
I don’t know how to start reading it, but if something weird/out-of-place jumps out, I’d appreciate your insight.

nvidia-bug-report.log.gz (attachment)

Thank you for your help!

Logs look fine.

If someone has a similar issue, what worked for me was:

sudo add-apt-repository ppa:graphics-drivers/ppa
sudo apt update
sudo apt install nvidia-driver-550

Then, a reboot.

Thank you generix!
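
As a follow-up for anyone repeating these steps, here is one way to confirm after the reboot that the driver is actually loaded (a sketch, not something posted in the thread). It wraps nvidia-smi so that a missing tool gives a clear message instead of a "command not found" error. The function name report_driver is made up for illustration.

```shell
# Print the GPU name and driver version if nvidia-smi is available;
# otherwise print a short message and return non-zero.
# (report_driver is a hypothetical helper name.)
report_driver() {
    if command -v nvidia-smi >/dev/null 2>&1; then
        nvidia-smi --query-gpu=name,driver_version --format=csv,noheader
    else
        echo "nvidia-smi not found"
        return 1
    fi
}

# Usage after rebooting with the GPU connected:
#   report_driver
```

If nvidia-smi is present but reports that it cannot communicate with the driver, the kernel module is most likely still not loaded for the running kernel.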