Hi,
I am experiencing the same issue running NVIDIA DeepStream pipelines, though it happens intermittently.
Before January 8th, everything ran perfectly with no crashes. After an Ubuntu update, I started getting random crashes whenever I launch DeepStream pipelines, even with default configurations.
I reduced the power limit to 400W, but it still crashes randomly. Often, when I reboot after a crash, the motherboard beeps to indicate that no GPU is detected.
System Specs:
GPU: MSI RTX 5090
CPU: Intel Xeon w3-2435
RAM: Samsung M321R2GA3BB6-CQKET 2x16GB
Motherboard: HP Z4 G5 Workstation Desktop PC
PSU: 1125W
Original Environment:
Driver: 580.95.05
CUDA: 12.8 (for DeepStream)
TensorRT: 10.9.0.34 compiled for CUDA 12.8
DeepStream: 8.0
The crashes started after I installed the following updates on Ubuntu 24.04.3 LTS:
Start-Date: 2026-01-08 06:14:13
Commandline: /usr/bin/unattended-upgrade
Upgrade: libglib2.0-dev-bin:amd64 (2.80.0-6ubuntu3.5, 2.80.0-6ubuntu3.6), libglib2.0-bin:amd64 (2.80.0-6ubuntu3.5, 2.80.0-6ubuntu3.6), libglib2.0-dev:amd64 (2.80.0-6ubuntu3.5, 2.80.0-6ubuntu3.6), gir1.2-glib-2.0:amd64 (2.80.0-6ubuntu3.5, 2.80.0-6ubuntu3.6), libglib2.0-data:amd64 (2.80.0-6ubuntu3.5, 2.80.0-6ubuntu3.6), libgirepository-2.0-0:amd64 (2.80.0-6ubuntu3.5, 2.80.0-6ubuntu3.6), gir1.2-glib-2.0-dev:amd64 (2.80.0-6ubuntu3.5, 2.80.0-6ubuntu3.6), libglib2.0-0t64:amd64 (2.80.0-6ubuntu3.5, 2.80.0-6ubuntu3.6), libglib2.0-0t64:i386 (2.80.0-6ubuntu3.5, 2.80.0-6ubuntu3.6)
End-Date: 2026-01-08 06:14:17
Start-Date: 2026-01-08 08:33:11
Commandline: aptdaemon role='role-commit-packages' sender=':1.9332221'
Upgrade: libblkid-dev:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), libxnvctrl0:amd64 (590.44.01-0ubuntu1, 590.48.01-0ubuntu1), netplan-generator:amd64 (1.1.2-2~ubuntu24.04.2, 1.1.2-8ubuntu1~24.04.1), libsmartcols1:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), udev:amd64 (255.4-1ubuntu8.11, 255.4-1ubuntu8.12), systemd-oomd:amd64 (255.4-1ubuntu8.11, 255.4-1ubuntu8.12), python3.13:amd64 (3.13.10-1+noble1, 3.13.11-1+noble1), dhcpcd-base:amd64 (1:10.0.6-1ubuntu3.1, 1:10.0.6-1ubuntu3.2), libmbim-utils:amd64 (1.31.2-0ubuntu3, 1.31.2-0ubuntu3.1), mutter-common-bin:amd64 (46.2-1ubuntu0.24.04.12, 46.2-1ubuntu0.24.04.13), google-chrome-stable:amd64 (143.0.7499.40-1, 143.0.7499.192-1), libmount-dev:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), systemd-timesyncd:amd64 (255.4-1ubuntu8.11, 255.4-1ubuntu8.12), libpipewire-0.3-common:amd64 (1.0.5-1ubuntu3.1, 1.0.5-1ubuntu3.2), libmbim-glib4:amd64 (1.31.2-0ubuntu3, 1.31.2-0ubuntu3.1), libpam-systemd:amd64 (255.4-1ubuntu8.11, 255.4-1ubuntu8.12), pipewire-pulse:amd64 (1.0.5-1ubuntu3.1, 1.0.5-1ubuntu3.2), libgdm1:amd64 (46.2-1ubuntu1~24.04.4, 46.2-1ubuntu1~24.04.5), python3-netplan:amd64 (1.1.2-2~ubuntu24.04.2, 1.1.2-8ubuntu1~24.04.1), libpython3.13-stdlib:amd64 (3.13.10-1+noble1, 3.13.11-1+noble1), libmutter-14-0:amd64 (46.2-1ubuntu0.24.04.12, 46.2-1ubuntu0.24.04.13), libsystemd0:amd64 (255.4-1ubuntu8.11, 255.4-1ubuntu8.12), libsystemd0:i386 (255.4-1ubuntu8.11, 255.4-1ubuntu8.12), libmount1:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), libmount1:i386 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), libnss-systemd:amd64 (255.4-1ubuntu8.11, 255.4-1ubuntu8.12), libudev-dev:amd64 (255.4-1ubuntu8.11, 255.4-1ubuntu8.12), pipewire:amd64 (1.0.5-1ubuntu3.1, 1.0.5-1ubuntu3.2), mutter-common:amd64 (46.2-1ubuntu0.24.04.12, 46.2-1ubuntu0.24.04.13), util-linux:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), gnome-shell:amd64 (46.0-0ubuntu6~24.04.11, 46.0-0ubuntu6~24.04.12), systemd:amd64 (255.4-1ubuntu8.11, 255.4-1ubuntu8.12), libudev1:amd64 
(255.4-1ubuntu8.11, 255.4-1ubuntu8.12), libudev1:i386 (255.4-1ubuntu8.11, 255.4-1ubuntu8.12), fdisk:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), gnome-settings-daemon-common:amd64 (46.0-1ubuntu1, 46.0-1ubuntu1.24.04.1), python3.13-venv:amd64 (3.13.10-1+noble1, 3.13.11-1+noble1), libfdisk1:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), systemd-dev:amd64 (255.4-1ubuntu8.11, 255.4-1ubuntu8.12), eject:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), gdm3:amd64 (46.2-1ubuntu1~24.04.4, 46.2-1ubuntu1~24.04.5), gnome-shell-extension-desktop-icons-ng:amd64 (46+really47.0.9-1ubuntu4, 46+really47.0.9-1ubuntu5), gnome-settings-daemon:amd64 (46.0-1ubuntu1, 46.0-1ubuntu1.24.04.1), libspa-0.2-bluetooth:amd64 (1.0.5-1ubuntu3.1, 1.0.5-1ubuntu3.2), libuuid1:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), clickhouse-client:amd64 (25.11.2.24, 25.12.2.54), uuid-runtime:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), systemd-resolved:amd64 (255.4-1ubuntu8.11, 255.4-1ubuntu8.12), gir1.2-mutter-14:amd64 (46.2-1ubuntu0.24.04.12, 46.2-1ubuntu0.24.04.13), libmbim-proxy:amd64 (1.31.2-0ubuntu3, 1.31.2-0ubuntu3.1), gstreamer1.0-pipewire:amd64 (1.0.5-1ubuntu3.1, 1.0.5-1ubuntu3.2), uuid-dev:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), pipewire-audio:amd64 (1.0.5-1ubuntu3.1, 1.0.5-1ubuntu3.2), pipewire-bin:amd64 (1.0.5-1ubuntu3.1, 1.0.5-1ubuntu3.2), gnome-shell-common:amd64 (46.0-0ubuntu6~24.04.11, 46.0-0ubuntu6~24.04.12), nvidia-settings:amd64 (590.44.01-0ubuntu1, 590.48.01-0ubuntu1), gir1.2-gdm-1.0:amd64 (46.2-1ubuntu1~24.04.4, 46.2-1ubuntu1~24.04.5), rfkill:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), mount:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), clickhouse-common-static:amd64 (25.11.2.24, 25.12.2.54), libspa-0.2-modules:amd64 (1.0.5-1ubuntu3.1, 1.0.5-1ubuntu3.2), libwhoopsie0:amd64 (0.2.77build3, 0.2.77ubuntu0.1), libsystemd-shared:amd64 (255.4-1ubuntu8.11, 255.4-1ubuntu8.12), netplan.io:amd64 (1.1.2-2~ubuntu24.04.2, 1.1.2-8ubuntu1~24.04.1), libpipewire-0.3-0t64:amd64 (1.0.5-1ubuntu3.1, 
1.0.5-1ubuntu3.2), clickhouse-server:amd64 (25.11.2.24, 25.12.2.54), systemd-sysv:amd64 (255.4-1ubuntu8.11, 255.4-1ubuntu8.12), libblkid1:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), libblkid1:i386 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), whoopsie:amd64 (0.2.77build3, 0.2.77ubuntu0.1), libpipewire-0.3-modules:amd64 (1.0.5-1ubuntu3.1, 1.0.5-1ubuntu3.2), nvidia-firmware-580-580.95.05:amd64 (580.95.05-0ubuntu0.24.04.2, 580.95.05-0ubuntu0.24.04.3), bsdutils:amd64 (1:2.39.3-9ubuntu6.3, 1:2.39.3-9ubuntu6.4), libnetplan1:amd64 (1.1.2-2~ubuntu24.04.2, 1.1.2-8ubuntu1~24.04.1), bsdextrautils:amd64 (2.39.3-9ubuntu6.3, 2.39.3-9ubuntu6.4), pipewire-alsa:amd64 (1.0.5-1ubuntu3.1, 1.0.5-1ubuntu3.2)
End-Date: 2026-01-08 08:33:54
Start-Date: 2026-01-08 08:34:27
Commandline: aptdaemon role='role-commit-packages' sender=':1.9332221'
Upgrade: linux-firmware:amd64 (20240318.git3b128b60-0ubuntu2.21, 20240318.git3b128b60-0ubuntu2.22)
End-Date: 2026-01-08 08:34:34
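To narrow down which of these upgrades touch the GPU stack, the Upgrade: lines can be filtered by package name. This is only a minimal Python sketch, assuming the text is copied from /var/log/apt/history.log. The function names and the keyword list are my own illustrative choices, not part of any tool, and a keyword match does not mean the package caused the crashes.

```python
import re

# One entry in an apt history "Upgrade:" line looks like
#   name:arch (old_version, new_version)
# Whitespace is matched loosely so entries wrapped across lines still parse.
UPGRADE_ENTRY = re.compile(r"([\w.+-]+):(\w+)\s+\(([^,()]+),\s+([^()]+)\)")

# Hypothetical keyword list: substrings that suggest a GPU or firmware package.
GPU_KEYWORDS = ("nvidia", "nvctrl", "firmware")


def parse_upgrades(upgrade_text):
    """Return (name, arch, old_version, new_version) tuples from Upgrade: text."""
    body = upgrade_text.split("Upgrade:", 1)[-1]
    return [match.groups() for match in UPGRADE_ENTRY.finditer(body)]


def gpu_related(upgrades, keywords=GPU_KEYWORDS):
    """Keep only the upgrades whose package name contains one of the keywords."""
    return [u for u in upgrades if any(k in u[0] for k in keywords)]


# Shortened sample built from entries in the log above.
sample = (
    "Upgrade: libxnvctrl0:amd64 (590.44.01-0ubuntu1, 590.48.01-0ubuntu1), "
    "dhcpcd-base:amd64 (1:10.0.6-1ubuntu3.1, 1:10.0.6-1ubuntu3.2), "
    "nvidia-settings:amd64 (590.44.01-0ubuntu1, 590.48.01-0ubuntu1), "
    "nvidia-firmware-580-580.95.05:amd64 "
    "(580.95.05-0ubuntu0.24.04.2, 580.95.05-0ubuntu0.24.04.3)"
)

for name, arch, old, new in gpu_related(parse_upgrades(sample)):
    print(f"{name} ({arch}): {old} -> {new}")
```

Run against the full log, the same filter would also pick up linux-firmware from the last transaction, since its name contains "firmware".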
What I have tried:
- Reducing the power limit to 400W.
- Upgrading to the 590 driver branch.
- Downgrading to the original driver (580.95.05) with a clean installation.
- Downgrading to driver 570.211.01 (clean installation, headless, no desktop environment).
- Locking GPU clocks with nvidia-smi --lock-gpu-clocks=2100,2100 before running the pipeline.
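For anyone who wants to reproduce the power and clock settings from the list above, here is a rough Python sketch that builds the corresponding nvidia-smi commands. It assumes both settings are applied through nvidia-smi on GPU index 0. The helper names and the dry-run default are hypothetical, and actually applying the settings normally requires root.

```python
import subprocess


def tuning_commands(power_limit_w=400, clock_mhz=2100, gpu_index=0):
    """Build nvidia-smi invocations for a power cap and a fixed GPU clock."""
    base = ["nvidia-smi", "-i", str(gpu_index)]
    return [
        base + [f"--power-limit={power_limit_w}"],
        base + [f"--lock-gpu-clocks={clock_mhz},{clock_mhz}"],
    ]


def reset_commands(gpu_index=0):
    """Build the invocation that removes the clock lock again."""
    return [["nvidia-smi", "-i", str(gpu_index), "--reset-gpu-clocks"]]


def apply(commands, dry_run=True):
    """Print each command; run it only when dry_run is False."""
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


# Dry run: only prints the commands, nothing is changed on the system.
apply(tuning_commands(power_limit_w=400, clock_mhz=2100))
```

Calling apply(reset_commands(), dry_run=False) afterwards would undo the clock lock, which helps when comparing crash behaviour with and without it.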
Current Situation:
Yesterday, I had one crash at the start of the day. After that, I ran my pipeline about 30 times with no errors. I didn’t shut down the PC, but today I got another crash (black screen and GPU not detected upon reboot).
Here are the logs I get when it crashes:
ene 22 08:49:39 goia-pc-HP-Z4-G5-Workstation-Desktop-PC kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000ca7e:6:0:0x0000000f
ene 22 08:49:39 goia-pc-HP-Z4-G5-Workstation-Desktop-PC kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000ca7e:4:0:0x0000000f
ene 22 08:49:40 goia-pc-HP-Z4-G5-Workstation-Desktop-PC kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000ca7e:0:0:0x0000000f
ene 22 08:49:40 goia-pc-HP-Z4-G5-Workstation-Desktop-PC kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000ca7e:2:0:0x0000000f
ene 22 08:49:40 goia-pc-HP-Z4-G5-Workstation-Desktop-PC kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000ca7e:4:0:0x0000000f
ene 22 08:49:40 goia-pc-HP-Z4-G5-Workstation-Desktop-PC kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000ca7e:6:0:0x0000000f
-- Boot 68d79d5335ac4537bb3da60040c158ae --
ene 22 09:01:34 goia-pc-HP-Z4-G5-Workstation-Desktop-PC kernel:
ene 22 09:01:56 goia-pc-HP-Z4-G5-Workstation-Desktop-PC gdm-password][5034]: gkr-pam: unable to locate daemon control file
ene 22 09:01:57 goia-pc-HP-Z4-G5-Workstation-Desktop-PC gdm3[1669]: Gdm: on_display_added: assertion 'GDM_IS_REMOTE_DISPLAY (display)' failed
ene 22 09:01:59 goia-pc-HP-Z4-G5-Workstation-Desktop-PC systemd[5077]: Failed to start app-gnome-gnome\x2dkeyring\x2dpkcs11-5820.scope - Application launched by gnome-sessio>
ene 22 09:01:59 goia-pc-HP-Z4-G5-Workstation-Desktop-PC systemd[5077]: Failed to start app-gnome-xdg\x2duser\x2ddirs-5837.scope - Application launched by gnome-session-binar>
ene 22 09:02:01 goia-pc-HP-Z4-G5-Workstation-Desktop-PC gdm3[1669]: Gdm: on_display_removed: assertion 'GDM_IS_REMOTE_DISPLAY (display)' failed
-- Boot d517f0bf88de4c8eb6f911200ceddc5d --
ene 22 09:16:59 goia-pc-HP-Z4-G5-Workstation-Desktop-PC kernel:
ene 22 09:17:14 goia-pc-HP-Z4-G5-Workstation-Desktop-PC gdm-password][4749]: gkr-pam: unable to locate daemon control file
ene 22 09:17:15 goia-pc-HP-Z4-G5-Workstation-Desktop-PC gdm3[1715]: Gdm: on_display_added: assertion 'GDM_IS_REMOTE_DISPLAY (display)' failed
ene 22 09:17:17 goia-pc-HP-Z4-G5-Workstation-Desktop-PC systemd[4789]: Failed to start app-gnome-gnome\x2dkeyring\x2dpkcs11-5565.scope - Application launched by gnome-sessio>
ene 22 09:17:19 goia-pc-HP-Z4-G5-Workstation-Desktop-PC systemd[4789]: Failed to start app-gnome-user\x2ddirs\x2dupdate\x2dgtk-6000.scope - Application launched by gnome-ses>
ene 22 09:17:20 goia-pc-HP-Z4-G5-Workstation-Desktop-PC gdm3[1715]: Gdm: on_display_removed: assertion 'GDM_IS_REMOTE_DISPLAY (display)' failed
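To separate the GPU messages from the desktop noise in logs like these, the journal text can be grouped by boot and filtered for NVIDIA kernel lines. This is a small Python sketch, assuming the journal output has been saved as plain text. The function name, the label for the first boot, and the pattern list (nvidia-modeset, NVRM, Xid) are my own choices and may miss other relevant messages.

```python
import re

# journalctl separates boots with a line like "-- Boot <32 hex chars> --".
# \W* also tolerates dashes that a forum has converted to other characters.
BOOT_MARKER = re.compile(r"^\W*Boot ([0-9a-f]{32})\W*$")

# Hypothetical pattern list for NVIDIA driver messages.
GPU_PATTERN = re.compile(r"nvidia-modeset|NVRM|Xid")

FIRST_BOOT = "(first boot in excerpt)"


def gpu_lines_by_boot(journal_text):
    """Map each boot ID to the NVIDIA-related lines logged during that boot."""
    boots = {FIRST_BOOT: []}
    current = FIRST_BOOT
    for line in journal_text.splitlines():
        marker = BOOT_MARKER.match(line.strip())
        if marker:
            current = marker.group(1)
            boots[current] = []
        elif GPU_PATTERN.search(line):
            boots[current].append(line)
    return boots


# Sample taken from the log excerpt above.
sample = "\n".join([
    "ene 22 08:49:39 goia-pc-HP-Z4-G5-Workstation-Desktop-PC kernel: "
    "nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel "
    "state: 0x0000ca7e:6:0:0x0000000f",
    "-- Boot 68d79d5335ac4537bb3da60040c158ae --",
    "ene 22 09:01:56 goia-pc-HP-Z4-G5-Workstation-Desktop-PC "
    "gdm-password][5034]: gkr-pam: unable to locate daemon control file",
])

for boot_id, lines in gpu_lines_by_boot(sample).items():
    print(boot_id, len(lines))
```

In the excerpt above, only the boot that crashed contains nvidia-modeset lines. The two later boots show only GDM and GNOME session messages.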