RTX 4090 Multi GPU Segmentation Fault with Core Dump

When i use Tensorflow with
OS = Ubuntu20.04
Driver = 525.89.02
GPU = RTX 4090 4ea
CUDA=11.8
cuDNN = 8.6.0
tensorflow=2.7.0

This is my nvidia-smi topo -m
image

and i got this syslog error

May 31 15:18:50 oem-System-Product-Name systemd[1]: Stopped GNOME Display Manager.
May 31 15:18:50 oem-System-Product-Name systemd[1]: Starting Detect the available GPUs and deal with any system changes…
May 31 15:18:50 oem-System-Product-Name kernel: [2525547.622623] gpu-manager[3884989]: segfault at 18 ip 00007fb8c230dad0 sp 00007fffd4ba6a40 error 6 in libkmod.so.2.3.5[7fb8c2308000+11000]
May 31 15:18:50 oem-System-Product-Name kernel: [2525547.622643] Code: f8 ff ff 89 44 24 0c 44 0f be f8 40 0f be c5 41 29 c7 45 8d 77 01 49 63 c6 48 8d 3c 85 20 00 00 00 e8 64 ac ff ff 8b 54 24 0c <40> 88 68 18 49 89 c5 88 50 19 45 85 f6 7e 25 48 8d 68 1c 4e 8d 7c
May 31 15:18:50 oem-System-Product-Name systemd[1]: gpu-manager.service: Main process exited, code=dumped, status=11/SEGV
May 31 15:18:50 oem-System-Product-Name systemd[1]: gpu-manager.service: Failed with result ‘core-dump’.
May 31 15:18:50 oem-System-Product-Name systemd[1]: Failed to start Detect the available GPUs and deal with any system changes.
May 31 15:18:50 oem-System-Product-Name systemd[1]: Starting GNOME Display Manager…
May 31 15:18:50 oem-System-Product-Name systemd[1]: Started GNOME Display Manager.
May 31 15:18:55 oem-System-Product-Name systemd[3884512]: tracker-extract.service: Succeeded.
May 31 15:19:02 oem-System-Product-Name systemd[1]: Started Session 138869 of user oem.
May 31 15:19:02 oem-System-Product-Name kernel: [2525559.628156] traps: python[3885187] trap invalid opcode ip:7fb410e5fa30 sp:7fff6b96ea28 error:0 in _openssl.abi3.so[7fb410d8d000+311000]
May 31 15:19:02 oem-System-Product-Name systemd[1]: session-138869.scope: Succeeded.
May 31 15:19:11 oem-System-Product-Name rtkit-daemon[1635]: Warning: PolicyKit call failed: Failed to activate service ‘org.freedesktop.PolicyKit1’: timed out (service_start_timeout=25000ms)
May 31 15:19:11 oem-System-Product-Name gdm-launch-environment]: accountsservice: SetXSession call failed: GDBus.Error:org.freedesktop.Accounts.Error.PermissionDenied: Not authorized: GDBus.Error:org.freedesktop.DBus.Error.TimedOut: Failed to activate service ‘org.freedesktop.PolicyKit1’: timed out (service_start_timeout=25000ms)
May 31 15:19:11 oem-System-Product-Name dbus-daemon[1076]: [system] Activating via systemd: service name=‘org.freedesktop.PolicyKit1’ unit=‘polkit.service’ requested by ‘:1.34’ (uid=0 pid=1635 comm=“/usr/libexec/rtkit-daemon " label=“unconfined”)
May 31 15:19:11 oem-System-Product-Name systemd[1]: Starting Authorization Manager…
May 31 15:19:11 oem-System-Product-Name polkitd[3885285]: started daemon version 0.105 using authority implementation local' version 0.105’
May 31 15:19:11 oem-System-Product-Name dbus-daemon[1076]: [system] Successfully activated service ‘org.freedesktop.PolicyKit1’
May 31 15:19:11 oem-System-Product-Name systemd[1]: Started Authorization Manager.
May 31 15:19:11 oem-System-Product-Name kernel: [2525568.963318] traps: polkitd[3885285] general protection fault ip:7fc6538c6cbd sp:7fff85e2b078 error:0 in libsystemd.so.0.28.0[7fc6538b3000+75000]
May 31 15:19:11 oem-System-Product-Name systemd[1]: polkit.service: Main process exited, code=dumped, status=11/SEGV
May 31 15:19:11 oem-System-Product-Name systemd[1]: polkit.service: Failed with result ‘core-dump’.
May 31 15:19:11 oem-System-Product-Name rtkit-daemon[1635]: Warning: PolicyKit call failed: Message recipient disconnected from message bus without replying
May 31 15:19:11 oem-System-Product-Name gdm-launch-environment]: accountsservice: SetLanguage for language ko failed: GDBus.Error:org.freedesktop.Accounts.Error.PermissionDenied: Not authorized: GDBus.Error:org.freedesktop.DBus.Error.NoReply: Message recipient disconnected from message bus without replying
May 31 15:19:11 oem-System-Product-Name dbus-daemon[1076]: [system] Activating via systemd: service name=‘org.freedesktop.PolicyKit1’ unit=‘polkit.service’ requested by ‘:1.34’ (uid=0 pid=1635 comm=”/usr/libexec/rtkit-daemon " label=“unconfined”)
May 31 15:19:11 oem-System-Product-Name systemd[1]: Starting Authorization Manager…
May 31 15:19:12 oem-System-Product-Name systemd[1]: Started Session c4 of user gdm.
May 31 15:19:12 oem-System-Product-Name polkitd[3885293]: started daemon version 0.105 using authority implementation local' version 0.105’
May 31 15:19:12 oem-System-Product-Name dbus-daemon[1076]: [system] Successfully activated service ‘org.freedesktop.PolicyKit1’
May 31 15:19:12 oem-System-Product-Name systemd[1]: Started Authorization Manager.
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (–) Log file renamed from “/var/log/Xorg.pid-3885299.log” to “/var/log/Xorg.0.log”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: X.Org X Server 1.20.13
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: X Protocol Version 11, Revision 0
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: Build Operating System: linux Ubuntu
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: Current Operating System: Linux oem-System-Product-Name 5.15.0-69-generic #76~20.04.1-Ubuntu SMP Mon Mar 20 15:54:19 UTC 2023 x86_64
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: Kernel command line: BOOT_IMAGE=/boot/vmlinuz-5.15.0-69-generic root=UUID=5595e8e9-0456-4deb-98ae-1512694aaee6 ro quiet splash net.ifnames=0 biosdevname=0 vt.handoff=7
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: Build Date: 06 July 2022 01:53:24PM
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: xorg-server 2:1.20.13-1ubuntu1~20.04.3 (For technical support please see Enterprise open source support | Ubuntu)
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: Current version of pixman: 0.38.4
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011Before reporting problems, check http://wiki.x.org
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011to make sure that you have the latest version.
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: Markers: (–) probed, () from config file, (==) default setting,
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011(++) from command line, (!!) notice, (II) informational,
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011(WW) warning, (EE) error, (NI) not implemented, (??) unknown.
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (==) Log file: “/var/log/Xorg.0.log”, Time: Wed May 31 15:19:12 2023
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (==) Using config file: “/etc/X11/xorg.conf”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (==) Using system config directory “/usr/share/X11/xorg.conf.d”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (==) ServerLayout “Layout0”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (
) |–>Screen “Screen0” (0)
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: () | |–>Monitor “Monitor0”
May 31 15:19:12 oem-System-Product-Name kernel: [2525569.122595] traps: polkitd[3885293] general protection fault ip:7f73d477acbd sp:7fffad0dbdf8 error:0 in libsystemd.so.0.28.0[7f73d4767000+75000]
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (
) | |–>Device “Device0”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: () |–>Screen “Screen1” (1)
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (
) | |–>Monitor “Monitor1”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: () | |–>Device “Device1”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (
) |–>Screen “Screen2” (2)
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: () | |–>Monitor “Monitor2”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (
) | |–>Device “Device2”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: () |–>Screen “Screen3” (3)
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (
) | |–>Monitor “Monitor3”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: () | |–>Device “Device3”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (
) |–>Input Device “Keyboard0”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: () |–>Input Device “Mouse0”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (==) Automatically adding devices
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (==) Automatically enabling devices
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (==) Automatically adding GPU devices
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (==) Automatically binding GPU devices
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (==) Max clients allowed: 256, resource mask: 0x1fffff
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (WW) The directory “/usr/share/fonts/X11/cyrillic” does not exist.
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011Entry deleted from font path.
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (WW) The directory “/usr/share/fonts/X11/100dpi/” does not exist.
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011Entry deleted from font path.
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (WW) The directory “/usr/share/fonts/X11/75dpi/” does not exist.
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011Entry deleted from font path.
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (WW) The directory “/usr/share/fonts/X11/100dpi” does not exist.
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011Entry deleted from font path.
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (WW) The directory “/usr/share/fonts/X11/75dpi” does not exist.
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011Entry deleted from font path.
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (==) FontPath set to:
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011/usr/share/fonts/X11/misc,
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011/usr/share/fonts/X11/Type1,
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011built-ins
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (==) ModulePath set to “/usr/lib/xorg/modules”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (WW) Hotplugging is on, devices using drivers ‘kbd’, ‘mouse’ or ‘vmmouse’ will be disabled.
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (WW) Disabling Keyboard0
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (WW) Disabling Mouse0
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) Loader magic: 0x55f583bc3020
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) Module ABI versions:
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011X.Org ANSI C Emulation: 0.4
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011X.Org Video Driver: 24.1
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011X.Org XInput driver : 24.1
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011X.Org Server Extension : 10.0
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (++) using VT number 1
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) systemd-logind: took control of session /org/freedesktop/login1/session/c4
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) xfree86: Adding drm device (/dev/dri/card0)
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) systemd-logind: got fd for /dev/dri/card0 226:0 fd 12 paused 0
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) xfree86: Adding drm device (/dev/dri/card1)
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) systemd-logind: got fd for /dev/dri/card1 226:1 fd 13 paused 0
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) xfree86: Adding drm device (/dev/dri/card2)
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) systemd-logind: got fd for /dev/dri/card2 226:2 fd 14 paused 0
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) xfree86: Adding drm device (/dev/dri/card3)
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) systemd-logind: got fd for /dev/dri/card3 226:3 fd 15 paused 0
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (
) OutputClass “nvidia” ModulePath extended to “/usr/lib/x86_64-linux-gnu/nvidia/xorg,/usr/lib/xorg/modules”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: () OutputClass “nvidia” ModulePath extended to “/usr/lib/x86_64-linux-gnu/nvidia/xorg,/usr/lib/x86_64-linux-gnu/nvidia/xorg,/usr/lib/xorg/modules”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (
) OutputClass “nvidia” ModulePath extended to “/usr/lib/x86_64-linux-gnu/nvidia/xorg,/usr/lib/x86_64-linux-gnu/nvidia/xorg,/usr/lib/x86_64-linux-gnu/nvidia/xorg,/usr/lib/xorg/modules”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (**) OutputClass “nvidia” ModulePath extended to “/usr/lib/x86_64-linux-gnu/nvidia/xorg,/usr/lib/x86_64-linux-gnu/nvidia/xorg,/usr/lib/x86_64-linux-gnu/nvidia/xorg,/usr/lib/x86_64-linux-gnu/nvidia/xorg,/usr/lib/xorg/modules”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (–) PCI: (25@0:0:0) 10de:2684:1458:40bf rev 161, Mem @ 0xb4000000/16777216, 0xa0000000/268435456, 0xb0000000/33554432, I/O @ 0x00007000/128, BIOS @ 0x???/524288
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (–) PCI: (26@0:0:0) 10de:2684:1458:40bf rev 161, Mem @ 0xb2000000/16777216, 0x80000000/268435456, 0x90000000/33554432, I/O @ 0x00006000/128, BIOS @ 0x???/524288
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (–) PCI: (103@0:0:0) 10de:2684:1458:40bf rev 161, Mem @ 0xf4000000/16777216, 0xe0000000/268435456, 0xf0000000/33554432, I/O @ 0x0000b000/128, BIOS @ 0x???/524288
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (–) PCI:*(104@0:0:0) 10de:2684:1458:40bf rev 161, Mem @ 0xf2000000/16777216, 0xc0000000/268435456, 0xd0000000/33554432, I/O @ 0x0000a000/128, BIOS @ 0x???/131072
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) LoadModule: “glx”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) Loading /usr/lib/xorg/modules/extensions/libglx.so
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) Module glx: vendor=“X.Org Foundation”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011compiled for 1.20.13, module version = 1.0.0
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011ABI class: X.Org Server Extension, version 10.0
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) LoadModule: “nvidia”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) Loading /usr/lib/x86_64-linux-gnu/nvidia/xorg/nvidia_drv.so
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) Module nvidia: vendor=“NVIDIA Corporation”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011compiled for 1.6.99.901, module version = 1.0.0
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011Module class: X.Org Video Driver
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) NVIDIA dlloader X Driver 525.89.02 Wed Feb 1 23:14:37 UTC 2023
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) NVIDIA Unified Driver for all Supported NVIDIA GPUs
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) systemd-logind: releasing fd for 226:3
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) Loading sub module “fb”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) LoadModule: “fb”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) Loading /usr/lib/xorg/modules/libfb.so
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) Module fb: vendor=“X.Org Foundation”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011compiled for 1.20.13, module version = 1.0.0
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011ABI class: X.Org ANSI C Emulation, version 0.4
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) Loading sub module “wfb”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) LoadModule: “wfb”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) Loading /usr/lib/xorg/modules/libwfb.so
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) Module wfb: vendor=“X.Org Foundation”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011compiled for 1.20.13, module version = 1.0.0
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: #011ABI class: X.Org ANSI C Emulation, version 0.4
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) Loading sub module “ramdac”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) LoadModule: “ramdac”
May 31 15:19:12 oem-System-Product-Name /usr/lib/gdm3/gdm-x-session[3885299]: (II) Module “ramdac” already built-in
May 31 15:19:12 oem-System-Product-Name systemd[1]: polkit.service: Main process exited, code=dumped, status=11/SEGV
May 31 15:19:12 oem-System-Product-Name systemd[1]: polkit.service: Failed with result ‘core-dump’.
May 31 15:19:12 oem-System-Product-Name rtkit-daemon[1635]: Warning: PolicyKit call failed: Message recipient disconnected from message bus without replying
May 31 15:19:12 oem-System-Product-Name dbus-daemon[1076]: [system] Activating via systemd: service name=‘org.freedesktop.PolicyKit1’ unit=‘polkit.service’ requested by ‘:1.34’ (uid=0 pid=1635 comm=“/usr/libexec/rtkit-daemon " label=“unconfined”)
May 31 15:19:12 oem-System-Product-Name systemd[1]: Starting Authorization Manager…
May 31 15:19:12 oem-System-Product-Name polkitd[3885301]: started daemon version 0.105 using authority implementation local' version 0.105’
May 31 15:19:12 oem-System-Product-Name dbus-daemon[1076]: [system] Successfully activated service ‘org.freedesktop.PolicyKit1’
May 31 15:19:12 oem-System-Product-Name systemd[1]: Started Authorization Manager.
May 31 15:19:12 oem-System-Product-Name kernel: [2525569.311342] traps: polkitd[3885301] general protection fault ip:7f344acdecbd sp:7ffdb36431c8 error:0 in libsystemd.so.0.28.0[7f344accb000+75000]
May 31 15:19:12 oem-System-Product-Name systemd[1]: polkit.service: Main process exited, code=dumped, status=11/SEGV
May 31 15:19:12 oem-System-Product-Name systemd[1]: polkit.service: Failed with result ‘core-dump’.
May 31 15:19:12 oem-System-Product-Name rtkit-daemon[1635]: Warning: PolicyKit call failed: Message recipient disconnected from message bus without replying
May 31 15:19:12 oem-System-Product-Name dbus-daemon[1076]: [system] Activating via systemd: service name=‘org.freedesktop.PolicyKit1’ unit=‘polkit.service’ requested by ‘:1.34’ (uid=0 pid=1635 comm=”/usr/libexec/rtkit-daemon " label=“unconfined”)
May 31 15:19:12 oem-System-Product-Name systemd[1]: Starting Authorization Manager…
May 31 15:19:12 oem-System-Product-Name polkitd[3885306]: started daemon version 0.105 using authority implementation local' version 0.105’
May 31 15:19:12 oem-System-Product-Name dbus-daemon[1076]: [system] Successfully activated service ‘org.freedesktop.PolicyKit1’
May 31 15:19:12 oem-System-Product-Name systemd[1]: Started Authorization Manager.
May 31 15:19:12 oem-System-Product-Name kernel: [2525569.460284] traps: polkitd[3885306] general protection fault ip:7fee98e48cbd sp:7ffc2a30fd88 error:0 in libsystemd.so.0.28.0[7fee98e35000+75000]
May 31 15:19:12 oem-System-Product-Name systemd[1]: polkit.service: Main process exited, code=dumped, status=11/SEGV
May 31 15:19:12 oem-System-Product-Name systemd[1]: polkit.service: Failed with result ‘core-dump’.
May 31 15:19:12 oem-System-Product-Name rtkit-daemon[1635]: Warning: PolicyKit call failed: Message recipient disconnected from message bus without replying
May 31 15:19:12 oem-System-Product-Name dbus-daemon[1076]: [system] Activating via systemd: service name=‘org.freedesktop.PolicyKit1’ unit=‘polkit.service’ requested by ‘:1.34’ (uid=0 pid=1635 comm="/usr/libexec/rtkit-daemon " label=“unconfined”)
May 31 15:19:12 oem-System-Product-Name systemd[1]: Starting Authorization Manager…
May 31 15:19:12 oem-System-Product-Name polkitd[3885311]: started daemon version 0.105 using authority implementation local' version 0.105’
May 31 15:19:12 oem-System-Product-Name dbus-daemon[1076]: [system] Successfully activated service ‘org.freedesktop.PolicyKit1’
May 31 15:19:12 oem-System-Product-Name systemd[1]: Started Authorization Manager.
May 31 15:19:12 oem-System-Product-Name kernel: [2525569.600179] traps: polkitd[3885311] general protection fault ip:7fd2cdb4ccbd sp:7ffcda9d0258 error:0 in libsystemd.so.0.28.0[7fd2cdb39000+75000]

and i have 2 gpu with 4090 RTX
here is my 2gpu system topology with ‘nvidia-smi topo -m’
image

2 gpu system is not dead but only 4gpu server is dead with core dump

please help me

1 Like