Update:
I did a complete system upgrade, including a kernel update:
Running scriptlet: kernel-modules-core-5.14.0-611.5.1.el9_7.x86_64
Running scriptlet: kernel-core-5.14.0-611.5.1.el9_7.x86_64
Sign command: /lib/modules/5.14.0-611.5.1.el9_7.x86_64/build/scripts/sign-file
Signing key: /var/lib/dkms/mok.key
Public certificate (MOK): /var/lib/dkms/mok.pub
Autoinstall of module nvidia/580.105.08 for kernel 5.14.0-611.5.1.el9_7.x86_64 (x86_64)
Building module(s)...
Signing module /var/lib/dkms/nvidia/580.105.08/build/nvidia.ko
Signing module /var/lib/dkms/nvidia/580.105.08/build/nvidia-uvm.ko
Signing module /var/lib/dkms/nvidia/580.105.08/build/nvidia-modeset.ko
Signing module /var/lib/dkms/nvidia/580.105.08/build/nvidia-drm.ko
Signing module /var/lib/dkms/nvidia/580.105.08/build/nvidia-peermem.ko
Installing /lib/modules/5.14.0-611.5.1.el9_7.x86_64/extra/nvidia.ko.xz
Installing /lib/modules/5.14.0-611.5.1.el9_7.x86_64/extra/nvidia-uvm.ko.xz
Installing /lib/modules/5.14.0-611.5.1.el9_7.x86_64/extra/nvidia-modeset.ko.xz
Installing /lib/modules/5.14.0-611.5.1.el9_7.x86_64/extra/nvidia-drm.ko.xz
Installing /lib/modules/5.14.0-611.5.1.el9_7.x86_64/extra/nvidia-peermem.ko.xz
Running depmod … done.
Autoinstall on 5.14.0-611.5.1.el9_7.x86_64 succeeded for module(s) nvidia.
Running scriptlet: kernel-modules-5.14.0-611.5.1.el9_7.x86_64
The system now runs kernel-core-5.14.0-611.5.1.
After a reboot, the system still does not detect the graphics card. In dmesg, there is a problem with the GSP firmware initialization:
[ 13.237230] NVRM: GPU at PCI:0000:17:00: GPU-801383be-20e1-a842-5cc3-215ec26f58d3
[ 13.237245] NVRM: Xid (PCI:0000:17:00): 143, Error status 0x65 while polling for FSP boot complete, 0x13, 0x0, 0x0, 0x0, 0x0
[ 13.237263] NVRM: kfspDumpDebugState_GB100: FSP microcode v4.76
[ 13.237267] NVRM: kfspDumpDebugState_GB100: GPU 0000:17:00
[ 13.237272] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(0) = 0x0
[ 13.237278] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(1) = 0x0
[ 13.237283] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(2) = 0x0
[ 13.237288] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(3) = 0x0
[ 13.237296] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110040, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 13.237300] NVRM: kfspDumpDebugState_GB100: NV_PGSP_FALCON_MAILBOX0 = 0xbadf4100
[ 13.237307] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110044, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 13.237310] NVRM: kfspDumpDebugState_GB100: NV_PGSP_FALCON_MAILBOX1 = 0xbadf4100
[ 13.237317] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110804, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 13.237321] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(0) = 0xbadf4100
[ 13.237328] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110808, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 13.237331] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(1) = 0xbadf4100
[ 13.237338] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x11080c, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 13.237341] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(2) = 0xbadf4100
[ 13.237348] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110810, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 13.237351] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(3) = 0xbadf4100
[ 13.237439] NVRM: _kfspPrintCms2Log_GB100: CMS2 Log:
[ 13.237442] NVRM: nvDbgDumpBufferBytes: x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xa xb xc xd xe xf
[ 13.237446] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7600 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 13.237450] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7610 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 13.237454] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7620 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 13.237457] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7630 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 13.237460] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7640 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 13.237464] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7650 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 13.237467] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7660 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 13.237470] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7670 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 13.237473] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7680 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 13.237477] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7690 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 13.237480] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B76A0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 13.237484] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B76B0 00 00 00 00 00 00 00 00 00 00 00 00 13 00 00 00
[ 13.237491] NVRM: nvCheckOkFailedNoLog: Check failed: Call timed out [NV_ERR_TIMEOUT] (0x00000065) returned from kgspWaitForGfwBootOk_HAL(pGpu, pKernelGsp) @ kernel_gsp.c:3874
[ 13.237585] NVRM: RmInitAdapter: Cannot initialize GSP firmware RM
[ 13.238457] NVRM: GPU 0000:17:00.0: RmInitAdapter failed! (0x62:0x65:2015)
[ 13.241841] NVRM: GPU 0000:17:00.0: rm_init_adapter failed, device minor number 0
[ 18.479243] NVRM: GPU at PCI:0000:17:00: GPU-801383be-20e1-a842-5cc3-215ec26f58d3
[ 18.479256] NVRM: Xid (PCI:0000:17:00): 143, Error status 0x65 while polling for FSP boot complete, 0x13, 0x0, 0x0, 0x0, 0x0
[ 18.479280] NVRM: kfspDumpDebugState_GB100: FSP microcode v4.76
[ 18.479284] NVRM: kfspDumpDebugState_GB100: GPU 0000:17:00
[ 18.479289] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(0) = 0x0
[ 18.479294] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(1) = 0x0
[ 18.479299] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(2) = 0x0
[ 18.479304] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(3) = 0x0
[ 18.479312] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110040, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 18.479315] NVRM: kfspDumpDebugState_GB100: NV_PGSP_FALCON_MAILBOX0 = 0xbadf4100
[ 18.479321] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110044, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 18.479324] NVRM: kfspDumpDebugState_GB100: NV_PGSP_FALCON_MAILBOX1 = 0xbadf4100
[ 18.479331] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110804, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 18.479334] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(0) = 0xbadf4100
[ 18.479341] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110808, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 18.479343] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(1) = 0xbadf4100
[ 18.479350] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x11080c, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 18.479352] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(2) = 0xbadf4100
[ 18.479359] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110810, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 18.479361] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(3) = 0xbadf4100
[ 18.479443] NVRM: _kfspPrintCms2Log_GB100: CMS2 Log:
[ 18.479445] NVRM: nvDbgDumpBufferBytes: x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xa xb xc xd xe xf
[ 18.479449] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7450 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 18.479453] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7460 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 18.479456] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7470 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 18.479459] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7480 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 18.479462] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7490 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 18.479465] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B74A0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 18.479468] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B74B0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 18.479471] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B74C0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 18.479474] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B74D0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 18.479477] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B74E0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 18.479480] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B74F0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 18.479483] NVRM: nvDbgDumpBufferBytes: FF4DADCD029B7500 00 00 00 00 00 00 00 00 00 00 00 00 13 00 00 00
[ 18.479489] NVRM: nvCheckOkFailedNoLog: Check failed: Call timed out [NV_ERR_TIMEOUT] (0x00000065) returned from kgspWaitForGfwBootOk_HAL(pGpu, pKernelGsp) @ kernel_gsp.c:3874
[ 18.479565] NVRM: RmInitAdapter: Cannot initialize GSP firmware RM
[ 18.480244] NVRM: GPU 0000:17:00.0: RmInitAdapter failed! (0x62:0x65:2015)
[ 18.483696] NVRM: GPU 0000:17:00.0: rm_init_adapter failed, device minor number 0
[ 19.928851] rfkill: input handler disabled
[ 71.810683] NVRM: GPU at PCI:0000:17:00: GPU-801383be-20e1-a842-5cc3-215ec26f58d3
[ 71.810689] NVRM: Xid (PCI:0000:17:00): 143, Error status 0x65 while polling for FSP boot complete, 0x13, 0x56, 0x0, 0x0, 0x2
[ 71.810697] NVRM: kfspDumpDebugState_GB100: FSP microcode v4.76
[ 71.810698] NVRM: kfspDumpDebugState_GB100: GPU 0000:17:00
[ 71.810701] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(0) = 0x56
[ 71.810703] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(1) = 0x0
[ 71.810705] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(2) = 0x0
[ 71.810707] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(3) = 0x2
[ 71.810710] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110040, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 71.810711] NVRM: kfspDumpDebugState_GB100: NV_PGSP_FALCON_MAILBOX0 = 0xbadf4100
[ 71.810715] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110044, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 71.810716] NVRM: kfspDumpDebugState_GB100: NV_PGSP_FALCON_MAILBOX1 = 0xbadf4100
[ 71.810720] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110804, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 71.810720] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(0) = 0xbadf4100
[ 71.810724] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110808, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 71.810725] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(1) = 0xbadf4100
[ 71.810728] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x11080c, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 71.810729] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(2) = 0xbadf4100
[ 71.810732] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110810, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 71.810733] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(3) = 0xbadf4100
[ 71.810803] NVRM: _kfspPrintCms2Log_GB100: CMS2 Log:
[ 71.810803] NVRM: nvDbgDumpBufferBytes: x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xa xb xc xd xe xf
[ 71.810804] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B5A0 00 00 14 00 56 00 00 00 02 00 00 00 00 00 00 00
[ 71.810805] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B5B0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 71.810805] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B5C0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 71.810806] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B5D0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 71.810807] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B5E0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 71.810807] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B5F0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 71.810808] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B600 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 71.810809] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B610 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 71.810809] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B620 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 71.810810] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B630 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 71.810811] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B640 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 71.810811] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B650 00 00 00 00 00 00 00 00 00 00 00 00 13 00 00 00
[ 71.810813] NVRM: nvCheckOkFailedNoLog: Check failed: Call timed out [NV_ERR_TIMEOUT] (0x00000065) returned from kgspWaitForGfwBootOk_HAL(pGpu, pKernelGsp) @ kernel_gsp.c:3874
[ 71.810856] NVRM: RmInitAdapter: Cannot initialize GSP firmware RM
[ 71.811706] NVRM: GPU 0000:17:00.0: RmInitAdapter failed! (0x62:0x65:2015)
[ 71.812727] NVRM: GPU 0000:17:00.0: rm_init_adapter failed, device minor number 0
[ 72.805786] block dm-4: the capability attribute has been deprecated.
[ 76.895827] NVRM: GPU at PCI:0000:17:00: GPU-801383be-20e1-a842-5cc3-215ec26f58d3
[ 76.895832] NVRM: Xid (PCI:0000:17:00): 143, Error status 0x65 while polling for FSP boot complete, 0x13, 0x56, 0x0, 0x0, 0x2
[ 76.895840] NVRM: kfspDumpDebugState_GB100: FSP microcode v4.76
[ 76.895841] NVRM: kfspDumpDebugState_GB100: GPU 0000:17:00
[ 76.895845] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(0) = 0x56
[ 76.895847] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(1) = 0x0
[ 76.895849] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(2) = 0x0
[ 76.895851] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(3) = 0x2
[ 76.895855] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110040, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 76.895856] NVRM: kfspDumpDebugState_GB100: NV_PGSP_FALCON_MAILBOX0 = 0xbadf4100
[ 76.895861] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110044, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 76.895861] NVRM: kfspDumpDebugState_GB100: NV_PGSP_FALCON_MAILBOX1 = 0xbadf4100
[ 76.895866] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110804, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 76.895866] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(0) = 0xbadf4100
[ 76.895871] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110808, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 76.895871] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(1) = 0xbadf4100
[ 76.895875] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x11080c, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 76.895876] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(2) = 0xbadf4100
[ 76.895880] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110810, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 76.895880] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(3) = 0xbadf4100
[ 76.895951] NVRM: _kfspPrintCms2Log_GB100: CMS2 Log:
[ 76.895951] NVRM: nvDbgDumpBufferBytes: x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xa xb xc xd xe xf
[ 76.895953] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B6D0 00 00 14 00 56 00 00 00 02 00 00 00 00 00 00 00
[ 76.895954] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B6E0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 76.895954] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B6F0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 76.895955] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B700 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 76.895956] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B710 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 76.895957] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B720 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 76.895958] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B730 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 76.895959] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B740 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 76.895960] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B750 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 76.895961] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B760 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 76.895961] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B770 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 76.895962] NVRM: nvDbgDumpBufferBytes: FF4DADCD0366B780 00 00 00 00 00 00 00 00 00 00 00 00 13 00 00 00
[ 76.895965] NVRM: nvCheckOkFailedNoLog: Check failed: Call timed out [NV_ERR_TIMEOUT] (0x00000065) returned from kgspWaitForGfwBootOk_HAL(pGpu, pKernelGsp) @ kernel_gsp.c:3874
[ 76.896005] NVRM: RmInitAdapter: Cannot initialize GSP firmware RM
[ 76.896516] NVRM: GPU 0000:17:00.0: RmInitAdapter failed! (0x62:0x65:2015)
[ 76.897461] NVRM: GPU 0000:17:00.0: rm_init_adapter failed, device minor number 0
[ 76.897738] NVRM: nvAssertFailedNoLog: Assertion failed: rmapiLockIsOwner() && rmGpuLockIsOwner() @ conf_compute_api.c:78
[ 82.111977] NVRM: GPU at PCI:0000:17:00: GPU-801383be-20e1-a842-5cc3-215ec26f58d3
[ 82.111991] NVRM: Xid (PCI:0000:17:00): 143, Error status 0x65 while polling for FSP boot complete, 0x13, 0x56, 0x0, 0x0, 0x2
[ 82.112013] NVRM: kfspDumpDebugState_GB100: FSP microcode v4.76
[ 82.112018] NVRM: kfspDumpDebugState_GB100: GPU 0000:17:00
[ 82.112023] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(0) = 0x56
[ 82.112029] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(1) = 0x0
[ 82.112034] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(2) = 0x0
[ 82.112039] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(3) = 0x2
[ 82.112047] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110040, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 82.112050] NVRM: kfspDumpDebugState_GB100: NV_PGSP_FALCON_MAILBOX0 = 0xbadf4100
[ 82.112057] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110044, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 82.112060] NVRM: kfspDumpDebugState_GB100: NV_PGSP_FALCON_MAILBOX1 = 0xbadf4100
[ 82.112067] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110804, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 82.112071] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(0) = 0xbadf4100
[ 82.112078] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110808, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 82.112081] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(1) = 0xbadf4100
[ 82.112087] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x11080c, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 82.112090] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(2) = 0xbadf4100
[ 82.112097] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110810, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 82.112100] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(3) = 0xbadf4100
[ 82.112180] NVRM: _kfspPrintCms2Log_GB100: CMS2 Log:
[ 82.112182] NVRM: nvDbgDumpBufferBytes: x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xa xb xc xd xe xf
[ 82.112187] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB7B0 00 00 14 00 56 00 00 00 02 00 00 00 00 00 00 00
[ 82.112191] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB7C0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 82.112195] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB7D0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 82.112198] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB7E0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 82.112201] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB7F0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 82.112205] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB800 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 82.112208] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB810 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 82.112212] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB820 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 82.112215] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB830 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 82.112218] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB840 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 82.112222] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB850 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 82.112225] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB860 00 00 00 00 00 00 00 00 00 00 00 00 13 00 00 00
[ 82.112232] NVRM: nvCheckOkFailedNoLog: Check failed: Call timed out [NV_ERR_TIMEOUT] (0x00000065) returned from kgspWaitForGfwBootOk_HAL(pGpu, pKernelGsp) @ kernel_gsp.c:3874
[ 82.112308] NVRM: RmInitAdapter: Cannot initialize GSP firmware RM
[ 82.113234] NVRM: GPU 0000:17:00.0: RmInitAdapter failed! (0x62:0x65:2015)
[ 82.113967] NVRM: GPU 0000:17:00.0: rm_init_adapter failed, device minor number 0
[ 87.193089] NVRM: GPU at PCI:0000:17:00: GPU-801383be-20e1-a842-5cc3-215ec26f58d3
[ 87.193093] NVRM: Xid (PCI:0000:17:00): 143, Error status 0x65 while polling for FSP boot complete, 0x13, 0x56, 0x0, 0x0, 0x2
[ 87.193100] NVRM: kfspDumpDebugState_GB100: FSP microcode v4.76
[ 87.193101] NVRM: kfspDumpDebugState_GB100: GPU 0000:17:00
[ 87.193104] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(0) = 0x56
[ 87.193106] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(1) = 0x0
[ 87.193108] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(2) = 0x0
[ 87.193109] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(3) = 0x2
[ 87.193113] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110040, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 87.193114] NVRM: kfspDumpDebugState_GB100: NV_PGSP_FALCON_MAILBOX0 = 0xbadf4100
[ 87.193118] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110044, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 87.193119] NVRM: kfspDumpDebugState_GB100: NV_PGSP_FALCON_MAILBOX1 = 0xbadf4100
[ 87.193123] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110804, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 87.193123] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(0) = 0xbadf4100
[ 87.193127] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110808, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 87.193127] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(1) = 0xbadf4100
[ 87.193131] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x11080c, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 87.193131] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(2) = 0xbadf4100
[ 87.193135] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110810, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 87.193136] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(3) = 0xbadf4100
[ 87.193205] NVRM: _kfspPrintCms2Log_GB100: CMS2 Log:
[ 87.193206] NVRM: nvDbgDumpBufferBytes: x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xa xb xc xd xe xf
[ 87.193207] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB520 00 00 14 00 56 00 00 00 02 00 00 00 00 00 00 00
[ 87.193208] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB530 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 87.193208] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB540 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 87.193209] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB550 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 87.193210] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB560 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 87.193210] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB570 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 87.193211] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB580 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 87.193212] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB590 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 87.193212] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB5A0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 87.193213] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB5B0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 87.193214] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB5C0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 87.193214] NVRM: nvDbgDumpBufferBytes: FF4DADCD0E1FB5D0 00 00 00 00 00 00 00 00 00 00 00 00 13 00 00 00
[ 87.193216] NVRM: nvCheckOkFailedNoLog: Check failed: Call timed out [NV_ERR_TIMEOUT] (0x00000065) returned from kgspWaitForGfwBootOk_HAL(pGpu, pKernelGsp) @ kernel_gsp.c:3874
[ 87.193249] NVRM: RmInitAdapter: Cannot initialize GSP firmware RM
[ 87.193609] NVRM: GPU 0000:17:00.0: RmInitAdapter failed! (0x62:0x65:2015)
[ 87.194404] NVRM: GPU 0000:17:00.0: rm_init_adapter failed, device minor number 0
[ 87.194728] NVRM: nvAssertFailedNoLog: Assertion failed: rmapiLockIsOwner() && rmGpuLockIsOwner() @ conf_compute_api.c:78
[ 171.287386] NVRM: GPU at PCI:0000:17:00: GPU-801383be-20e1-a842-5cc3-215ec26f58d3
[ 171.287390] NVRM: Xid (PCI:0000:17:00): 143, Error status 0x65 while polling for FSP boot complete, 0x13, 0x56, 0x0, 0x0, 0x2
[ 171.287398] NVRM: kfspDumpDebugState_GB100: FSP microcode v4.76
[ 171.287399] NVRM: kfspDumpDebugState_GB100: GPU 0000:17:00
[ 171.287402] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(0) = 0x56
[ 171.287404] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(1) = 0x0
[ 171.287406] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(2) = 0x0
[ 171.287407] NVRM: kfspDumpDebugState_GB100: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(3) = 0x2
[ 171.287411] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110040, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 171.287412] NVRM: kfspDumpDebugState_GB100: NV_PGSP_FALCON_MAILBOX0 = 0xbadf4100
[ 171.287416] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110044, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 171.287416] NVRM: kfspDumpDebugState_GB100: NV_PGSP_FALCON_MAILBOX1 = 0xbadf4100
[ 171.287420] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110804, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 171.287420] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(0) = 0xbadf4100
[ 171.287424] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110808, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 171.287424] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(1) = 0xbadf4100
[ 171.287428] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x11080c, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 171.287428] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(2) = 0xbadf4100
[ 171.287432] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110810, regvalue: 0xbadf4100, error code: Unknown SYS_PRI_ERROR_CODE
[ 171.287433] NVRM: kfspDumpDebugState_GB100: NV_PGSP_MAILBOX(3) = 0xbadf4100
[ 171.287501] NVRM: _kfspPrintCms2Log_GB100: CMS2 Log:
[ 171.287501] NVRM: nvDbgDumpBufferBytes: x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xa xb xc xd xe xf
[ 171.287502] NVRM: nvDbgDumpBufferBytes: FF4DADCD00F37A58 00 00 14 00 56 00 00 00 02 00 00 00 00 00 00 00
[ 171.287503] NVRM: nvDbgDumpBufferBytes: FF4DADCD00F37A68 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 171.287503] NVRM: nvDbgDumpBufferBytes: FF4DADCD00F37A78 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 171.287504] NVRM: nvDbgDumpBufferBytes: FF4DADCD00F37A88 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 171.287505] NVRM: nvDbgDumpBufferBytes: FF4DADCD00F37A98 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 171.287505] NVRM: nvDbgDumpBufferBytes: FF4DADCD00F37AA8 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 171.287506] NVRM: nvDbgDumpBufferBytes: FF4DADCD00F37AB8 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 171.287507] NVRM: nvDbgDumpBufferBytes: FF4DADCD00F37AC8 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 171.287507] NVRM: nvDbgDumpBufferBytes: FF4DADCD00F37AD8 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 171.287508] NVRM: nvDbgDumpBufferBytes: FF4DADCD00F37AE8 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 171.287509] NVRM: nvDbgDumpBufferBytes: FF4DADCD00F37AF8 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 171.287509] NVRM: nvDbgDumpBufferBytes: FF4DADCD00F37B08 00 00 00 00 00 00 00 00 00 00 00 00 13 00 00 00
[ 171.287511] NVRM: nvCheckOkFailedNoLog: Check failed: Call timed out [NV_ERR_TIMEOUT] (0x00000065) returned from kgspWaitForGfwBootOk_HAL(pGpu, pKernelGsp) @ kernel_gsp.c:3874
[ 171.287552] NVRM: RmInitAdapter: Cannot initialize GSP firmware RM
[ 171.287949] NVRM: GPU 0000:17:00.0: RmInitAdapter failed! (0x62:0x65:2015)
[ 171.289450] NVRM: GPU 0000:17:00.0: rm_init_adapter failed, device minor number 0
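Since the same failure block repeats on every initialization attempt, here is a minimal sketch that condenses such a log to one line per Xid event. It is my own helper, not part of the NVIDIA driver or any official tool; the names `XID_RE`, `summarize_xids`, and `SAMPLE` are made up for illustration, and the regular expression only assumes the line layout seen in the log above.

```python
import re

# Assumed layout, taken from the log above:
# [ 13.237245] NVRM: Xid (PCI:0000:17:00): 143, Error status 0x65 while ...
XID_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+NVRM: Xid \((?P<dev>PCI:[0-9a-fA-F:.]+)\): "
    r"(?P<xid>\d+), (?P<msg>.*)"
)


def summarize_xids(dmesg_text):
    """Return one (timestamp, device, xid, message) tuple per Xid line."""
    events = []
    for line in dmesg_text.splitlines():
        m = XID_RE.search(line)
        if m:
            events.append(
                (float(m["ts"]), m["dev"], int(m["xid"]), m["msg"].strip())
            )
    return events


# Three lines copied from the log above; only two of them are Xid lines.
SAMPLE = """\
[ 13.237245] NVRM: Xid (PCI:0000:17:00): 143, Error status 0x65 while polling for FSP boot complete, 0x13, 0x0, 0x0, 0x0, 0x0
[ 13.237263] NVRM: kfspDumpDebugState_GB100: FSP microcode v4.76
[ 71.810689] NVRM: Xid (PCI:0000:17:00): 143, Error status 0x65 while polling for FSP boot complete, 0x13, 0x56, 0x0, 0x0, 0x2
"""

for ts, dev, xid, msg in summarize_xids(SAMPLE):
    print(f"{ts:10.3f}s  {dev}  Xid {xid}: {msg}")
```

To run it against a full log, replace `SAMPLE` with the saved output of `dmesg`. In the log above this makes it easier to see that the values after "FSP boot complete" change from `0x13, 0x0, 0x0, 0x0, 0x0` in the first two attempts to `0x13, 0x56, 0x0, 0x0, 0x2` in the later ones.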
Does anyone have an idea how to solve this problem? Any help is welcome.