OS: Centos 7.9
GPU:8 pieces 4090 card
Jan 20 00:19:04 gpu-4090-dev024 kernel: pciehp 0000:4c:01.0:pcie204: Slot(2-1): Card not present
Jan 20 00:19:04 gpu-4090-dev024 kernel: pciehp 0000:4c:01.0:pcie204: Slot(2-1): Link Down
Jan 20 00:19:04 gpu-4090-dev024 kernel: NVRM: GPU at PCI:0000:4e:00: GPU-6d34b5e2-a686-f21c-83b7-3b36cb566060
Jan 20 00:19:04 gpu-4090-dev024 kernel: NVRM: Xid (PCI:0000:4e:00): 79, pid=‘’, name=, GPU has fallen off the bus.
Jan 20 00:19:04 gpu-4090-dev024 kernel: NVRM: GPU 0000:4e:00.0: GPU has fallen off the bus.
Jan 20 00:19:04 gpu-4090-dev024 kernel: NVRM: A GPU crash dump has been created. If possible, please run#012NVRM: nvidia-bug-report.sh as root to collect this data before#012NVRM: the NVIDIA kernel module is unloaded.
nvidia-bug-report.log.gz (1.6 MB)