Newly installed GPU is not detected by nvidia-smi

Hi, I have successfully overclocked my 2 cards, thanks to @generix.
Yesterday I bought another one, but it does not show up when I run nvidia-smi:

Wed Jan 13 22:17:53 2021       
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 455.23.05    Driver Version: 455.23.05    CUDA Version: 11.1     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  GeForce RTX 3080    Off  | 00000000:0B:00.0 Off |                  N/A |
| 45%   58C    P5   126W / 340W |    199MiB / 10018MiB |      8%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
|   1  GeForce RTX 3080    Off  | 00000000:0C:00.0 Off |                  N/A |
|  0%   30C    P8     1W / 320W |      1MiB / 10018MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

The new NVIDIA card is detected by this command, though:

$ lspci | egrep 'VGA|3D|NVIDIA'
00:02.0 VGA compatible controller: Intel Corporation HD Graphics 630 (rev 04)
0b:00.0 VGA compatible controller: NVIDIA Corporation Device 2206 (rev a1)
0b:00.1 Audio device: NVIDIA Corporation Device 1aef (rev a1)
0c:00.0 VGA compatible controller: NVIDIA Corporation Device 2206 (rev a1)
0c:00.1 Audio device: NVIDIA Corporation Device 1aef (rev a1)
0d:00.0 VGA compatible controller: NVIDIA Corporation Device 2206 (rev a1)
0d:00.1 Audio device: NVIDIA Corporation Device 1aef (rev a1)
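
To make the mismatch explicit, the two listings can be compared by PCI address. The following is only a sketch: missing_gpus is a hypothetical helper name (not part of any NVIDIA tool), and it assumes the bus IDs were saved with nvidia-smi --query-gpu=pci.bus_id --format=csv,noheader and the lspci output was saved to a second file.

```shell
# Hypothetical helper: print NVIDIA VGA/3D controllers that lspci lists
# but that are absent from the nvidia-smi bus ID list.
#   $1: file with one bus ID per line, e.g. 00000000:0B:00.0
#   $2: file with plain lspci output
missing_gpus() {
    awk '
        # Normalise an address: lowercase and drop the PCI domain if present,
        # so 00000000:0B:00.0 and 0b:00.0 compare equal.
        function norm(id,   parts, n) {
            id = tolower(id)
            n = split(id, parts, ":")
            if (n == 3) id = parts[2] ":" parts[3]
            return id
        }
        # First file: remember every address the driver reports.
        FILENAME == ARGV[1] { seen[norm($1)] = 1; next }
        # Second file: report NVIDIA display controllers the driver did not list.
        /NVIDIA/ && /VGA|3D/ {
            id = norm($1)
            if (!(id in seen)) print id
        }
    ' "$1" "$2"
}

# Example usage on a machine with the driver installed:
#   nvidia-smi --query-gpu=pci.bus_id --format=csv,noheader > smi.txt
#   lspci > lspci.txt
#   missing_gpus smi.txt lspci.txt
```

With the two listings from this post, the only address the helper would report is 0d:00.0, the same device named in the NVRM messages.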

I found a related thread on the NVIDIA forums and another article about this, but I still cannot solve my problem.

$ dmesg |grep NVRM
[    1.659702] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  455.23.05  Fri Sep 18 19:37:12 UTC 2020
[    6.411328] NVRM: GPU 0000:0d:00.0: RmInitAdapter failed! (0x30:0x56:825)
[    6.411350] NVRM: GPU 0000:0d:00.0: rm_init_adapter failed, device minor number 2
[    8.424882] NVRM: GPU 0000:0d:00.0: RmInitAdapter failed! (0x30:0x56:825)
[    8.424934] NVRM: GPU 0000:0d:00.0: rm_init_adapter failed, device minor number 2
[   23.632110] NVRM: GPU 0000:0d:00.0: RmInitAdapter failed! (0x30:0x56:825)
[   23.632145] NVRM: GPU 0000:0d:00.0: rm_init_adapter failed, device minor number 2
[   23.824383] NVRM: GPU 0000:0d:00.0: RmInitAdapter failed! (0x30:0x56:825)
[   23.824408] NVRM: GPU 0000:0d:00.0: rm_init_adapter failed, device minor number 2
[   23.850310] NVRM: GPU 0000:0d:00.0: RmInitAdapter failed! (0x30:0x56:825)
[   23.850341] NVRM: GPU 0000:0d:00.0: rm_init_adapter failed, device minor number 2
[   23.890079] NVRM: GPU 0000:0d:00.0: RmInitAdapter failed! (0x30:0x56:825)
[   23.890106] NVRM: GPU 0000:0d:00.0: rm_init_adapter failed, device minor number 2
[   23.930356] NVRM: GPU 0000:0d:00.0: RmInitAdapter failed! (0x30:0x56:825)
[   23.930383] NVRM: GPU 0000:0d:00.0: rm_init_adapter failed, device minor number 2
[   23.970516] NVRM: GPU 0000:0d:00.0: RmInitAdapter failed! (0x30:0x56:825)
[   23.970542] NVRM: GPU 0000:0d:00.0: rm_init_adapter failed, device minor number 2
[   24.010572] NVRM: GPU 0000:0d:00.0: RmInitAdapter failed! (0x30:0x56:825)
[   24.010607] NVRM: GPU 0000:0d:00.0: rm_init_adapter failed, device minor number 2
[   24.050338] NVRM: GPU 0000:0d:00.0: RmInitAdapter failed! (0x30:0x56:825)
[   24.050362] NVRM: GPU 0000:0d:00.0: rm_init_adapter failed, device minor number 2
[   24.090219] NVRM: GPU 0000:0d:00.0: RmInitAdapter failed! (0x30:0x56:825)
[   24.090260] NVRM: GPU 0000:0d:00.0: rm_init_adapter failed, device minor number 2
[   24.129891] NVRM: GPU 0000:0d:00.0: RmInitAdapter failed! (0x30:0x56:825)
[   24.129938] NVRM: GPU 0000:0d:00.0: rm_init_adapter failed, device minor number 2
[  210.285969] NVRM: GPU 0000:0d:00.0: RmInitAdapter failed! (0x30:0x56:825)
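
Since the same two messages repeat many times, a per-device summary is easier to read. This is a minimal sketch that assumes the message format shown above; summarize_nvrm_failures is a hypothetical helper name, and it only counts lines without interpreting the error code.

```shell
# Hypothetical helper: read kernel log text on stdin and print, for each GPU,
# how many "RmInitAdapter failed!" lines were logged and with which code.
summarize_nvrm_failures() {
    grep 'RmInitAdapter failed' |
        # Keep only the PCI address and the code inside the parentheses.
        sed -E 's/.*NVRM: GPU ([0-9a-fA-F:.]+): RmInitAdapter failed! \(([^)]*)\).*/\1 \2/' |
        sort | uniq -c
}

# Example usage (reading dmesg may require root on some systems):
#   dmesg | summarize_nvrm_failures
```

Each output line holds a count, the PCI address, and the error code. If every line names the same address, the failure is confined to a single card.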

Any help is much appreciated.
Thank you in advance!

nvidia-bug-report.log.gz (1.6 MB)

The card looks broken. Please try:

  • reseating the card in its slot
  • checking and reseating the power connectors
  • checking whether it works in another slot, in another system, or with other power cables