Good afternoon
I have dual Quadro RTX-5000 cards and do a bit of GPU-based computation with these.
After a kernel update I have an issue getting my system to load the Nvidia driver correctly.
I went through all the prior postings about this issue and removed and reinstalled the 460.73.01 driver.
$ nvidia-detector
nvidia-driver-460
$ nvidia-smi
Mon May 24 13:28:17 2021
±----------------------------------------------------------------------------+
| NVIDIA-SMI 460.73.01 Driver Version: 460.73.01 CUDA Version: 11.2 |
|-------------------------------±---------------------±---------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 Quadro RTX 5000 Off | 00000000:AF:00.0 On | Off |
| 34% 36C P8 19W / 230W | 1MiB / 16124MiB | 0% Default |
| | | N/A |
±------------------------------±---------------------±---------------------+
| 1 Quadro RTX 5000 Off | 00000000:D8:00.0 Off | Off |
| 34% 32C P8 8W / 230W | 1MiB / 16125MiB | 0% Default |
| | | N/A |
±------------------------------±---------------------±---------------------+
±----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| No running processes found |
±----------------------------------------------------------------------------+
$ dkms status
nvidia, 460.73.01, 5.8.0-53-generic, x86_64: installed
$ $ uname -r
5.8.0-53-generic
$ sudo lshw -c video
*-display
description: VGA compatible controller
product: ASPEED Graphics Family
vendor: ASPEED Technology, Inc.
physical id: 0
bus info: pci@0000:04:00.0
version: 41
width: 32 bits
clock: 33MHz
capabilities: pm msi vga_controller cap_list
configuration: driver=ast latency=0
resources: irq:17 memory:99000000-99ffffff memory:9a000000-9a01ffff ioport:2000(size=128)
*-display
description: VGA compatible controller
product: TU104GL [Quadro RTX 5000]
vendor: NVIDIA Corporation
physical id: 0
bus info: pci@0000:af:00.0
version: a1
width: 64 bits
clock: 33MHz
capabilities: pm msi pciexpress vga_controller bus_master cap_list rom
configuration: driver=nvidia latency=0
resources: irq:444 memory:d3000000-d3ffffff memory:c0000000-cfffffff memory:d0000000-d1ffffff ioport:e000(size=128) memory:c0000-dffff
*-display
description: VGA compatible controller
product: TU104GL [Quadro RTX 5000]
vendor: NVIDIA Corporation
physical id: 0
bus info: pci@0000:d8:00.0
version: a1
width: 64 bits
clock: 33MHz
capabilities: pm msi pciexpress vga_controller bus_master cap_list rom
configuration: driver=nvidia latency=0
resources: irq:445 memory:fa000000-faffffff memory:e0000000-efffffff memory:f0000000-f1ffffff ioport:f000(size=128) memory:fb000000-fb07ffff
As far as I can tell it is present and loaded, but inactive as video after boot loader menu.
Any help would be appreciated.
I am sure to an X-windows expert it is a simple, but not to me.
Thanks
FK
nvidia-bug-report.log.gz (530.2 KB)