Hi,
I bought a new laptop ThinkPad P16s. There is NVIDIA RTX A500 graphics card.
And I can’t make it work.
OS: Fresh intallation of Kubuntu 24.04
Driver version (now): 555.58.02 from ubuntu repository
$ uname -r
6.8.0-40-generic
$ nvidia-settings
ERROR: NVIDIA driver is not loaded
(nvidia-settings:8016): GLib-GObject-CRITICAL **: 13:08:38.574: g_object_unref: assertion 'G_IS_OBJECT (object)' failed
** (nvidia-settings:8016): CRITICAL **: 13:08:38.575: ctk_powermode_new: assertion '(ctrl_target != NULL) && (ctrl_target->h != NULL)' failed
ERROR: nvidia-settings could not find the registry key file or the X server is not accessible. This file should have been installed along with this driver at
/usr/share/nvidia/nvidia-application-profiles-key-documentation. The application profiles will continue to work, but values cannot be prepopulated or validated, and will not be
listed in the help text. Please see the README for possible values and descriptions.
** Message: 13:08:38.599: PRIME: Requires offloading
** Message: 13:08:38.599: PRIME: is it supported? yes
** Message: 13:08:38.639: PRIME: Usage: /usr/bin/prime-select nvidia|intel|on-demand|query
** Message: 13:08:38.639: PRIME: on-demand mode: "1"
** Message: 13:08:38.639: PRIME: is "on-demand" mode supported? yes
$ nvidia-smi
NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
$ lspci
00:00.0 Host bridge: Intel Corporation Raptor Lake-P 6p+8e cores Host Bridge/DRAM Controller
00:02.0 VGA compatible controller: Intel Corporation Raptor Lake-P [Iris Xe Graphics] (rev 04)
00:04.0 Signal processing controller: Intel Corporation Raptor Lake Dynamic Platform and Thermal Framework Processor Participant
00:06.0 PCI bridge: Intel Corporation Raptor Lake PCIe 4.0 Graphics Port
00:06.2 PCI bridge: Intel Corporation Device a73d
00:07.0 PCI bridge: Intel Corporation Raptor Lake-P Thunderbolt 4 PCI Express Root Port #0
00:07.2 PCI bridge: Intel Corporation Raptor Lake-P Thunderbolt 4 PCI Express Root Port #2
00:0d.0 USB controller: Intel Corporation Raptor Lake-P Thunderbolt 4 USB Controller
00:0d.2 USB controller: Intel Corporation Raptor Lake-P Thunderbolt 4 NHI #0
00:0d.3 USB controller: Intel Corporation Raptor Lake-P Thunderbolt 4 NHI #1
00:14.0 USB controller: Intel Corporation Alder Lake PCH USB 3.2 xHCI Host Controller (rev 01)
00:14.2 RAM memory: Intel Corporation Alder Lake PCH Shared SRAM (rev 01)
00:14.3 Network controller: Intel Corporation Raptor Lake PCH CNVi WiFi (rev 01)
00:15.0 Serial bus controller: Intel Corporation Alder Lake PCH Serial IO I2C Controller #0 (rev 01)
00:16.0 Communication controller: Intel Corporation Alder Lake PCH HECI Controller (rev 01)
00:16.3 Serial controller: Intel Corporation Alder Lake AMT SOL Redirection (rev 01)
00:1f.0 ISA bridge: Intel Corporation Raptor Lake LPC/eSPI Controller (rev 01)
00:1f.3 Audio device: Intel Corporation Raptor Lake-P/U/H cAVS (rev 01)
00:1f.4 SMBus: Intel Corporation Alder Lake PCH-P SMBus Host Controller (rev 01)
00:1f.5 Serial bus controller: Intel Corporation Alder Lake-P PCH SPI Controller (rev 01)
00:1f.6 Ethernet controller: Intel Corporation Ethernet Connection (23) I219-LM (rev 01)
02:00.0 Non-Volatile memory controller: KIOXIA Corporation NVMe SSD Controller XG8 (rev 01)
03:00.0 3D controller: NVIDIA Corporation GA107GLM [RTX A500 Laptop GPU] (rev a1)
$ sudo dmesg | grep 0000:03:00.0
[ 1.028175] pci 0000:03:00.0: [10de:25bb] type 00 class 0x030200 PCIe Endpoint
[ 1.028185] pci 0000:03:00.0: BAR 0 [mem 0xbd000000-0xbdffffff]
[ 1.028191] pci 0000:03:00.0: BAR 1 [mem 0x6000000000-0x60ffffffff 64bit pref]
[ 1.028198] pci 0000:03:00.0: BAR 3 [mem 0x6100000000-0x6101ffffff 64bit pref]
[ 1.028201] pci 0000:03:00.0: BAR 5 [io 0x2000-0x207f]
[ 1.028205] pci 0000:03:00.0: ROM [mem 0xfff80000-0xffffffff pref]
[ 1.028251] pci 0000:03:00.0: PME# supported from D0 D3hot
[ 1.028310] pci 0000:03:00.0: 63.012 Gb/s available PCIe bandwidth, limited by 16.0 GT/s PCIe x4 link at 0000:00:06.2 (capable of 252.048 Gb/s with 16.0 GT/s PCIe x16 link)
[ 1.551004] pci 0000:03:00.0: ROM [mem 0xfff80000-0xffffffff pref]: can't claim; no compatible bridge window
[ 1.551442] pci 0000:03:00.0: ROM [mem size 0x00080000 pref]: can't assign; no space
[ 1.551443] pci 0000:03:00.0: ROM [mem size 0x00080000 pref]: failed to assign
[ 1.553090] pci 0000:03:00.0: Adding to iommu group 14
[ 4.916863] nvidia 0000:03:00.0: enabling device (0000 -> 0003)
[ 7.821785] [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:03:00.0 on minor 0
[ 9.298368] bbswitch: Found discrete VGA device 0000:03:00.0: \_SB_.PC00.PEG2.PEGP
[ 9.298549] bbswitch: Succesfully loaded. Discrete card 0000:03:00.0 is on
I often see these messages in journals:
libvirtd[1875]: internal error: Unknown PCI header type ‘127’ for device ‘0000:03:00.0’
(udev-worker)[19138]: nvidia: Process ‘/sbin/modprobe -r nvidia-modeset’ failed with exit code 1.
and if selected intel (or on-demand) in prime-select:
kernel: nvidia 0000:03:00.0: Unable to change power state from D3cold to D0, device inaccessible
when selected nvidia, I don’t see these messages.
kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 509
kernel:
kernel: nvidia 0000:03:00.0: Unable to change power state from D3cold to D0, device inaccessible
kernel: nvidia 0000:03:00.0: Unable to change power state from D3cold to D0, device inaccessible
kernel: NVRM: The NVIDIA GPU 0000:03:00.0
NVRM: (PCI ID: 10de:25bb) installed in this system has
NVRM: fallen off the bus and is not responding to commands.
kernel: nvidia: probe of 0000:03:00.0 failed with error -1
kernel: NVRM: The NVIDIA probe routine failed for 1 device(s).
kernel: NVRM: None of the NVIDIA devices were initialized.
kernel: nvidia-nvlink: Unregistered Nvlink Core, major device number 509
systemd[1]: bumblebeed.service: Deactivated successfully.
systemd[1]: Stopped bumblebeed.service - Bumblebee C Daemon.
(udev-worker)[30239]: nvidia: Process '/sbin/modprobe nvidia-modeset' failed with exit code 1.
kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 509
What I tried:
- add ibt=off option
prime-select intel
prime-select nvidia
reboot
- purge all nvidia packets and install again
- install different driver versions: 535, 540, 550, 555, 560(cuda)
- ubuntu-drivers autoinstall
- apt install --fix-broken
dkms remove -m nvidia/<version>
dkms install -m nvidia/<version>
- enable and disable Secure Boot in UEFI. And:
sudo mokutil --import /var/lib/shim-signed/mok/MOK.der
For some reason, I can’t upload nvidia-bug-report. Strange…
Could you help me to find solution?
Best regards,