Not able to install nvidia driver on Ubuntu 20.04 with NVIDIA Corporation Device 249d (rev a1)

Hi,
i am having a trouble to install the nvidia driver in ubuntu 20.04. I have seen many post related to this problem but didnt get the solution yet.

I have installed all kind of driver from command line or from software and update with Additinal driver tab but no success.

Driver is getting installed successfully but when running a command as ‘nvidia-smi’

NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

other command which i executed ----

  • lspci | grep -i nvidia
    01:00.0 VGA compatible controller: NVIDIA Corporation Device 249d (rev a1)
    01:00.1 Audio device: NVIDIA Corporation Device 228b (rev a1)

  • sudo prime-select nvidia
    [sudo] password for abhishek:
    Info: the nvidia profile is already set

  • lspci | grep VGA
    00:02.0 VGA compatible controller: Intel Corporation Device 9a60 (rev 01)
    01:00.0 VGA compatible controller: NVIDIA Corporation Device 249d (rev a1)

  • sudo lshw -C display
    [sudo] password for abhishek:
    *-display UNCLAIMED
    description: VGA compatible controller
    product: NVIDIA Corporation
    vendor: NVIDIA Corporation
    physical id: 0
    bus info: pci@0000:01:00.0
    version: a1
    width: 64 bits
    clock: 33MHz
    capabilities: pm msi pciexpress vga_controller cap_list
    configuration: latency=0
    resources: iomemory:fffffffe0-fffffffdf iomemory:ffffffff0-fffffffef memory:6d000000-6dffffff memory:6000000000-61ffffffff memory:6200000000-6201ffffff ioport:4000(size=128) memory:6e080000-6e0fffff
    *-display
    description: VGA compatible controller
    product: Intel Corporation
    vendor: Intel Corporation
    physical id: 2
    bus info: pci@0000:00:02.0
    logical name: /dev/fb0
    version: 01
    width: 64 bits
    clock: 33MHz
    capabilities: pciexpress msi pm vga_controller bus_master cap_list fb
    configuration: depth=32 driver=i915 latency=0 mode=1920x1080 visual=truecolor xres=1920 yres=1080
    resources: iomemory:620-61f iomemory:400-3ff irq:183 memory:624c000000-624cffffff memory:4000000000-400fffffff ioport:5000(size=64) memory:c0000-dffff memory:4010000000-4016ffffff memory:4020000000-40ffffffff

Not sure why display unclaimed for NVIDIA corporation product.

-dkms status
nvidia, 495.46, 5.13.0-27-generic, x86_64: installed

  • dmesg
    [18313.488616] nvidia: probe of 0000:01:00.0 failed with error -1
    [18313.488629] NVRM: The NVIDIA probe routine failed for 1 device(s).
    [18313.488630] NVRM: None of the NVIDIA devices were initialized.
    [18313.488826] nvidia-nvlink: Unregistered the Nvlink Core, major device number 507
    [18313.874303] nvidia-nvlink: Nvlink Core is being initialized, major device number 507

[18313.875062] nvidia 0000:01:00.0: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[18313.875081] NVRM: The NVIDIA GPU 0000:01:00.0
NVRM: (PCI ID: 10de:249d) installed in this system has
NVRM: fallen off the bus and is not responding to commands.
[18313.875132] nvidia: probe of 0000:01:00.0 failed with error -1
[18313.875144] NVRM: The NVIDIA probe routine failed for 1 device(s).
[18313.875144] NVRM: None of the NVIDIA devices were initialized.
[18313.875234] nvidia-nvlink: Unregistered the Nvlink Core, major device number 507
[18314.136095] nvidia-nvlink: Nvlink Core is being initialized, major device number 507

[18314.136982] nvidia 0000:01:00.0: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none

From bios, i have already disabled the secure boot still things are not working.

I have attached my bug report. kindly help me out to resolve the issue, i would really appreciate it !!
Thank you in Advance!

nvidia-bug-report.log.gz (12.0 MB)

If you have bbswitch/bumblebee installed, please uninstall it.

Thanks for your Reply !!!

I have tried to uninstall both bbswitch/bumblebee but both are not install on my system

Below are the responses which i get after executing the removing command

  • sudo apt-get remove bumblebee
    [sudo] password for abhishek:
    Reading package lists… Done
    Building dependency tree
    Reading state information… Done
    Package ‘bumblebee’ is not installed, so not removed
    0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded.

-sudo apt-get remove bbswitch
Reading package lists… Done
Building dependency tree
Reading state information… Done
E: Unable to locate package bbswitch

There’s something really wrong with the gpu, please uninstall the driver, then apply a bios update. Afterwards, please run
sudo dmesg >dmesg.txt
sudo lspci -xxx -vv -d 10de:* >lspci.txt
and attach both files.

I have HP laptop (Zbook series) with processor 11th Gen Intel® Core™ i7-11800H @ 2.30GHz × 16. Currently graphics card is showing as Mesa Intel® UHD Graphics (TGL GT1).
Not sure how to update bios or whether its feasible to update it.
any pointer ?

Just use google
HP ZBook Studio g8 bios
which will lead you to the HP download site. Looks you need Windows to install the update, though.
Please uninstall the driver and create the dmesg output first.

Attached dmesg.txt after uninstalling the nvidia driver.

dmesg.txt (96.5 KB)

Looks like the gpu is turning off due to runtime-suspend and then refuses to turn on again. I guess you should really look into updating the bios first.

Since another user showed up with a similar notebook and the same problem
https://forums.developer.nvidia.com/t/fresh-ubuntu-20-04-install-nvidia-not-working/201534/3
I’m beginning to suspect the Ubuntu kernel upgrade to 5.13 broke things.