I used the following commands to install the NVIDIA driver:
ubuntu-drivers devices
sudo ubuntu-drivers autoinstall
But I still get an error when I run nvidia-smi:
NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
The bug report is attached. Thanks in advance.
nvidia-bug-report.log.gz (2.0 MB)
Welcome to the NVIDIA developer forums @haoyue77.
Reading the bug-report log I do not really know where to start. It looks like a complete mismatch between Hardware, BIOS settings, kernel version and driver version. I recommend starting from scratch.
- Clean your device
- Reset your BIOS to default settings
- Install Ubuntu
- Download the GPU driver for the GeForce GTX 1050 Ti Mobile (64-bit Linux) from the Official Drivers | NVIDIA page
- Install the driver following the included instructions exactly
Good luck!
Hi @MarkusHoHo,
Thanks for your reply. I have reinstalled Ubuntu 22.04 and installed NVIDIA driver 535 through Software & Updates. I also double-checked the driver version on the official website and it’s correct.
But I still get the same problem.
nvidia-bug-report.log (1.1 MB)
kind regards
NVRM: This is a 64-bit BAR mapped above 4GB by the system
Jan 18 13:54:28 hy-ThinkPad-S5-2nd-Gen kernel: [ 15.162264] NVRM: BIOS or the Linux kernel, but the PCI bridge
Jan 18 13:54:28 hy-ThinkPad-S5-2nd-Gen kernel: [ 15.162264] NVRM: immediately upstream of this GPU does not define
Jan 18 13:54:28 hy-ThinkPad-S5-2nd-Gen kernel: [ 15.162264] NVRM: a matching prefetchable memory window.
Jan 18 13:54:28 hy-ThinkPad-S5-2nd-Gen kernel: [ 15.162267] NVRM: This may be due to a known Linux kernel bug. Please
Jan 18 13:54:28 hy-ThinkPad-S5-2nd-Gen kernel: [ 15.162267] NVRM: see the README section on 64-bit BARs for additional
Jan 18 13:54:28 hy-ThinkPad-S5-2nd-Gen kernel: [ 15.162267] NVRM: information.
Jan 18 13:54:28 hy-ThinkPad-S5-2nd-Gen kernel: [ 15.162286] NVRM: The NVIDIA probe routine failed for 1 device(s).
Jan 18 13:54:28 hy-ThinkPad-S5-2nd-Gen kernel: [ 15.162287] NVRM: None of the NVIDIA devices were initialized.
Fix your BIOS settings. You probably have a setting related to Resizable BAR enabled, but this GPU does not support it.
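For anyone who wants to check their own machine for the NVRM messages quoted above, here is a minimal sketch. The function name `nvrm_lines` is made up for illustration, and it assumes the kernel log is available through `dmesg` or a saved log file.

```shell
# nvrm_lines: print only the NVIDIA kernel-module (NVRM) messages
# from whatever log text arrives on stdin.
nvrm_lines() {
  grep 'NVRM:'
}

# Example usage (reading the kernel log may need sudo):
#   sudo dmesg | nvrm_lines
#   nvrm_lines < /var/log/kern.log
```

If this prints the "64-bit BAR mapped above 4GB" message, the driver is refusing to initialize the GPU for the same reason as in this thread.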
I turned on CSM in the BIOS and the Resizable BAR problem was resolved. A new bug report is attached.
Update:
I found I was wrong. The Resizable BAR problem is still there. I looked through the BIOS carefully and there is no setting related to it.
nvidia-bug-report.log (2.6 MB)
So I found the mentioned README entry with an older driver:
64-Bit BARs (Base Address Registers)
Starting with native PCI Express GPUs, NVIDIA’s GPUs will advertise a 64-bit BAR capability (a Base Address Register stores the location of a PCI I/O region, such as registers or a frame buffer). This means that the GPU’s PCI I/O regions (registers and frame buffer) can be placed above the 32-bit address space (the first 4 gigabytes of memory).
The decision of where the BAR is placed is made by the system BIOS at boot time. If the BIOS supports 64-bit BARs, then the NVIDIA PCI I/O regions may be placed above the 32-bit address space. If the BIOS does not support this feature, then our PCI I/O regions will be placed within the 32-bit address space as they have always been.
Unfortunately, some Linux kernels (such as 2.6.11.x) do not understand or support 64-bit BARs. If the BIOS does place any NVIDIA PCI I/O regions above the 32-bit address space, such kernels will reject the BAR and the NVIDIA driver will not work.
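To see where the BIOS actually placed the GPU's BARs, the memory regions reported by `lspci -v` can be compared against the 4 GiB boundary. This is only a rough sketch: the helper name `bars_above_4g` is hypothetical, and it assumes the usual `Memory at <hex address> (...)` lines that `lspci -v` prints.

```shell
# bars_above_4g: read `lspci -v` style text on stdin and print every
# "Memory at <hex>" region whose address is at or above 4 GiB (0x100000000).
bars_above_4g() {
  while read -r w1 w2 addr rest; do
    if [ "$w1" = "Memory" ] && [ "$w2" = "at" ]; then
      # Skip entries without a plain hex address (e.g. "<ignored>").
      case $addr in
        ''|*[!0-9a-fA-F]*) continue ;;
      esac
      # Hex constants are accepted in shell arithmetic.
      if [ "$((0x$addr))" -ge 4294967296 ]; then
        echo "$addr $rest"
      fi
    fi
  done
}

# Example usage (10de is NVIDIA's PCI vendor ID):
#   lspci -v -d 10de: | bars_above_4g
```

Any line it prints is a region mapped above the 32-bit address space, which is the situation the README text describes.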
My interpretation is that Ubuntu 22.04 and its newer kernel reintroduce the issue. I don’t think there is much to do besides downgrading to an earlier version of Ubuntu.