Ubuntu 20.04, nvidia-smi has failed. Reinstalling drivers has not solved the issue, GTX 1650

Hello,

While trying to use tensorflow-gpu, I found that it was not detecting CUDA on my computer. I have a Dell XPS 15 laptop running Ubuntu Budgie 20.04. I ran nvidia-smi and got the following:

NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running

I purged the NVIDIA drivers and reinstalled them using:

sudo apt install nvidia-driver-440

I rebooted, ran nvidia-smi again, and got exactly the same error. I have read several other discussions, but I am still unsure why this is happening.

Here’s the output of inxi -G:

Graphics:
  Device-1: Intel UHD Graphics 630 driver: i915 v: kernel 
  Device-2: NVIDIA TU117M [GeForce GTX 1650 Mobile / Max-Q] driver: N/A 
  Display: x11 server: X.Org 1.20.8 driver: modesetting unloaded: fbdev,vesa 
  resolution: 1920x1080~60Hz 
  OpenGL: renderer: Mesa Intel UHD Graphics 630 (CFL GT2) v: 4.6 Mesa 20.0.8 

Running lsmod | grep nvidia prints nothing.
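
For anyone who wants to script that check, here is a minimal shell sketch of the same test. It assumes the usual lsmod layout, with the module name in the first column. The name nvidia_module_loaded is made up for illustration and is not part of any NVIDIA or Ubuntu tool.

```shell
#!/bin/sh
# Hypothetical helper: succeed if lsmod-style text on stdin lists any module
# whose name starts with "nvidia" (nvidia, nvidia_drm, nvidia_uvm, ...).
nvidia_module_loaded() {
    awk '
        # The module name is the first whitespace-separated column.
        $1 ~ /^nvidia/ { found = 1 }
        END { exit (found ? 0 : 1) }
    '
}

# Usage on a real system:
#   lsmod | nvidia_module_loaded && echo "module loaded" || echo "module not loaded"
```

An empty result, as in my case, means no NVIDIA kernel module is loaded at all.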

lshw -class display yields:

  *-display UNCLAIMED       
       description: 3D controller
       product: TU117M [GeForce GTX 1650 Mobile / Max-Q]
       vendor: NVIDIA Corporation
       physical id: 0
       bus info: pci@0000:01:00.0
       version: a1
       width: 64 bits
       clock: 33MHz
       capabilities: pm msi pciexpress cap_list
       configuration: latency=0
       resources: memory:ec000000-ecffffff memory:c0000000-cfffffff memory:d0000000-d1ffffff ioport:3000(size=128) memory:ed000000-ed07ffff
  *-display
       description: VGA compatible controller
       product: UHD Graphics 630 (Mobile)
       vendor: Intel Corporation
       physical id: 2
       bus info: pci@0000:00:02.0
       version: 00
       width: 64 bits
       clock: 33MHz
       capabilities: pciexpress msi pm vga_controller bus_master cap_list rom
       configuration: driver=i915 latency=0
       resources: irq:178 memory:eb000000-ebffffff memory:80000000-8fffffff ioport:4000(size=64) memory:c0000-dffff
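
The UNCLAIMED entry means that no kernel driver is bound to the NVIDIA device, which matches the driver: N/A line from inxi. Below is a rough sketch for pulling the same fact out of lspci -k. It assumes the usual output format, where each unindented device line is followed by indented detail lines such as "Kernel driver in use: ...". The name nvidia_bound_driver is a hypothetical helper, not an existing command.

```shell
#!/bin/sh
# Hypothetical helper: print the kernel driver bound to each NVIDIA device,
# reading `lspci -k` style text on stdin. Prints nothing if no driver is bound.
nvidia_bound_driver() {
    awk '
        # Device lines are not indented; remember whether this one is NVIDIA.
        /^[^ \t]/ { is_nvidia = ($0 ~ /NVIDIA/) }
        # Indented detail line that names the bound driver.
        is_nvidia && /Kernel driver in use:/ {
            sub(/^.*Kernel driver in use:[ \t]*/, "")
            print
        }
    '
}

# Usage on a real system:
#   lspci -k | nvidia_bound_driver
```

No output from this would be consistent with what lshw and inxi show above.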

I’d deeply appreciate some help.

Hi, for my 1650 I had to install newer drivers before nvidia-smi showed anything.
You can try the script I wrote in my post "Notebook XMG CORE 15 AMD with GTX1650Ti and Ubuntu 20.0.4 NVIDIA 450.66".
Also remember to add your user to the video group:
sudo usermod -a -G video "$USER"
I have made a bit more progress but still have a problem with 3D acceleration: stay tuned :P
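
To confirm that the group change took effect, a small sketch like this may help. The name has_group is a hypothetical helper; it only tests a space-separated group list such as the one id -nG prints. Keep in mind that a new group membership normally shows up in a session only after logging out and back in.

```shell
#!/bin/sh
# Hypothetical helper: succeed if group $1 appears in the space-separated
# group list $2 (for example the output of `id -nG`).
has_group() {
    # One group per line, then an exact, fixed-string, whole-line match.
    printf '%s\n' "$2" | tr ' ' '\n' | grep -qxF -- "$1"
}

# Usage on a real system:
#   has_group video "$(id -nG)" && echo "in video group" || echo "not in video group"
```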


Hi Luca,

Thank you for your suggestions! I went through your post, and even though I installed the driver and blacklisted nouveau, I run into an issue after the line

The terminal reports that the file does not exist, and the command after it fails as well.

Even running nvidia-settings yields:

ERROR: NVIDIA driver is not loaded


ERROR: Unable to load info from any available system


(nvidia-settings:4661): GLib-GObject-CRITICAL **: 10:06:23.039: g_object_unref: assertion 'G_IS_OBJECT (object)' failed
** Message: 10:06:23.041: PRIME: Requires offloading
** Message: 10:06:23.042: PRIME: is it supported? yes
** Message: 10:06:23.065: PRIME: Usage: /usr/bin/prime-select nvidia|intel|on-demand|query
** Message: 10:06:23.065: PRIME: on-demand mode: "1"
** Message: 10:06:23.065: PRIME: is "on-demand" mode supported? yes

And nvidia-smi still gives the same output as before. Thank you for your help, though!

On my clean Ubuntu 20.04 install I had that file.
This is my full 10-nvidia.conf:

Section "OutputClass"
    Identifier "nvidia"
    MatchDriver "nvidia-drm"
    Driver "nvidia"
    Option "AllowEmptyInitialConfiguration"
    ModulePath "/usr/lib/x86_64-linux-gnu/nvidia/xorg"
EndSection

Section "ServerFlags"
    Option "AllowIndirectGLX" "on"
    Option "IndirectGLX" "on"
EndSection

Thanks for the reply!

So, do you reckon I should add those lines as well?

I found the solution to my problem.
My main graphics card, an AMD Renoir, is not yet supported by the stock Ubuntu 20.04 kernel.
After upgrading to a kernel newer than 5.6 and making two extra configuration changes in the X11 files, it now works as expected on both the internal and external monitors!
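
To check whether a machine already runs a new enough kernel, here is a small sketch. The name version_ge is a hypothetical helper that relies on sort -V from GNU coreutils, and the 5.6 threshold is simply the figure from my case, so treat it as an assumption for other hardware.

```shell
#!/bin/sh
# Hypothetical helper: succeed if version $1 is greater than or equal to $2.
# Relies on version-aware sorting (sort -V, GNU coreutils).
version_ge() {
    # If $2 sorts first (or both are equal), then $1 >= $2.
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Usage on a real system:
#   version_ge "$(uname -r)" 5.6 && echo "kernel is 5.6 or newer" || echo "kernel is older than 5.6"
```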
