Still have a problem getting NVIDIA GPU driver loaded w/Ubuntu OS

I’m using an Asus TUF705DU gaming laptop, reportedly equipped with an ‘AMD Ryzen 7 3750H with Radeon Vega Mobile Gfx’, and for a while now I’ve been trying to get the NVIDIA GPU’s driver working under the versions of Ubuntu installed in various partitions on the box. I’ve gotten a handle on all sorts of system commands for verifying the results of these attempts, but I haven’t succeeded in actually completing the task. I’ve included many of the investigative commands I’ve used, and attached the result file (2021-04-15-nvidia-bug-report_log.gz) from the ‘nvidia-bug-report.sh’ run I did yesterday.
From the following, it appears that ‘/usr/bin/nvidia-persistenced’ is not initializing correctly, so the ‘NVIDIA Persistence Daemon’ never reaches a stable running state and cannot provide the services that depend on it.

odoncaoa@tufFX705du[10]$ showBootLog [sysFPN="/var/log/boot.log]
[sudo] password for odoncaoa:
------------ Fri Apr 16 11:44:06 EDT 2021 ------------
/dev/nvme0n1p2: clean, 1070675/8282112 files, 23723968/33124864 blocks

[FAILED] Failed to start NVIDIA Persistence Daemon.
See 'systemctl status nvidia-persistenced.service' for details.

[FAILED] Failed to start NVIDIA Persistence Daemon.
See 'systemctl status nvidia-persistenced.service' for details.

odoncaoa@tufFX705du[11]$ systemctl status nvidia-persistenced.service
● nvidia-persistenced.service - NVIDIA Persistence Daemon
Loaded: loaded (/lib/systemd/system/nvidia-persistenced.service; enabled; vendor preset: enabled)
Active: failed (Result: exit-code) since Fri 2021-04-16 11:44:07 EDT; 2h 52min ago
Process: 1347 ExecStart=/usr/bin/nvidia-persistenced --verbose (code=exited, status=1/FAILURE)
Process: 1351 ExecStopPost=/bin/rm -rf /var/run/nvidia-persistenced/* (code=exited, status=0/SUCCESS)

Apr 16 11:44:07 tufFX705du systemd[1]: nvidia-persistenced.service: Failed with result 'exit-code'.
Apr 16 11:44:07 tufFX705du systemd[1]: Failed to start NVIDIA Persistence Daemon.
Apr 16 11:44:07 tufFX705du systemd[1]: nvidia-persistenced.service: Scheduled restart job, restart counter is at 5.
Apr 16 11:44:07 tufFX705du systemd[1]: Stopped NVIDIA Persistence Daemon.
Apr 16 11:44:07 tufFX705du systemd[1]: nvidia-persistenced.service: Start request repeated too quickly.
Apr 16 11:44:07 tufFX705du systemd[1]: nvidia-persistenced.service: Failed with result 'exit-code'.
Apr 16 11:44:07 tufFX705du systemd[1]: Failed to start NVIDIA Persistence Daemon.

/home/odoncaoa 2021-04-16 14:36
odoncaoa@tufFX705du[12]$ sudo ls -l /lib/systemd/system/nvidia-persistenced.service
-rw-r--r-- 1 root root 323 Dec 29 17:21 /lib/systemd/system/nvidia-persistenced.service

/home/odoncaoa 2021-04-16 14:37
odoncaoa@tufFX705du[13]$ sudo file /lib/systemd/system/nvidia-persistenced.service
/lib/systemd/system/nvidia-persistenced.service: ASCII text

odoncaoa@tufFX705du[21]$ sudo cat /lib/systemd/system/nvidia-persistenced.service
[sudo] password for odoncaoa:
[Unit]
Description=NVIDIA Persistence Daemon
After=syslog.target

[Service]
Type=forking
PIDFile=/var/run/nvidia-persistenced/nvidia-persistenced.pid
Restart=always
ExecStart=/usr/bin/nvidia-persistenced --verbose
ExecStopPost=/bin/rm -rf /var/run/nvidia-persistenced/*
TimeoutSec=300

[Install]
WantedBy=multi-user.target

odoncaoa@tufFX705du[22]$ lsmod | grep -i 'nvidia'
i2c_nvidia_gpu 16384 0
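
Note that `grep -i` matches any module whose name merely contains the string, so the single `i2c_nvidia_gpu` hit above is not the graphics driver itself. Below is a minimal sketch of a stricter check, assuming the proprietary driver's core kernel module is named exactly `nvidia`. The helper name `nvidia_core_loaded` is made up for illustration and is not part of any NVIDIA or Ubuntu tool.

```shell
#!/bin/sh
# Hypothetical helper: reads `lsmod`-style text on stdin and succeeds only
# if a module named exactly "nvidia" appears in the first column.
# A name such as "i2c_nvidia_gpu" does not count.
nvidia_core_loaded() {
    awk '$1 == "nvidia" { found = 1 } END { exit !found }'
}

# Usage on a live system:
#   lsmod | nvidia_core_loaded && echo "nvidia module loaded" \
#                              || echo "nvidia module NOT loaded"
```

If this reports that the module is not loaded, that would be consistent with the persistence daemon failing, since the daemon has no driver to talk to.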

odoncaoa@tufFX705du[23]$ lsmod | grep -i 'nouveau'

odoncaoa@tufFX705du[24]$ sudo lshw -numeric -C display
*-display UNCLAIMED
description: VGA compatible controller
product: TU116M [GeForce GTX 1660 Ti Mobile] [10DE:2191]
vendor: NVIDIA Corporation [10DE]
physical id: 0
bus info: pci@0000:01:00.0
version: a1
width: 64 bits
clock: 33MHz
capabilities: pm msi pciexpress vga_controller cap_list
configuration: latency=0
resources: memory:f6000000-f6ffffff memory:c0000000-cfffffff memory:d0000000-d1ffffff ioport:f000(size=128) memory:f7000000-f707ffff
*-display
description: VGA compatible controller
product: Picasso [1002:15D8]
vendor: Advanced Micro Devices, Inc. [AMD/ATI] [1002]
physical id: 0
bus info: pci@0000:05:00.0
version: c1
width: 64 bits
clock: 33MHz
capabilities: pm pciexpress msi msix vga_controller bus_master cap_list
configuration: driver=amdgpu latency=0
resources: irq:61 memory:e0000000-efffffff memory:f0000000-f01fffff ioport:c000(size=256) memory:f7500000-f757ffff
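
The `UNCLAIMED` tag on the first entry, together with the missing `driver=` field in its configuration line, suggests that no kernel driver has bound to the NVIDIA device, whereas the Picasso entry shows `driver=amdgpu`. As a rough sketch, the unclaimed devices can be pulled out of this output automatically. The function name `unclaimed_displays` is hypothetical, and the parsing assumes the `lshw -C display` layout shown above.

```shell
#!/bin/sh
# Hypothetical helper: reads `lshw -C display` text on stdin and prints the
# product string of every display section whose header is marked UNCLAIMED.
unclaimed_displays() {
    awk '
        # A new "*-display" header starts a section; remember its state.
        /^[ \t]*\*-display/ { unclaimed = ($0 ~ /UNCLAIMED/) }
        # Inside an unclaimed section, print the product without its label.
        unclaimed && /^[ \t]*product:/ {
            sub(/^[ \t]*product:[ \t]*/, "")
            print
        }
    '
}

# Usage on a live system:
#   sudo lshw -numeric -C display | unclaimed_displays
```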

odoncaoa@tufFX705du[5]$ listCompHW
2.4G Wireless Mouse
AMD Ryzen 7 3750H with Radeon Vega Mobile Gfx
Bluetooth Radio
Extreme SSD
Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge
Family 17h (Models 10h-1fh) HD Audio Controller
Family 17h (Models 10h-1fh) Platform Security Processor
FCH LPC Bridge
FCH SATA Controller [AHCI mode]
FCH SMBus Controller
FX705DU
HD-GDU3
HMA81GS6JJR8N-VK
Picasso
PnP device PNP0303
PnP device PNP0b00
PnP device PNP0c01
PnP device PNP0c02
Raven/Raven2 Device 24: Function 0
Raven/Raven2 Device 24: Function 1
Raven/Raven2 Device 24: Function 2
Raven/Raven2 Device 24: Function 3
Raven/Raven2 Device 24: Function 4
Raven/Raven2 Device 24: Function 5
Raven/Raven2 Device 24: Function 6
Raven/Raven2 Device 24: Function 7
Raven/Raven2 Internal PCIe GPP Bridge 0 to Bus A
Raven/Raven2 Internal PCIe GPP Bridge 0 to Bus B
Raven/Raven2 IOMMU
Raven/Raven2 PCIe GPP Bridge [6:0]
Raven/Raven2 Root Complex
Raven USB 3.1
RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller
RTL8822BE 802.11a/b/g/n/ac WiFi adapter
SES Device
TOSHIBA MQ04ABF1
TU116 High Definition Audio Controller
TU116M [GeForce GTX 1660 Ti Mobile]
TU116 USB 3.1 Host Controller
TU116 USB Type-C UCSI Controller
TUF Gaming FX705DU_TUF705DU
USB2.0 HD UVC WebCam
USB2.0 Hub
WD Black 2018/PC SN520 NVMe SSD
WDC PC SN520 SDAPNUW-256G-1002
xHCI Host Controller

odoncaoa@tufFX705du[13]$ glxinfo -B
name of display: :0
display: :0 screen: 0
direct rendering: Yes
Extended renderer info (GLX_MESA_query_renderer):
Vendor: X.Org (0x1002)
Device: AMD RAVEN (DRM 3.35.0, 5.4.0-47-generic, LLVM 11.0.0) (0x15d8)
Version: 20.2.6
Accelerated: yes
Video memory: 2048MB
Unified memory: no
Preferred profile: core (0x1)
Max core profile version: 4.6
Max compat profile version: 4.6
Max GLES1 profile version: 1.1
Max GLES[23] profile version: 3.2
Memory info (GL_ATI_meminfo):
VBO free memory - total: 1743 MB, largest block: 1743 MB
VBO free aux. memory - total: 3025 MB, largest block: 3025 MB
Texture free memory - total: 1743 MB, largest block: 1743 MB
Texture free aux. memory - total: 3025 MB, largest block: 3025 MB
Renderbuffer free memory - total: 1743 MB, largest block: 1743 MB
Renderbuffer free aux. memory - total: 3025 MB, largest block: 3025 MB
Memory info (GL_NVX_gpu_memory_info):
Dedicated video memory: 2048 MB
Total available memory: 5120 MB
Currently available dedicated video memory: 1743 MB
OpenGL vendor string: X.Org
OpenGL renderer string: AMD RAVEN (DRM 3.35.0, 5.4.0-47-generic, LLVM 11.0.0)
OpenGL core profile version string: 4.6 (Core Profile) Mesa 20.2.6
OpenGL core profile shading language version string: 4.60
OpenGL core profile context flags: (none)
OpenGL core profile profile mask: core profile

OpenGL version string: 4.6 (Compatibility Profile) Mesa 20.2.6
OpenGL shading language version string: 4.60
OpenGL context flags: (none)
OpenGL profile mask: compatibility profile

OpenGL ES profile version string: OpenGL ES 3.2 Mesa 20.2.6
OpenGL ES profile shading language version string: OpenGL ES GLSL ES 3.20

odoncaoa@tufFX705du[38]$ showInstPkgs | grep -i 'NVidia'
libnvidia-cfg1-460:amd64 install
libnvidia-common-460 install
libnvidia-compute-460:amd64 install
libnvidia-decode-460:amd64 install
libnvidia-encode-460:amd64 install
libnvidia-extra-460:amd64 install
libnvidia-fbc1-460:amd64 install
libnvidia-gl-460:amd64 install
libnvidia-ifr1-460:amd64 install
nvidia-compute-utils-460 install
nvidia-dkms-460 install
nvidia-driver-460 install
nvidia-kernel-common-460 install
nvidia-kernel-source-460 install
nvidia-modprobe install
nvidia-prime install
nvidia-settings install
nvidia-utils-460 install
xserver-xorg-video-nvidia-460 install

odoncaoa@tufFX705du[13]$ lspci
00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Root Complex
00:00.2 IOMMU: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 IOMMU
00:01.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge
00:01.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 PCIe GPP Bridge [6:0]
00:01.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 PCIe GPP Bridge [6:0]
00:01.3 PCI bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 PCIe GPP Bridge [6:0]
00:01.7 PCI bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 PCIe GPP Bridge [6:0]
00:08.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge
00:08.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Internal PCIe GPP Bridge 0 to Bus A
00:08.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Internal PCIe GPP Bridge 0 to Bus B
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller (rev 61)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge (rev 51)
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 1
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 2
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 3
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 4
00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 5
00:18.6 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 6
00:18.7 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 7
01:00.0 VGA compatible controller: NVIDIA Corporation TU116M [GeForce GTX 1660 Ti Mobile] (rev a1)
01:00.1 Audio device: NVIDIA Corporation TU116 High Definition Audio Controller (rev a1)
01:00.2 USB controller: NVIDIA Corporation TU116 USB 3.1 Host Controller (rev a1)
01:00.3 Serial bus controller [0c80]: NVIDIA Corporation TU116 USB Type-C UCSI Controller (rev a1)
02:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 15)
03:00.0 Non-Volatile memory controller: Sandisk Corp WD Black 2018/PC SN520 NVMe SSD (rev 01)
04:00.0 Network controller: Realtek Semiconductor Co., Ltd. RTL8822BE 802.11a/b/g/n/ac WiFi adapter
05:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Picasso (rev c1)
05:00.2 Encryption controller: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 10h-1fh) Platform Security Processor
05:00.3 USB controller: Advanced Micro Devices, Inc. [AMD] Raven USB 3.1
05:00.4 USB controller: Advanced Micro Devices, Inc. [AMD] Raven USB 3.1
05:00.6 Audio device: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 10h-1fh) HD Audio Controller
06:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 61)

odoncaoa@tufFX705du[19]$ ubuntu-drivers --gpgpu devices
This is gpgpu mode
== /sys/devices/pci0000:00/0000:00:01.1/0000:01:00.0 ==
modalias : pci:v000010DEd00002191sv00001043sd000010DFbc03sc00i00
vendor : NVIDIA Corporation
model : TU116M [GeForce GTX 1660 Ti Mobile]
driver : nvidia-driver-460-server - distro non-free
driver : nvidia-driver-450 - third-party non-free
driver : nvidia-driver-418-server - distro non-free
driver : nvidia-driver-450-server - distro non-free
driver : nvidia-driver-455 - third-party non-free
driver : nvidia-driver-460 - third-party non-free
driver : nvidia-driver-465 - third-party non-free recommended
driver : xserver-xorg-video-nouveau - distro free builtin

Slainte,
odoncaoa

P.S. I can’t seem to find a way to attach the file, though.

If you haven’t already disabled Secure Boot in the BIOS, disable it. Also, next time attach the NVIDIA bug report log itself rather than copying and pasting output here, because that severely hurts readability.
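
For anyone who wants to confirm the Secure Boot state from within Ubuntu before rebooting into the BIOS, here is a small sketch. It assumes the `mokutil` package is installed and that `mokutil --sb-state` reports a line beginning with `SecureBoot enabled` or `SecureBoot disabled`. The wrapper name `secure_boot_enabled` is made up for this example.

```shell
#!/bin/sh
# Hypothetical helper: reads the text produced by `mokutil --sb-state` on
# stdin and succeeds if it reports Secure Boot as enabled.
secure_boot_enabled() {
    grep -qi '^SecureBoot enabled'
}

# Usage on a live system (assumes mokutil is installed):
#   mokutil --sb-state | secure_boot_enabled \
#       && echo "Secure Boot is ON: unsigned nvidia modules may be rejected" \
#       || echo "Secure Boot is not reported as enabled"
```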

There is an upload icon in the edit window.

Please show the output of dkms status
and upload your bug report (if you can’t manage it here, put it elsewhere, e.g. Google Drive).

Hello @Mart and @VulkanGuy

I am using an RTX 3070 with a 10th-gen i7.

I installed nvidia-driver-465 (I also tried 460, to no avail) using Software & Updates. It appears to be installed, but I get the following error from the nvidia-smi command.

NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

Already disabled secure boot.

dkms status gives me the following output
nvidia, 465.27, 5.4.0-73-generic, x86_64: installed
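
An `installed` entry only covers the kernel named on that line, so it is worth checking that it matches the kernel actually running (`uname -r`). The following is a minimal sketch that assumes the comma-separated `dkms status` format shown above (other DKMS versions may print `nvidia/465.27` instead, which this does not handle). The function name `dkms_nvidia_built_for` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: reads `dkms status` text on stdin and succeeds if an
# "nvidia" module is listed as "installed" for the kernel release in $1.
# Assumed line format: "nvidia, 465.27, 5.4.0-73-generic, x86_64: installed"
dkms_nvidia_built_for() {
    awk -F'[,:] *' -v k="$1" '
        $1 == "nvidia" && $3 == k && $NF == "installed" { ok = 1 }
        END { exit !ok }
    '
}

# Usage on a live system:
#   dkms status | dkms_nvidia_built_for "$(uname -r)" \
#       && echo "nvidia module is built for the running kernel" \
#       || echo "no installed nvidia module for $(uname -r)"
```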

Also attaching the log file.
nvidia-bug-report.log.gz (2.1 MB)

Ahhh. Thanks for the heads-up! I wasn’t aware that one could boot into an ‘intel mode’. I had turned off the ‘appropriate/dual’ (or whatever it’s called) video mode and currently have the ‘nvidia’-only mode selected, thinking that the warnings I was getting meant the nvidia video driver code was NOT ABLE TO function in coordination with the other driver code; that usually seemed to end in a botched ‘transfer/select’, making the process gag and taking the machine down, all the time! Now, if you don’t mind, how can one observe the power consumption of a video driver while it is in operation?

VulkanGuy,
Thanks for the would-be helpful directive, but I figured it out and arrived at a solution similar to Tigwelder’s (below). I have since floated another query about the solution he worked out and uses as a dual-use workaround.
Cheers