Linux Mint 2080 SLI

cassini12lp · February 1, 2019, 4:02pm

Hello,

Hoping to get some assistance. I have a brand new setup with a fresh install of Linux Mint 18 x64, and I have installed SLI RTX 2080 series cards with an NVLINK bridge…

Upon installing drivers 410 and also 415 the machine reboots and then crashes into an endless loop of “crashed”.

I am hoping someone can assist? I am using the PPA option via the driver manager when attempting to install driver versions. Again, I have tried both 410 and 415 with no luck. Thanks

nvidia-bug-report.log (2.09 MB)

generix · February 1, 2019, 4:23pm

Please run nvidia-bug-report.sh as root and attach the resulting .gz file to your post. Hovering the mouse over an existing post of yours will reveal a paperclip icon.
https://devtalk.nvidia.com/default/topic/1043347/announcements/attaching-files-to-forum-topics-posts/

cassini12lp · February 1, 2019, 5:10pm

Apologies, this is mostly new to me. I do not seem to have that option? My PC is currently in “fallback mode”, and that is an endless loop if I hit restart. I can get to a command prompt with F1, which I am currently in. However, if I go to /usr/lib/NVIDIA I only have “pre-install” listed.

generix · February 1, 2019, 5:19pm

On the command prompt, check if you have internet connection by running
ping google.com

If you get a reply, hit ctrl+c to stop the ping, then

install pastebinit (sudo apt install pastebinit)
sudo nvidia-bug-report.sh
unzip logfile (gunzip nvidia-bug-report.log.gz)
upload logfile (pastebinit -i nvidia-bug-report.log)
note down and post the url you’re given

cassini12lp · February 1, 2019, 5:27pm

thank you for that great reply!

I got all the way to the end and it shows
bad API request, maximum paste file size exceeded

let me see if I can get to desktop on it and web browser back to this page to attach the log.
nvidia-bug-report.log (2.09 MB)

generix · February 1, 2019, 5:39pm

That seems to be quite a large log. You can also try ubuntu’s pastebin, which has a higher size limit:
pastebinit -b http://paste.ubuntu.com -i nvidia-bug-report.log

cassini12lp · February 1, 2019, 5:42pm

not sure if it’s attached.

cassini12lp · February 1, 2019, 5:53pm

I see it now above. Do you see it as well? Thanks

generix · February 1, 2019, 6:01pm

Ok, first of all, you’re running a kernel that is much too old for your hardware. Please upgrade to the latest HWE stack:

sudo apt-get install --install-recommends linux-generic-hwe-16.04 xserver-xorg-hwe-16.04

https://wiki.ubuntu.com/Kernel/LTSEnablementStack

Next, you still have your Intel gpu active (which currently doesn’t work properly, see above), is that on purpose? Is there a monitor connected to it? If you want to run the Nvidias in SLI for graphics, you’ll have to connect your monitor to them and disable the onboard intel in bios.
BTW, SLI doesn’t really work with linux; instead of doubling gaming performance you will most often cut it in half. Or do you want to use the cards for compute?
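
A quick way to see whether the running kernel is the old one is to compare `uname -r` against a minimum version. The following is only a sketch: the `version_ge` helper is hypothetical, and the 4.15 threshold is an assumption (it is the kernel series the 16.04 HWE stack provided in early 2019), not a value stated in this thread.

```shell
# version_ge A B: succeeds when version A >= version B (relies on GNU sort -V).
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Assumption: 4.15 is the minimum used for illustration; adjust as needed.
min_kernel="4.15"
running="$(uname -r | cut -d- -f1)"   # e.g. 4.4.0-21-generic -> 4.4.0

if version_ge "$running" "$min_kernel"; then
    echo "kernel $running is at least $min_kernel"
else
    echo "kernel $running is older than $min_kernel; consider the HWE stack"
fi
```

If the second message is printed, installing the HWE packages above and rebooting should bring in a newer kernel.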

cassini12lp · February 1, 2019, 6:07pm

I ran that update, TY for that. It has a monitor attached, yes. Intel GPU active? No, not on purpose. I will disable that in the BIOS after the install finishes.

I am building this for deep learning only… I am going to reboot now and see what happens. Can’t thank you enough for your help

cassini12lp · February 1, 2019, 6:14pm

So I got the same error, “fallback mode”. Upon running nvidia-smi I get this now:
nvidia-smi
NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

generix · February 1, 2019, 6:16pm

Ok, for deep learning only, a different setup might have some advantages. Enabling the Intel gpu and using it for graphics, with the nvidia gpus used for cuda only, lets you run larger cuda kernels, since a gpu that also drives a display can hit cuda kernel timeouts. The downside is that you can’t use the nvidia gpus for graphics and a different driver setup is needed. See how far you get, and if you run into cuda kernel timeouts you can change it.

generix · February 1, 2019, 6:21pm

Please create a new nvidia-bug-report.log.

cassini12lp · February 1, 2019, 7:01pm

Thanks for that tip. If I can get these cards working in general it seems a better route for me.
See below for new pastebin log file

https://pastebin.com/TcaktHw8

TY

generix · February 1, 2019, 7:47pm

Please run
gcc -v
This should return the version
gcc version 5.4.0 20160609 (Ubuntu 5.4.0-6ubuntu1~16.04.10)
If it displays version 5.3.1 instead, first run
sudo apt-get update
sudo apt-get upgrade
to update your system and get the right gcc.
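
The gcc check can also be scripted. This is a minimal sketch, assuming GNU `sed` and `sort -V` are available; the helper names `gcc_version_from` and `version_ge` are hypothetical, and 5.4.0 is taken from the expected output quoted above.

```shell
# gcc_version_from LINE: pull the x.y.z number out of a "gcc version ..." line.
gcc_version_from() {
    printf '%s\n' "$1" | sed -n 's/^gcc version \([0-9][0-9.]*\).*/\1/p'
}

# version_ge A B: succeeds when version A >= version B (relies on GNU sort -V).
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# gcc -v writes to stderr, so redirect it before filtering.
line="$(gcc -v 2>&1 | grep '^gcc version' || true)"
found="$(gcc_version_from "$line")"

if [ -z "$found" ]; then
    echo "could not determine the gcc version"
elif version_ge "$found" "5.4.0"; then
    echo "gcc $found is new enough"
else
    echo "gcc $found is older than 5.4.0; update the system first"
fi
```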

You installed the driver using the .run installer and didn’t use the dkms option. This is not recommended and has now left you without a driver after the kernel update. Please reinstall the driver using the --dkms option, or uninstall it and use the driver from Ubuntu’s graphics ppa.
Please remove your current /etc/X11/xorg.conf and replace it with just

Section "Device"
    Identifier     "nvidia"
    Driver         "nvidia"
    VendorName     "NVIDIA Corporation"
    BusID          "PCI:1:0:0"
    Option         "AllowEmptyInitialConfiguration"
EndSection

Also connect your monitor to the first Nvidia gpu.
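
The BusID in the xorg.conf above has to match the card's PCI address. `lspci` prints that address in hexadecimal as bus:device.function, while xorg.conf expects decimal PCI:bus:device:function. The conversion can be sketched as follows; `lspci_to_busid` is a hypothetical helper name, and the sample addresses are illustrative rather than taken from the log in this thread.

```shell
# lspci_to_busid ADDR: convert "bus:device.function" in hex (as printed by
# lspci) to the decimal "PCI:bus:device:function" form used in xorg.conf.
lspci_to_busid() {
    addr="${1#0000:}"     # drop an optional PCI domain prefix
    bus="${addr%%:*}"     # text before the first colon
    rest="${addr#*:}"     # device.function
    dev="${rest%%.*}"
    fn="${rest#*.}"
    printf 'PCI:%d:%d:%d\n' "0x$bus" "0x$dev" "0x$fn"
}

lspci_to_busid 01:00.0    # prints PCI:1:0:0
lspci_to_busid 0a:00.0    # prints PCI:10:0:0
```

Running `lspci | grep -i nvidia` lists the addresses of the installed cards to feed into the helper.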

cassini12lp · February 4, 2019, 5:41pm

Output of gcc -v = Ubuntu 5.4.0-6ubuntu1~16.04.11

I installed this time with .run yes but my other attempts at ways to install have also failed. If I am to purge all Nvidia right now, can I ask how should I be getting/identifying this driver and task?

“Please reinstall the driver using --dkms option or uninstall it and use the driver from Ubuntu’s graphics ppa.”

Also, after I purge Nvidia drivers now I am going to shut down and remove one video card to attempt to just get a single 2080 working correctly before moving into SLI. Does that make sense? Thanks

cassini12lp · February 4, 2019, 6:28pm

I purged the files and then selected 415 version from Driver Manager (which I had done prior without success) BUT, this time it seemed to work just fine? output of NVIDIA-SMI is below and this is still in SLI…

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|    0      1199      G   /usr/lib/xorg/Xorg                           203MiB |
|    0      1894      G   cinnamon                                      42MiB |
+-----------------------------------------------------------------------------+

and of inxi -G

Graphics: Card-1: NVIDIA Device 1e87
Card-2: NVIDIA Device 1e87
Display Server: N/A drivers: nvidia,nouveau (unloaded: fbdev,vesa)
tty size: 128x37 Advanced Data: N/A out of X

generix · February 4, 2019, 8:48pm

You initially installed the nvidia driver right after the OS install, without first updating the system to the latest state. So you had a kernel which didn’t support the rest of your hardware and an outdated compiler, and nothing worked.

cassini12lp · February 5, 2019, 5:19pm

Thank you for all your help! To summarize, I should run exactly what command after fresh install?
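
The thread ends without a direct answer, but the order it points to (update the system, install the HWE stack, reboot, then install the driver) can be sketched as a dry run. This is an assumption pieced together from the commands posted above, not a confirmed reply; `post_install_steps` is a hypothetical helper that only prints the commands and executes nothing.

```shell
# post_install_steps: print, in order, the commands the thread points to.
# Nothing is executed; copy the lines you want to run.
post_install_steps() {
    cat <<'EOF'
sudo apt-get update
sudo apt-get upgrade
sudo apt-get install --install-recommends linux-generic-hwe-16.04 xserver-xorg-hwe-16.04
sudo reboot
EOF
}

post_install_steps
# After the reboot, install the driver (415 in this thread) from Driver Manager.
```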
