I recently got a new laptop to replace my ancient Asus A7k. Its a lovely ASUS ROG GL752VW-DH71.
I’ve tried to install Leap 42.2 several times without much success. I understand that it is an optimus platform. Accordingly I’ve following the bumblee instructions (SDB:NVIDIA Bumblebee - openSUSE Wiki). I’ve tried both btrfs and ext4 filesystems for both home and root partitions. Neither has changed the end result.
During my latest install attempt, I got to the point where the NVIDA drivers (easy way) were installing but encountered soft-locks on the CPUs (this has been typical of every attempt thus far…very annoying) that wouldn’t go away for the last two rpms. I rebooted, and wasn’t able to login with my stated user id, and default kernel (hung at text login). Zo I thought I’d resume using debug kernel. I finished installing the rest of the rpms, and finished the bumblebee config steps (Perhaps not the best idea, but it was late, I was frustrated, and had little sleep). Now for debug and regular kernel and regular safemode, I boot, get to text login. This looks alive long enough for me to type 4 characters (i.e. root) as login credentials, then it hangs.
The only way I can login to linux at this point is debug, safe mode. By some small miracle I get a graphical login and can get into plasma.
When I run optirun --status, it tells me that X is inactive and discrete video is likely on. optirun glxgears tells me that secondary GPU driver could not be loaded.
Any suggestions would be helpful (please tell me I don’t need to reinstall again!). I’ve also tried ‘intel_pstate=disable’ and ‘nomodeset’ or ‘nouveau.modprobe=0’ after doing research within the forums. Unfortunately this hasn’t moved me any further.
I know I should be including various diagnostics, but its been awhile since I had to build drivers so bear with me. Let me know what is needed so I don’t waste anyones time
Do you have a setting Discrete/Intel/Hybrid in your BIOS? If so, what’s it set to?
For more info, run nvidia-bug-report.sh and attach output file to your post.
No setting in my bios. Bios is America Megatrends V2.0.1034
Unfortunately no luck while running in debug -recoverymode
nvidia-bug-report.sh --safe-mode
nvidia-bug-report.sh will now collect information about your
system and create the file ‘nvidia-bug-report.log.gz’ in the current
directory. It may take several seconds to run. In some
cases, it may hang trying to capture data generated dynamically
by the Linux kernel and/or the NVIDIA kernel module. While
the bug report log file will be incomplete if this happens, it
may still contain enough data to diagnose your problem.
Please include the ‘nvidia-bug-report.log.gz’ log file when reporting
your bug via the NVIDIA Linux forum (see devtalk.nvidia.com)
or by sending email to ‘linux-bugs@nvidia.com’.
Running nvidia-bug-report.sh…ls: cannot access ‘/proc/driver/nvidia/./gpus/’: No such file or directory
complete.
ok, so I can’t get past a login screen under any of the default kernels.
When load just the default kernel, I get hung at the login prompt
When I run kernel-default-recovery, I get hung at the login prompt but at least I can see a bit more info.
in one boot case I noted:
bbswitch: successfully loaded: discrete card 0000:01:00:0 is off
In another I noted that Linux 4.4.62.-18.6-default xhci-hcd was the
kernel loaded, following
NVIDIA: module license “NVIDIA” taints kernel
While I would hope that a simple change of bbswitch configuration would fix things, I’ll post a question pre-emptively in the case that I need to reinstall everything:
->is there any way to prevent CPU softlocks from occurring during the installation of the NVIDIA drivers?
Ah, sorry about that; I didn’t realize that was the case the post that you suggested sounds like very similar symptoms. I expect only the kernel versions are slightly newer?
I’ve attached the acpidump file and dmesg (since I noted that it was requested on the link provided).
I’m happy to follow up on the bumblebee thread should the above suggest thats where the problem lies
Yeah, the forum virus-scanner ate the acpidump.txt.gz file and I don’t think there’s an admin interface to get it back. Sorry!
I don’t see anything obvious in the attached bug report from the debug kernel. Is the system accessible via SSH when it hangs? One other option might be to configure netconsole to log to a different computer during the hang. Since this is a laptop, the last-resort fallback of a serial console probably isn’t possible.
@aplattner: Can you find someone with pcie/drivers expertise to have a look at this:
[url]Invalid Bug ID
Something in kernel is really broken regarding pcie power management and nobody seems to have a clue what is wrong. A rapidly rising number of sky/kaby lake laptops with nvidia gpus are affected and there’s not always an easy workaround. I’m fearing that once Windows 7 is not supported by new laptop’s firmwares, this will break completely.
One noteworthy symptom is that all claim not to support ASPM. Is this a change in policy to enable this only on driver level so that Linux breaks something by disabling it altogether?
I then ‘grub2-mkconfig -o /boot/grub2/grub.cfg’ as instructed, and nothing major happened. same behaviour as before
Earlier in the evening I tried GRUB_CMDLINE_LINUX_DEFAULT=“resume=/dev/disk/by-uuid/fd4f92b1-b6e9-4bee-972e-6dc8f92fa47a splash=silent quie t showopts” “acpi_osi=!acpi_osi=Windows 2009”
forgetting to run grub2-mkconfig, and got a bunch of new PCIe error messages that I’d never seen before.
previous incarnations of opensuse (13.x) provided command access at boot so this method is pretty different for me. I’m sure that I have a typo or syntax error somewhere here.
thanks for this. I got a bit further this time and am indeed using default kernel
No nvidia driver though (touch pad works so based on your comments I figured as much)
nvidia-bug-report.sh --safe-mode
nvidia-bug-report.sh will now collect information about your
system and create the file ‘nvidia-bug-report.log.gz’ in the current
directory. It may take several seconds to run. In some
cases, it may hang trying to capture data generated dynamically
by the Linux kernel and/or the NVIDIA kernel module. While
the bug report log file will be incomplete if this happens, it
may still contain enough data to diagnose your problem.
Please include the ‘nvidia-bug-report.log.gz’ log file when reporting
your bug via the NVIDIA Linux forum (see devtalk.nvidia.com)
or by sending email to ‘linux-bugs@nvidia.com’.
Running nvidia-bug-report.sh…ls: cannot access ‘/proc/driver/nvidia/./gpus/’: No such file or directory
complete.
The last logs were useful. Still not looking perfect, though. Two things:
Your bumblebee install looks misconfigured or not configured at all, it’s loading the nvidia glx module on the intel xserver which doesn’t work.
Your bios seems buggy so it would need kernel quirks which usually don’t get backported so the 4.4 kernel of Leap might not be the best choice for your hardware.
Bumblebee config is beyond the scope of this forum, you might purge it and try with a plain PRIME setup for further debugging.
A newer kernel might be worth trying to see if kernel oopses vanish.