Fedora 27 [GeForce GT 640M LE] [Driver 390.25] [Kernel 4.15.4] __schedule OOPS with Optimus PRIME Synch...

I ran into several soft freezes under workloads such as leaving Google Chrome idle in full screen, or switching workspaces in GNOME under high disk load. The soft freeze makes GNOME on Xorg unresponsive but does not affect the SysRq keys. The hardware cursor stays responsive until I switch to another virtual terminal (getty) or force the framebuffer to refresh with SysRq. It was not possible to capture an nvidia-bug-report dump, and the bug was not reproducible using Intel modesetting.

Feb 27 21:00:27 Sherry kernel: BUG: unable to handle kernel paging request at ffffffffffffffb8
Feb 27 21:00:27 Sherry kernel: IP: __schedule+0x247/0x880
Feb 27 21:00:27 Sherry kernel: PGD b520e067 P4D b520e067 PUD b5210067 PMD 0 
Feb 27 21:00:27 Sherry kernel: Oops: 0002 [#1] SMP PTI
Feb 27 21:00:27 Sherry kernel: Modules linked in: xt_addrtype br_netfilter xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack libcrc32c tun bridge stp llc overlay ebtable_filter ebtables ip6table_filter ip6_tables ccm vmnet(OE) ppdev parport_pc parport fuse vmw_vsock_vmci_transport vsock vmw_vmci vmmon(OE) sunrpc arc4 nvidia_drm(POE) nvidia_modeset(POE) nvidia_uvm(POE) nvidia(POE) uvcvideo ath9k_htc ath9k_common videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_core ath9k_hw intel_rapl mac80211 x86_pkg_temp_thermal videodev intel_powerclamp ath rtsx_usb_ms coretemp media cfg80211 memstick snd_hda_codec_hdmi kvm_intel snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel kvm irqbypass intel_cstate iTCO_wdt
Feb 27 21:00:27 Sherry kernel:  snd_hda_codec intel_uncore dell_laptop rfkill iTCO_vendor_support intel_rapl_perf snd_hda_core dell_smbios_smm snd_hwdep dcdbas dell_smm_hwmon joydev bcma snd_seq wmi_bmof dell_smbios_wmi snd_seq_device ipmi_devintf ipmi_msghandler dell_wmi snd_pcm dell_smbios sparse_keymap dell_wmi_descriptor mei_me mei snd_timer snd lpc_ich soundcore i2c_i801 shpchp vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) dm_crypt i915 i2c_algo_bit mxm_wmi drm_kms_helper crct10dif_pclmul crc32_pclmul crc32c_intel drm ghash_clmulni_intel r8169 serio_raw mii video wmi rtsx_usb_sdmmc mmc_core rtsx_usb
Feb 27 21:00:27 Sherry kernel: CPU: 1 PID: 19907 Comm: kworker/u8:4 Tainted: P        W  OE    4.15.4-300.fc27.x86_64 #1
Feb 27 21:00:27 Sherry kernel: Hardware name: Dell Inc. Inspiron 3421/0G9D08, BIOS A13 12/14/2015
Feb 27 21:00:27 Sherry kernel: RIP: 0010:__schedule+0x247/0x880
Feb 27 21:00:27 Sherry kernel: RSP: 0018:ffff9b4688e27e70 EFLAGS: 00010282
Feb 27 21:00:27 Sherry kernel: RAX: ffff8929af2a0a80 RBX: 01010201af2a0a80 RCX: 0000000000000000
Feb 27 21:00:27 Sherry kernel: RDX: 0000000000000000 RSI: 0000000000000001 RDI: ffff8929a552be80
Feb 27 21:00:27 Sherry kernel: RBP: 0000000000000000 R08: ffff8929af2a14a0 R09: 0000000000000000
Feb 27 21:00:27 Sherry kernel: R10: 0000000000000000 R11: 000000000000038c R12: ffff8929a552be80
Feb 27 21:00:27 Sherry kernel: R13: ffff892831925dc0 R14: 0000000000000000 R15: ffff8928d95b5580
Feb 27 21:00:27 Sherry kernel: FS:  0000000000000000(0000) GS:ffff8929af280000(0000) knlGS:0000000000000000
Feb 27 21:00:27 Sherry kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 27 21:00:27 Sherry kernel: CR2: 00000000000000b0 CR3: 00000000b520a006 CR4: 00000000001606e0
Feb 27 21:00:27 Sherry kernel: Call Trace:
Feb 27 21:00:27 Sherry kernel:  ? schedule+0x28/0x80
Feb 27 21:00:27 Sherry kernel:  ? worker_thread+0xaf/0x380
Feb 27 21:00:27 Sherry kernel:  ? process_one_work+0x3b0/0x3b0
Feb 27 21:00:27 Sherry kernel:  ? kthread+0x113/0x130
Feb 27 21:00:27 Sherry kernel:  ? kthread_create_worker_on_cpu+0x70/0x70
Feb 27 21:00:27 Sherry kernel:  ? do_syscall_64+0x173/0x180
Feb 27 21:00:27 Sherry kernel:  ? ret_from_fork+0x35/0x40
Feb 27 21:00:27 Sherry kernel: Code: 98 09 00 00 76 07 c7 45 c8 04 00 00 00 49 8b 84 24 18 1b 00 00 4c 89 e6 4c 89 ef 0f b6 00 e8 11 3f 16 00 48 89 c7 e8 49 17 83 ff <48> 89 45 b8 48 8b 45 b8 48 83 b8 d8 09 00 00 00 0f 85 3d 04 00 
Feb 27 21:00:27 Sherry kernel: RIP: __schedule+0x247/0x880 RSP: ffff9b4688e27e70
System:    Host: Sherry Kernel: 4.15.4-300.fc27.x86_64 x86_64 bits: 64 Desktop: Gnome 3.26.2
           Distro: Fedora release 27 (Twenty Seven)
Machine:   Device: portable System: Dell product: Inspiron 3421 serial: N/A
           Mobo: Dell model: 0G9D08 v: A13 serial: N/A BIOS: Dell v: A13 date: 12/14/2015
Battery    BAT0: charge: 66.6 Wh 1079.1% condition: 6.2/66.6 Wh (9%)
CPU:       Dual core Intel Core i5-3317U (-MT-MCP-) cache: 3072 KB
           clock speeds: max: 2600 MHz 1: 2394 MHz 2: 2394 MHz 3: 2394 MHz 4: 2394 MHz
Graphics:  Card-1: Intel 3rd Gen Core processor Graphics Controller
           Card-2: NVIDIA GK107M [GeForce GT 640M]
           Display Server: x11 (X.org 119.6 ) drivers: modesetting,nvidia (unloaded: fbdev,vesa,nouveau)
           Resolution: 1366x768@59.97hz
           OpenGL: renderer: GeForce GT 640M LE/PCIe/SSE2 version: 4.6.0 NVIDIA 390.25
Audio:     Card Intel 7 Series/C216 Family High Definition Audio Controller driver: snd_hda_intel
           Sound: Advanced Linux Sound Architecture v: k4.15.4-300.fc27.x86_64
Network:   Card-1: Broadcom Limited BCM43142 802.11b/g/n driver: bcma-pci-bridge
           IF: N/A state: N/A mac: N/A
           Card-2: Realtek RTL810xE PCI Express Fast Ethernet controller driver: r8169
           IF: enp9s0 state: down mac: e0:db:55:8d:dc:af
           Card-3: Qualcomm Atheros AR9271 802.11n driver: ath9k_htc
           IF: wlp0s29u1u2 state: N/A mac: N/A
Drives:    HDD Total Size: 1000.2GB (39.2% used)
           ID-1: /dev/sda model: TOSHIBA_MQ01ABD1 size: 1000.2GB
Partition: ID-1: / size: 49G used: 32G (69%) fs: ext4 dev: /dev/dm-1
           ID-2: /boot size: 976M used: 202M (23%) fs: ext4 dev: /dev/sda1
           ID-3: /home size: 859G used: 327G (41%) fs: ext4 dev: /dev/dm-3
           ID-4: swap-1 size: 8.37GB used: 0.00GB (0%) fs: swap dev: /dev/dm-2
RAID:      No RAID devices: /proc/mdstat, md_mod kernel module present
Sensors:   System Temperatures: cpu: 68.0C mobo: N/A gpu: 60C
           Fan Speeds (in rpm): cpu: N/A
Info:      Processes: 321 Uptime: 39 min Memory: 3434.5/7875.0MB Client: Shell (zsh) inxi: 2.3.56

Hi tom,

Is this a regression from one driver to another, and can you find reliable steps to cause the problem? I’m not sure what to make of the kernel BUG output because there’s nothing in it that indicates the driver was involved at all.
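
For reference, a few fields of the Oops above can be decoded mechanically. The following is a minimal Python sketch, not an official tool; the helper names (decode_page_fault, as_signed64, decode_taint) are made up for illustration. It assumes the standard x86 page-fault error code bits and the usual meaning of the kernel taint letters. The byte marked `<48> 89 45 b8` in the Code line appears to be a store of RAX to -0x48(%rbp), and RBP is 0 in the register dump, which is consistent with the faulting address ffffffffffffffb8. On its own this does not show which component is responsible.

```python
# Decode selected fields from the Oops report quoted in this thread.
# Helper names are hypothetical; bit meanings follow the x86 page-fault
# error code layout.

PF_BITS = [
    # (mask, meaning when set, meaning when clear)
    (0x01, "protection violation", "page not present"),
    (0x02, "write access", "read access"),
    (0x04, "user mode", "kernel mode"),
    (0x08, "reserved bit set in page table", None),
    (0x10, "instruction fetch", None),
]

# Only the taint letters that appear in this report.
TAINT = {
    "P": "proprietary module loaded",
    "W": "a kernel warning was issued earlier",
    "O": "out-of-tree module loaded",
    "E": "unsigned module loaded",
}


def decode_page_fault(code):
    """Return human-readable meanings for a page-fault error code."""
    out = []
    for mask, if_set, if_clear in PF_BITS:
        text = if_set if code & mask else if_clear
        if text:
            out.append(text)
    return out


def as_signed64(addr):
    """Interpret a 64-bit address as a signed two's-complement offset."""
    return addr - (1 << 64) if addr & (1 << 63) else addr


def decode_taint(flags):
    """Map the taint letters from the 'Tainted:' field to descriptions."""
    return [TAINT[c] for c in flags if c in TAINT]


if __name__ == "__main__":
    # "Oops: 0002" is printed in hexadecimal.
    print(decode_page_fault(int("0002", 16)))
    # Faulting address from the "unable to handle kernel paging request" line.
    print(hex(as_signed64(0xFFFFFFFFFFFFFFB8)))
    # Flags copied from the "Tainted:" field of the CPU line.
    print(decode_taint("P        W  OE"))
```

With the values from the log, the error code reads as a kernel-mode write to a non-present page, and the address is a small negative offset from zero.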

Hi Aaron,

I really appreciate the quick reply. At the moment, I’m not sure whether it is a driver regression. The most reliable way I have found to cause the problem is to run Google Chrome in fullscreen, load WebGL content or a video so that the window is unredirected, and leave the window idle on a static page for 10-20 minutes. This also occurs in Firefox 58 under similar conditions with forced hardware acceleration.

Unfortunately, this only occurs when using PRIME Synchronization, and I am unclear as to what the cause may be.
crash2.txt (531 KB)
crash3.txt (3.66 KB)

Probably this is the same bug:

https://bugs.gentoo.org/649298

All the logs are attached to that Bugzilla report.