I have now been running the new drivers for around a week without any crashes. :)
No problems so far since February 25, when I installed 460.56.
I had previously downgraded CUDA to 10.2.89 so that I could downgrade the driver to 440.100. Now I will try upgrading CUDA to 11.2.0 again and see whether it causes any trouble.
Installed 460.56-1 on 02/25 and haven’t had issues since. Installing 460.56-2 on 03/08 and seeing how that will go.
Only half a year to fix a critical bug, and years in the making for proper Wayland support, maybe Linus was wrong after all. Thanks nvidia!
I’m running 460.56 but recompiled my kernel with CONFIG_PREEMPT=n. So far no problems, and I can have the latest CUDA and all the fancy stuff. Thanks @generix.
I have the same with CONFIG_PREEMPT in kernel 5.4.97, and it seems to work so far. CUDA 11.2.0 and 11.2.1 compilation crashes, however 11.1.1 builds and works.
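For anyone comparing notes on the CONFIG_PREEMPT workaround, here is a minimal sketch for checking which preemption options the running kernel was built with. It assumes the config is exposed at /proc/config.gz or /boot/config-<release>, which depends on the distro and kernel build; the function names are made up for illustration.

```python
import gzip
import os
import platform
import re


def preempt_settings(config_text):
    """Return a dict of the CONFIG_PREEMPT* options found in a kernel config text."""
    settings = {}
    for line in config_text.splitlines():
        enabled = re.match(r"^(CONFIG_PREEMPT\w*)=(.*)$", line)
        if enabled:
            settings[enabled.group(1)] = enabled.group(2)
            continue
        # Disabled options appear as comments: "# CONFIG_FOO is not set"
        disabled = re.match(r"^# (CONFIG_PREEMPT\w*) is not set$", line)
        if disabled:
            settings[disabled.group(1)] = "n"
    return settings


def read_kernel_config():
    """Try the usual locations of the running kernel's config; return None if absent."""
    candidates = ["/proc/config.gz", "/boot/config-" + platform.release()]
    for path in candidates:
        opener = gzip.open if path.endswith(".gz") else open
        try:
            with opener(path, "rt") as handle:
                return handle.read()
        except OSError:
            continue
    return None


if __name__ == "__main__":
    text = read_kernel_config()
    if text is None:
        print("kernel config not found in the assumed locations")
    else:
        for name, value in sorted(preempt_settings(text).items()):
            print(name, "=", value)
```

A kernel rebuilt as described above would be expected to report CONFIG_PREEMPT as "n" (not set).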
Hi kamiox,
Please confirm whether you are still observing the crash.
I tried to reproduce the issue on multiple setups by running VLC player and doing suspend/resume multiple times, but could not reproduce it.
Precision T7600 + Genuine Intel(R) CPU @ 2.60GHz + 5.9.1-arch1-1 + Driver 460.56 + NVIDIA TITAN Xp
Alienware + AMD Ryzen Threadripper 1950X 16-Core Processor + Ubuntu 20.04.2 LTS + Driver 460.56 + RTX 3090
Could you please provide detailed repro steps?
Hi @amrits
I’m currently using NVIDIA driver 465.24.02 and Linux 5.11.14-zen1-1-zen. I have not observed any crashes recently; however, I haven’t stressed the system with the same usage as before (running VLC or Kodi).
I will try to do some tests in the upcoming days and will let you know if I experience any crashes.
@kamiox
Thanks for the update. I will await your test results.
An issue with the latest drivers has hit many members of the Arch Linux community, myself included. The solution is to downgrade the kernel to 5.11.13 or older and the NVIDIA driver to 460.67. The problem appears to happen with kernel 5.11.15 + the 465 NVIDIA driver; this is what I get with a 3080 FE:
Apr 20 02:35:31 chrome kernel: BUG: kernel NULL pointer dereference, address: 0000000000000170
Apr 20 02:35:31 chrome kernel: #PF: supervisor read access in kernel mode
Apr 20 02:35:31 chrome kernel: #PF: error_code(0x0000) - not-present page
Calling nvidia-smi from the shell will hang/freeze the machine requiring a hard reboot, as will trying to start X. There’s also a fair number of people reporting unplugging their secondary monitor fixes the problem, but unplugging the primary monitor doesn’t. I have 2 DP monitors myself but downgraded before I tried.
For more info:
https://bbs.archlinux.org/viewtopic.php?id=265563
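Since this thread mixes crashes from different drivers, a small sketch for pulling the fault address and the faulting module out of an oops like the one above may help when comparing reports. It only assumes the "BUG: kernel NULL pointer dereference" and "RIP: 0010:symbol+offset/size [module]" line shapes seen in the logs posted here; the function name is hypothetical.

```python
import re

# Kernel-mode RIP line, e.g. "RIP: 0010:_nv015534rm+0x1b6/0x330 [nvidia]".
# The module part is optional (built-in code has none) and may carry a build hash.
RIP_RE = re.compile(
    r"RIP: 0010:(?P<symbol>[^\s+]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?: \[(?P<module>[^\]\s]+)[^\]]*\])?"
)
BUG_RE = re.compile(
    r"BUG: kernel NULL pointer dereference, address: (?P<addr>[0-9a-f]+)"
)


def summarize_oops(log_text):
    """Return the first fault address, faulting symbol and module found in a log."""
    summary = {"address": None, "symbol": None, "module": None}
    for line in log_text.splitlines():
        if summary["address"] is None:
            bug = BUG_RE.search(line)
            if bug:
                summary["address"] = int(bug.group("addr"), 16)
        if summary["symbol"] is None:
            rip = RIP_RE.search(line)
            if rip:
                summary["symbol"] = rip.group("symbol")
                # No bracketed module name means the symbol is in the kernel image.
                summary["module"] = rip.group("module") or "kernel"
    return summary
```

Feeding it the output of journalctl -k or dmesg shows at a glance whether the crash is inside nvidia or nouveau, which matters because the two need different fixes. A small address such as 0x170 usually means a structure field was read through a NULL pointer.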
Kernel panic happens during boot.
lspci -k | grep -A 2 -E "(VGA|3D)"
01:00.0 VGA compatible controller: NVIDIA Corporation TU102 [GeForce RTX 2080 Ti Rev. A] (rev a1)
Subsystem: Micro-Star International Co., Ltd. [MSI] RTX 2080 Ti GAMING X TRIO
Kernel modules: nouveau, nvidia_drm, nvidia
Grub cmdline options:
Apr 19 23:33:21 redstar kernel: Command line: BOOT_IMAGE=/vmlinuz-linux root=UUID=355f88b0-acb5-4e41-b859-707c985eddd8 rw loglevel=3 nvidia-drm.modeset=1 ignore_loglevel
uname -a
Linux redstar 5.11.15-arch1-2 #1 SMP PREEMPT Sat, 17 Apr 2021 00:22:30 +0000 x86_64 GNU/Linux
Bug:
Apr 19 23:33:27 redstar kernel: BUG: kernel NULL pointer dereference, address: 0000000000000170
Apr 19 23:33:27 redstar kernel: #PF: supervisor read access in kernel mode
Apr 19 23:33:27 redstar kernel: #PF: error_code(0x0000) - not-present page
Apr 19 23:33:27 redstar kernel: PGD 0 P4D 0
Apr 19 23:33:27 redstar kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
Apr 19 23:33:27 redstar kernel: CPU: 1 PID: 412 Comm: systemd-udevd Tainted: P OE 5.11.15-arch1-2 #1
Apr 19 23:33:27 redstar kernel: Hardware name: Micro-Star International Co., Ltd. MS-7C79/MPG Z490 GAMING EDGE WIFI (MS-7C79), BIOS 1.60 02/01/2021
Apr 19 23:33:27 redstar kernel: RIP: 0010:_nv015534rm+0x1b6/0x330 [nvidia]
Apr 19 23:33:27 redstar kernel: Code: 8b 87 68 05 00 00 ba 01 00 00 00 be 02 00 00 00 e8 cf 50 9a c2 41 83 c5 01 41 83 fd 1f 0f 84 0b 01 00 00 48 8b 45 10 44 89 ee <48> 8b b8 70 01 00 00 48 8b 87 d8 04 00 00 e8 a7 50 9a c2 89 c3 48
Apr 19 23:33:27 redstar kernel: RSP: 0018:ffffb1fc013cb780 EFLAGS: 00010293
Apr 19 23:33:27 redstar kernel: RAX: 0000000000000000 RBX: 0000000000002000 RCX: 0000000000000004
Apr 19 23:33:27 redstar kernel: RDX: 0000000000000004 RSI: 0000000000000005 RDI: 0000000000000000
Apr 19 23:33:27 redstar kernel: RBP: ffff90dddc21add0 R08: 0000000000000001 R09: ffff90dddc21acb8
Apr 19 23:33:27 redstar kernel: R10: ffff90dddcb10008 R11: 0000000010100000 R12: 0000000000002400
Apr 19 23:33:27 redstar kernel: R13: 0000000000000005 R14: ffff90ddd92f4010 R15: 0000000000008000
Apr 19 23:33:27 redstar kernel: FS: 00007f76eb7aea40(0000) GS:ffff90e51da40000(0000) knlGS:0000000000000000
Apr 19 23:33:27 redstar kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Apr 19 23:33:27 redstar kernel: CR2: 0000000000000170 CR3: 0000000110afa005 CR4: 00000000007706e0
Apr 19 23:33:27 redstar kernel: PKRU: 55555554
Apr 19 23:33:27 redstar kernel: Call Trace:
Apr 19 23:33:27 redstar kernel: ? _nv015556rm+0x7fd/0x1020 [nvidia]
Apr 19 23:33:27 redstar kernel: ? _nv027154rm+0x22c/0x4f0 [nvidia]
Apr 19 23:33:27 redstar kernel: ? _nv017786rm+0x303/0x5e0 [nvidia]
Apr 19 23:33:27 redstar kernel: ? _nv017787rm+0x30/0xa0 [nvidia]
Apr 19 23:33:27 redstar kernel: ? _nv017788rm+0xe1/0x220 [nvidia]
Apr 19 23:33:27 redstar kernel: ? _nv022828rm+0xed/0x220 [nvidia]
Apr 19 23:33:27 redstar kernel: ? _nv023064rm+0x30/0x60 [nvidia]
Apr 19 23:33:27 redstar kernel: ? _nv000704rm+0x16da/0x22b0 [nvidia]
Apr 19 23:33:27 redstar kernel: ? rm_init_adapter+0xc5/0xe0 [nvidia]
Apr 19 23:33:27 redstar kernel: ? kthread_create_on_node+0x51/0x70
Apr 19 23:33:27 redstar kernel: ? nv_open_device+0x122/0x8a0 [nvidia]
Apr 19 23:33:27 redstar kernel: ? nvidia_dev_get+0x63/0xb0 [nvidia]
Apr 19 23:33:27 redstar kernel: ? nvkms_open_gpu+0x4e/0x90 [nvidia_modeset]
Apr 19 23:33:27 redstar kernel: ? _nv000010kms+0x40/0x260 [nvidia_modeset]
Apr 19 23:33:27 redstar kernel: ? printk+0x68/0x7f
Apr 19 23:33:27 redstar kernel: ? security_kernfs_init_security+0x2a/0x40
Apr 19 23:33:27 redstar kernel: ? nv_drm_load+0xac/0x3ae [nvidia_drm]
Apr 19 23:33:27 redstar kernel: ? nv_drm_master_drop+0x60/0x60 [nvidia_drm]
Apr 19 23:33:27 redstar kernel: ? drm_dev_register+0xc8/0x1b0 [drm]
Apr 19 23:33:27 redstar kernel: ? nv_drm_probe_devices+0x184/0x210 [nvidia_drm]
Apr 19 23:33:27 redstar kernel: ? 0xffffffffc0baf000
Apr 19 23:33:27 redstar kernel: ? do_one_initcall+0x57/0x220
Apr 19 23:33:27 redstar kernel: ? do_init_module+0x5c/0x270
Apr 19 23:33:27 redstar kernel: ? load_module+0x243e/0x2610
Apr 19 23:33:27 redstar kernel: ? __do_sys_init_module+0x136/0x1b0
Apr 19 23:33:27 redstar kernel: ? do_syscall_64+0x33/0x40
Apr 19 23:33:27 redstar kernel: ? entry_SYSCALL_64_after_hwframe+0x44/0xa9
Apr 19 23:33:27 redstar kernel: Modules linked in: joydev mousedev uvcvideo btusb btrtl btbcm videobuf2_vmalloc btintel videobuf2_memops videobuf2_v4l2 bluetooth snd_usb_audio videobuf2_common videodev snd_usbmidi_lib snd_rawmidi snd_seq_device mc ecdh_generic ecc usbhid crc16 intel_rapl_msr nvidia_drm(POE+) intel_rapl_common nvidia_modeset(POE) snd_sof_pci snd_sof_intel_hda_common uas nvidia(POE) usb_storage snd_sof_intel_hda snd_sof_intel_byt ucsi_ccg iTCO_wdt typec_ucsi snd_sof_intel_ipc intel_pmc_bxt ee1004 iTCO_vendor_support mei_hdcp typec wmi_bmof intel_wmi_thunderbolt mxm_wmi snd_sof snd_sof_xtensa_dsp snd_soc_skl snd_hda_codec_realtek snd_hda_codec_generic snd_soc_hdac_hda snd_hda_ext_core snd_hda_codec_hdmi ledtrig_audio snd_soc_sst_ipc x86_pkg_temp_thermal snd_soc_sst_dsp intel_powerclamp iwlmvm snd_soc_acpi_intel_match coretemp snd_soc_acpi kvm_intel snd_hda_intel snd_intel_dspcfg soundwire_intel soundwire_generic_allocation soundwire_cadence mac80211 kvm snd_hda_codec libarc4 irqbypass
Apr 19 23:33:27 redstar kernel: snd_hda_core crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_hwdep aesni_intel r8125(OE) soundwire_bus crypto_simd snd_soc_core iwlwifi cryptd r8169 snd_compress glue_helper ac97_bus rapl snd_pcm_dmaengine intel_cstate realtek intel_uncore cfg80211 drm_kms_helper snd_pcm pcspkr i2c_i801 mdio_devres snd_timer i2c_smbus mei_me cec libphy snd mei syscopyarea sysfillrect soundcore sysimgblt rfkill fb_sys_fops i2c_nvidia_gpu intel_pch_thermal video mac_hid wmi acpi_tad acpi_pad vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) drm sg crypto_user fuse agpgart bpf_preload ip_tables x_tables btrfs blake2b_generic libcrc32c crc32c_generic xor raid6_pq xhci_pci crc32c_intel xhci_pci_renesas
Apr 19 23:33:27 redstar kernel: CR2: 0000000000000170
Apr 19 23:33:27 redstar kernel: ---[ end trace 60456de3156bc3b3 ]---
Apr 19 23:33:27 redstar kernel: RIP: 0010:_nv015534rm+0x1b6/0x330 [nvidia]
Apr 19 23:33:27 redstar kernel: Code: 8b 87 68 05 00 00 ba 01 00 00 00 be 02 00 00 00 e8 cf 50 9a c2 41 83 c5 01 41 83 fd 1f 0f 84 0b 01 00 00 48 8b 45 10 44 89 ee <48> 8b b8 70 01 00 00 48 8b 87 d8 04 00 00 e8 a7 50 9a c2 89 c3 48
Apr 19 23:33:27 redstar kernel: RSP: 0018:ffffb1fc013cb780 EFLAGS: 00010293
Apr 19 23:33:27 redstar kernel: RAX: 0000000000000000 RBX: 0000000000002000 RCX: 0000000000000004
Apr 19 23:33:27 redstar kernel: RDX: 0000000000000004 RSI: 0000000000000005 RDI: 0000000000000000
Apr 19 23:33:27 redstar kernel: RBP: ffff90dddc21add0 R08: 0000000000000001 R09: ffff90dddc21acb8
Apr 19 23:33:27 redstar kernel: R10: ffff90dddcb10008 R11: 0000000010100000 R12: 0000000000002400
Apr 19 23:33:27 redstar kernel: R13: 0000000000000005 R14: ffff90ddd92f4010 R15: 0000000000008000
Apr 19 23:33:27 redstar kernel: FS: 00007f76eb7aea40(0000) GS:ffff90e51da40000(0000) knlGS:0000000000000000
Apr 19 23:33:27 redstar kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Apr 19 23:33:27 redstar kernel: CR2: 0000000000000170 CR3: 0000000110afa005 CR4: 00000000007706e0
Apr 19 23:33:27 redstar kernel: PKRU: 55555554
Apr 19 23:33:27 redstar systemd-udevd[367]: Worker [412] terminated by signal 9 (KILL)
Apr 19 23:33:27 redstar systemd-udevd[367]: 0000:01:00.0: Worker [412] failed
Apr 19 23:33:29 redstar NetworkManager[474]: <info> [1618846409.2434] manager: NetworkManager state is now CONNECTED_GLOBAL
Apr 19 23:33:50 redstar dbus-daemon[473]: [system] Failed to activate service 'org.freedesktop.resolve1': timed out (service_start_timeout=25000ms)
Apr 19 23:33:55 redstar systemd-timesyncd[470]: Initial synchronization to time server 27.124.125.251:123 (2.arch.pool.ntp.org).
-- Boot 90d2768fdc6d4deebd68db5ea7028005 --
Setting the GRUB cmdline option acpi=off allows Arch to boot, but different problems happen:
Apr 19 00:26:10 redstar kernel: Command line: BOOT_IMAGE=/boot/vmlinuz-linux root=UUID=355f88b0-acb5-4e41-b859-707c985eddd8 rw loglevel=3 nvidia-drm.modeset=1 pci=noacpi
Apr 19 00:26:10 redstar kernel: nvidia: loading out-of-tree module taints kernel.
Apr 19 00:26:10 redstar kernel: nvidia: module license 'NVIDIA' taints kernel.
Apr 19 00:26:10 redstar kernel: Disabling lock debugging due to kernel taint
Apr 19 00:26:10 redstar kernel: nvidia: module verification failed: signature and/or required key missing - tainting kernel
Apr 19 00:26:10 redstar kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 239
Apr 19 00:26:10 redstar kernel:
Apr 19 00:26:10 redstar kernel: nvidia 0000:01:00.0: can't find IRQ for PCI INT A; please try using pci=biosirq
Apr 19 00:26:10 redstar kernel: NVRM: Can't find an IRQ for your NVIDIA card!
Apr 19 00:26:10 redstar kernel: NVRM: Please check your BIOS settings.
Apr 19 00:26:10 redstar kernel: NVRM: [Plug & Play OS] should be set to NO
Apr 19 00:26:10 redstar kernel: NVRM: [Assign IRQ to VGA] should be set to YES
Apr 19 00:26:10 redstar kernel: nvidia: probe of 0000:01:00.0 failed with error -1
Apr 19 00:26:10 redstar kernel: NVRM: The NVIDIA probe routine failed for 1 device(s).
Apr 19 00:26:10 redstar kernel: NVRM: None of the NVIDIA devices were initialized.
Apr 19 00:26:10 redstar kernel: nvidia-nvlink: Unregistered the Nvlink Core, major device number 239
Apr 19 00:26:10 redstar kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 239
Apr 19 00:26:10 redstar kernel: NVRM: Can't find an IRQ for your NVIDIA card!
This bug report was generated with acpi=off in the cmdline options.
If any more information is needed, let me know.
nvidia-bug-report.log.gz (76.7 KB)
@fearfactory2006 Looks like you have the nouveau module loaded. It has to be blacklisted while you’re using the NVIDIA binary driver.
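Note that the "Kernel modules:" line in lspci -k lists modules that can drive the device, so one way to confirm what is actually loaded is to read /proc/modules. A rough sketch, with hypothetical function names:

```python
def loaded_modules(proc_modules_text):
    """Return the set of module names from /proc/modules-style text.

    Each non-empty line starts with the module name followed by size,
    use count, dependents, state and load address.
    """
    return {
        line.split()[0]
        for line in proc_modules_text.splitlines()
        if line.strip()
    }


def conflicting_gpu_drivers(proc_modules_text):
    """True when both nouveau and the proprietary nvidia module are loaded."""
    modules = loaded_modules(proc_modules_text)
    return "nouveau" in modules and "nvidia" in modules


if __name__ == "__main__":
    try:
        with open("/proc/modules") as handle:
            text = handle.read()
    except OSError:
        text = ""
    print("nouveau and nvidia both loaded:", conflicting_gpu_drivers(text))
```

If this reports True, nouveau still needs to be blacklisted, for example with the modprobe.blacklist=nouveau kernel parameter mentioned later in this thread.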
@kamiox
Did you get a chance to run the tests?
I repeatedly get this issue on Fedora 32 with several repository versions of the kernel and several NVIDIA driver versions I tested, including 460.67. It does not happen very often (once every 1-2 weeks) but it is obviously still quite annoying. I can always ssh to the machine when the freeze happens. It has always frozen while in Chrome (both when actively using it and when I returned to the computer with Chrome active), although I spend a lot of time in Chrome, so I cannot rule out that it is not specific to Chrome or the browser. I do have a two-screen setup; I am not sure whether that is relevant. The dmesg log is attached; it is for a freeze with NVIDIA driver 460.67. The nvidia-bug-report script just hangs even with --safe-mode and provides basically nothing.
dmesg.log (8.6 KB)
Can someone distinguish whether my bug report concerns this bug or the 2-screen monitor bug (or yet another one)? A few people reported in this thread that this bug was fixed for them with 460.67 but it is definitely not the case for me.
For me, this issue started a few months ago when I upgraded the system and I am not able to get rid of it with any nvidia version from the 460 series. I know for sure that I did not have this issue with the 440 series though and my uptime used to be in months then, with no freezes etc.
@skub This is something different: you’re getting an XID 61 and the driver goes downhill from there. Since this was working previously, I guess the GPU is starting to fail. Please try reseating it and use gpu-burn to stress-test it.
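To tell an Xid problem apart from the NULL pointer oops discussed earlier, it can help to list the Xid codes in the kernel log. A minimal sketch, assuming the usual "NVRM: Xid (PCI:bus): code, ..." line shape; the sample line in the comment is illustrative and not taken from the attached dmesg.log, and the function name is hypothetical.

```python
import re

# Illustrative line shape (assumed, not copied from this thread):
#   NVRM: Xid (PCI:0000:0a:00): 61, pid=1234, ...
# Older drivers print the bus id without the "PCI:" prefix, so it is optional here.
XID_RE = re.compile(
    r"NVRM: Xid \((?:PCI:)?(?P<bus>[0-9a-fA-F:.]+)\): (?P<code>\d+)"
)


def xid_events(log_text):
    """Return a list of (pci_bus_id, xid_code) tuples in order of appearance."""
    return [
        (match.group("bus"), int(match.group("code")))
        for match in XID_RE.finditer(log_text)
    ]
```

Running this over the output of dmesg or journalctl -k gives a quick list of which Xid codes appeared and on which GPU, which can then be looked up in NVIDIA's Xid documentation.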
Thanks for the suggestion! I have just stress-tested it pretty intensively (without reseating) and found no issues. Also, the freezes did not happen under GPU load but just randomly when browsing in Chrome or when I was away with Chrome open in the foreground (both happen a lot, so I cannot rule out that it could freeze with other usage as well, but it is definitely not associated with high GPU load).
I will try reseating it, but since the freeze normally happens randomly about once every 1-2 weeks, it will take time to report back.
But, from your comment, I guess my issue could be more likely this one: Random Xid 61 and Xorg lock-up - #30 by collinvandyck
I also have a Ryzen CPU (2990WX) and an ASUS board (399A) which seems to be associated with that issue. That would be good news at least wrt the fix of the bug from this thread. So I’ll also try the suggestions from that thread…
авг 12 13:40:22 kuramshin kwin_x11[4832]: kwin_core: XCB error: 152 (BadDamage), sequence: 57212, resource id: 1527>
авг 12 13:40:22 kuramshin kwin_x11[4832]: kwin_core: XCB error: 3 (BadWindow), sequence: 57220, resource id: 419451>
авг 12 13:41:59 kuramshin kernel: BUG: kernel NULL pointer dereference, address: 0000000000000008
авг 12 13:41:59 kuramshin kernel: #PF: supervisor read access in kernel mode
авг 12 13:41:59 kuramshin kernel: #PF: error_code(0x0000) - not-present page
авг 12 13:41:59 kuramshin kernel: PGD 0 P4D 0
авг 12 13:41:59 kuramshin kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
авг 12 13:41:59 kuramshin kernel: CPU: 8 PID: 1519 Comm: Xorg Tainted: G W OE 5.10.56-1-MANJARO #1
авг 12 13:41:59 kuramshin kernel: Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./X570 Pro4, BIOS P1.4>
авг 12 13:41:59 kuramshin kernel: RIP: 0010:gp100_vmm_pgt_mem+0xcb/0x180 [nouveau]
авг 12 13:41:59 kuramshin kernel: Code: 56 50 41 8d 51 ff 8b 44 24 0c 41 89 c5 44 8d 64 02 01 41 0f b7 57 12 49 8b >
авг 12 13:41:59 kuramshin kernel: RSP: 0018:ffffb50d8e35f7e0 EFLAGS: 00010206
авг 12 13:41:59 kuramshin kernel: RAX: 0000000000000009 RBX: 06000000008af001 RCX: 0000000000000010
авг 12 13:41:59 kuramshin kernel: RDX: 0000000000000000 RSI: 0000000000000148 RDI: ffff96509805af80
авг 12 13:41:59 kuramshin kernel: RBP: 0000000000000148 R08: ffffb50d8e35f9d8 R09: 000000000000000f
авг 12 13:41:59 kuramshin kernel: R10: 0000000000000000 R11: 0600000000000001 R12: 0000000000000018
авг 12 13:41:59 kuramshin kernel: R13: 000000000000000a R14: ffffb50d8e35f9d8 R15: ffff964cd07d0f00
авг 12 13:41:59 kuramshin kernel: FS: 00007f1641f8e940(0000) GS:ffff9650fea00000(0000) knlGS:0000000000000000
авг 12 13:41:59 kuramshin kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
авг 12 13:41:59 kuramshin kernel: CR2: 0000000000000008 CR3: 0000000133fac000 CR4: 0000000000350ee0
авг 12 13:41:59 kuramshin kernel: Call Trace:
авг 12 13:41:59 kuramshin kernel: nvkm_vmm_iter.constprop.0+0x2bf/0x860 [nouveau]
авг 12 13:41:59 kuramshin kernel: ? nvkm_vmm_ref_sptes.isra.0+0x1b0/0x1b0 [nouveau]
авг 12 13:41:59 kuramshin kernel: ? gp100_vmm_pgt_sgl+0x180/0x180 [nouveau]
авг 12 13:41:59 kuramshin kernel: nvkm_vmm_ptes_get_map+0x2c/0x90 [nouveau]
авг 12 13:41:59 kuramshin kernel: ? nvkm_vmm_ref_sptes.isra.0+0x1b0/0x1b0 [nouveau]
авг 12 13:41:59 kuramshin kernel: ? gp100_vmm_pgt_sgl+0x180/0x180 [nouveau]
авг 12 13:41:59 kuramshin kernel: nvkm_vmm_map+0x1d4/0x350 [nouveau]
авг 12 13:41:59 kuramshin kernel: nvkm_vram_map+0x56/0x80 [nouveau]
авг 12 13:41:59 kuramshin kernel: nvkm_uvmm_mthd+0x657/0x790 [nouveau]
авг 12 13:41:59 kuramshin kernel: nvkm_ioctl+0xdc/0x180 [nouveau]
авг 12 13:41:59 kuramshin kernel: nvif_object_mthd+0x104/0x130 [nouveau]
авг 12 13:41:59 kuramshin kernel: ? nvif_object_mthd+0x117/0x130 [nouveau]
авг 12 13:41:59 kuramshin kernel: nvif_vmm_map+0x115/0x130 [nouveau]
авг 12 13:41:59 kuramshin kernel: ? unix_stream_read_generic+0x1db/0x860
авг 12 13:41:59 kuramshin kernel: nouveau_mem_map+0x8f/0x100 [nouveau]
авг 12 13:41:59 kuramshin kernel: nouveau_vma_new+0x1c7/0x1f0 [nouveau]
авг 12 13:41:59 kuramshin kernel: nouveau_gem_object_open+0xc5/0x130 [nouveau]
авг 12 13:41:59 kuramshin kernel: drm_gem_handle_create_tail+0xfa/0x1c0 [drm]
авг 12 13:41:59 kuramshin kernel: drm_gem_prime_fd_to_handle+0xfb/0x1d0 [drm]
авг 12 13:41:59 kuramshin kernel: ? drm_prime_destroy_file_private+0x20/0x20 [drm]
авг 12 13:41:59 kuramshin kernel: drm_ioctl_kernel+0xaa/0xf0 [drm]
авг 12 13:41:59 kuramshin kernel: drm_ioctl+0x220/0x3e0 [drm]
авг 12 13:41:59 kuramshin kernel: ? drm_prime_destroy_file_private+0x20/0x20 [drm]
авг 12 13:41:59 kuramshin kernel: ? __fget_files+0x6b/0xa0
авг 12 13:41:59 kuramshin kernel: nouveau_drm_ioctl+0x55/0xa0 [nouveau]
авг 12 13:41:59 kuramshin kernel: __x64_sys_ioctl+0x82/0xb0
авг 12 13:41:59 kuramshin kernel: do_syscall_64+0x33/0x40
авг 12 13:41:59 kuramshin kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
авг 12 13:41:59 kuramshin kernel: RIP: 0033:0x7f16429f159b
авг 12 13:41:59 kuramshin kernel: Code: ff ff ff 85 c0 79 9b 49 c7 c4 ff ff ff ff 5b 5d 4c 89 e0 41 5c c3 66 0f 1f >
авг 12 13:41:59 kuramshin kernel: RSP: 002b:00007fff9cfd7b28 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
авг 12 13:41:59 kuramshin kernel: RAX: ffffffffffffffda RBX: 00007fff9cfd7b6c RCX: 00007f16429f159b
авг 12 13:41:59 kuramshin kernel: RDX: 00007fff9cfd7b6c RSI: 00000000c00c642e RDI: 0000000000000012
авг 12 13:41:59 kuramshin kernel: RBP: 00000000c00c642e R08: 000055953a954870 R09: 00007f1642abca60
авг 12 13:41:59 kuramshin kernel: R10: 000055953a95a1c0 R11: 0000000000000246 R12: 0000000000000076
авг 12 13:41:59 kuramshin kernel: R13: 0000000000000012 R14: 00005595369eebf8 R15: 0000000000000000
авг 12 13:41:59 kuramshin kernel: Modules linked in: ccm vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) rfcomm xt_CHECKS>
авг 12 13:41:59 kuramshin kernel: kvm_amd mxm_wmi snd_soc_core video ccp ucsi_ccg rng_core i2c_algo_bit ttm snd_co>
авг 12 13:41:59 kuramshin kernel: CR2: 0000000000000008
авг 12 13:41:59 kuramshin kernel: ---[ end trace 89d953530843d185 ]---
авг 12 13:41:59 kuramshin kernel: RIP: 0010:gp100_vmm_pgt_mem+0xcb/0x180 [nouveau]
авг 12 13:41:59 kuramshin kernel: Code: 56 50 41 8d 51 ff 8b 44 24 0c 41 89 c5 44 8d 64 02 01 41 0f b7 57 12 49 8b >
авг 12 13:41:59 kuramshin kernel: RSP: 0018:ffffb50d8e35f7e0 EFLAGS: 00010206
авг 12 13:41:59 kuramshin kernel: RAX: 0000000000000009 RBX: 06000000008af001 RCX: 0000000000000010
авг 12 13:41:59 kuramshin kernel: RDX: 0000000000000000 RSI: 0000000000000148 RDI: ffff96509805af80
авг 12 13:41:59 kuramshin kernel: RBP: 0000000000000148 R08: ffffb50d8e35f9d8 R09: 000000000000000f
авг 12 13:41:59 kuramshin kernel: R10: 0000000000000000 R11: 0600000000000001 R12: 0000000000000018
авг 12 13:41:59 kuramshin kernel: R13: 000000000000000a R14: ffffb50d8e35f9d8 R15: ffff964cd07d0f00
авг 12 13:41:59 kuramshin kernel: FS: 00007f1641f8e940(0000) GS:ffff9650fea00000(0000) knlGS:0000000000000000
авг 12 13:41:59 kuramshin kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
авг 12 13:41:59 kuramshin kernel: CR2: 0000000000000008 CR3: 0000000133fac000 CR4: 0000000000350ee0
Kernel: 5.10.56-1-MANJARO x86_64 bits: 64 compiler: gcc v: 11.1.0
Desktop: KDE Plasma 5.22.4 Distro: Manjaro Linux base: Arch Linux
MB: ASRock model: X570 Pro4
UEFI-[Legacy]: American Megatrends v: P1.40 date: 08/12/2019
CPU: AMD Ryzen 9 3950X
Graphics: Device-1: NVIDIA TU102 [GeForce RTX 2080 Ti] driver: nouveau v: kernel bus-ID: 08:00.0
Device-2: Microdia Camera type: USB driver: snd-usb-audio,uvcvideo bus-ID: 7-3:3
Display: x11 server: X.Org 1.20.13 driver: loaded: modesetting,nouveau resolution: 1920x1080~60Hz
OpenGL: renderer: NV162 v: 4.3 Mesa 21.1.6 direct render: Yes
As a workaround, I blacklisted the nouveau module:
GRUB_CMDLINE_LINUX_DEFAULT="loglevel=3 quiet modprobe.blacklist=nouveau"
The error happens when trying to power off the laptop (it never powers off). Even the ASUS site lists this workaround (ASUS NoteBook Linux).
----------------DMESG------------------
[ 2.800916] input: HD-Audio Generic HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:08.1/0000:04:00.1/sound/card1/input15
[ 2.800989] input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:01.1/0000:01:00.1/sound/card0/input11
[ 2.801030] input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:01.1/0000:01:00.1/sound/card0/input12
[ 2.801067] input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:01.1/0000:01:00.1/sound/card0/input13
[ 2.801091] input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:01.1/0000:01:00.1/sound/card0/input14
[ 2.801802] pci 0000:01:00.0: optimus capabilities: enabled, status dynamic power, hda bios codec supported
[ 2.801811] VGA switcheroo: detected Optimus DSM method _SB_.PCI0.GPP0.PEGP handle
[ 2.801812] nouveau: detected PR support, will not use DSM
[ 2.801854] nouveau 0000:01:00.0: enabling device (0000 -> 0003)
[ 2.802149] Console: switching to colour dummy device 80x25
[ 2.802206] nouveau 0000:01:00.0: NVIDIA GA106 (b76000a1)
[ 2.817286] usb 3-1: New USB device found, idVendor=2109, idProduct=d141, bcdDevice=80.51
[ 2.817289] usb 3-1: New USB device strings: Mfr=1, Product=2, SerialNumber=0
[ 2.817291] usb 3-1: Product: USB2.0 Hub
[ 2.817292] usb 3-1: Manufacturer: VIA Labs, Inc.
[ 2.831418] usb 1-1: New USB device found, idVendor=046d, idProduct=c52b, bcdDevice=24.11
[ 2.831421] usb 1-1: New USB device strings: Mfr=1, Product=2, SerialNumber=0
[ 2.831422] usb 1-1: Product: USB Receiver
[ 2.831423] usb 1-1: Manufacturer: Logitech
[ 2.856615] hub 3-1:1.0: USB hub found
[ 2.857548] hub 3-1:1.0: 4 ports detected
[ 2.916686] nouveau 0000:01:00.0: bios: version 94.06.15.00.7a
[ 2.918859] nouveau 0000:01:00.0: fb: 6144 MiB GDDR6
[ 2.929671] nouveau 0000:01:00.0: DRM: VRAM: 6144 MiB
[ 2.929672] nouveau 0000:01:00.0: DRM: GART: 536870912 MiB
[ 2.929673] nouveau 0000:01:00.0: DRM: BIT table 'A' not found
[ 2.929674] nouveau 0000:01:00.0: DRM: BIT table 'L' not found
[ 2.929675] nouveau 0000:01:00.0: DRM: TMDS table version 2.0
[ 2.929676] nouveau 0000:01:00.0: DRM: DCB version 4.1
[ 2.929676] nouveau 0000:01:00.0: DRM: DCB outp 00: 04000f76 04600010
[ 2.929678] nouveau 0000:01:00.0: DRM: DCB outp 01: 04000f72 00020010
[ 2.929679] nouveau 0000:01:00.0: DRM: DCB conn 00: 01000046
[ 2.930036] nouveau 0000:01:00.0: DRM: MM: using COPY for buffer copies
[ 2.931485] snd_hda_intel 0000:01:00.1: bound 0000:01:00.0 (ops nv50_audio_component_bind_ops [nouveau])
[ 2.932095] BUG: kernel NULL pointer dereference, address: 0000000000000020
[ 2.932098] #PF: supervisor read access in kernel mode
[ 2.932099] #PF: error_code(0x0000) - not-present page
[ 2.932100] PGD 0 P4D 0
[ 2.932102] Oops: 0000 [#1] PREEMPT SMP NOPTI
[ 2.932104] CPU: 9 PID: 377 Comm: (udev-worker) Not tainted 6.1.38-2-lts #1 68b6d871287d3448dd49d65f1f674cec627eeb71
[ 2.932107] Hardware name: ASUSTeK COMPUTER INC. ROG Zephyrus G14 GA401QM_GA401QM/GA401QM, BIOS GA401QM.412 08/30/2022
[ 2.932109] RIP: 0010:nvif_object_mthd+0xbc/0x200 [nouveau]
[ 2.932168] Code: af e1 41 8d 56 20 49 8b 44 24 08 83 fa 17 0f 86 33 01 00 00 4c 39 e0 0f 84 e8 00 00 00 4c 89 63 10 31 c9 48 89 de c6 43 06 ff <48> 8b 78 20 48 8b 40 38 48 8b 40 28 e8 e3 c7 ec e1 48 8b 3c 24 4c
[ 2.932170] RSP: 0018:ffffbdcd41597628 EFLAGS: 00010246
[ 2.932172] RAX: 0000000000000000 RBX: ffffbdcd41597630 RCX: 0000000000000000
[ 2.932173] RDX: 0000000000000028 RSI: ffffbdcd41597630 RDI: ffffbdcd41597658
[ 2.932174] RBP: ffff9d3f8baf9000 R08: ffffbdcd41597880 R09: ffff9d3f8554f3e8
[ 2.932176] R10: ffff9d3f83fbad00 R11: ffffbdcd43865fff R12: ffff9d3f8f754508
[ 2.932177] R13: ffffbdcd41597630 R14: 0000000000000008 R15: ffffbdcd41597650
[ 2.932178] FS: 00007ff205d1a200(0000) GS:ffff9d4456840000(0000) knlGS:0000000000000000
[ 2.932180] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 2.932181] CR2: 0000000000000020 CR3: 000000010b08e000 CR4: 0000000000750ee0
[ 2.932183] PKRU: 55555554
[ 2.932184] Call Trace:
[ 2.932185]
JOURNAL
Jul 19 18:34:58 ayasumi kernel: ACPI Warning: _SB.PCI0.GPP0.PEGP._DSM: Argument #4 type mismatch - Found [Buffer], ACPI requires [Package] (20220331/nsarguments-61)
Jul 19 18:34:58 ayasumi systemd-udevd[357]: 0000:01:00.0: Worker [366] terminated by signal 9 (KILL).
Jul 19 18:34:58 ayasumi kernel: BUG: kernel NULL pointer dereference, address: 0000000000000020
Jul 19 18:34:58 ayasumi kernel: #PF: supervisor read access in kernel mode
Jul 19 18:34:58 ayasumi kernel: #PF: error_code(0x0000) - not-present page
Jul 19 18:34:58 ayasumi kernel: PGD 0 P4D 0
Jul 19 18:34:58 ayasumi kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
Jul 19 18:34:58 ayasumi kernel: CPU: 4 PID: 366 Comm: (udev-worker) Not tainted 6.1.38-2-lts #1 68b6d871287d3448dd49d65f1f674cec627eeb71
Jul 19 18:34:58 ayasumi kernel: Hardware name: ASUSTeK COMPUTER INC. ROG Zephyrus G14 GA401QM_GA401QM/GA401QM, BIOS GA401QM.412 08/30/2022
Jul 19 18:34:58 ayasumi kernel: RIP: 0010:nvif_object_mthd+0xbc/0x200 [nouveau]
Jul 19 18:34:58 ayasumi kernel: Code: 1e c8 41 8d 56 20 49 8b 44 24 08 83 fa 17 0f 86 33 01 00 00 4c 39 e0 0f 84 e8 00 00 00 4c 89 63 10 31 c9 48 89 de c6 43 06 ff <48> 8b 78 20 48 8b 40 38 48 8b 40 28 e8 e3 87 5b c8 48 8b 3c 24 4c
Jul 19 18:34:58 ayasumi kernel: RSP: 0018:ffffae998380f6b8 EFLAGS: 00010246
Jul 19 18:34:58 ayasumi kernel: RAX: 0000000000000000 RBX: ffffae998380f6c0 RCX: 0000000000000000
Jul 19 18:34:58 ayasumi kernel: RDX: 0000000000000028 RSI: ffffae998380f6c0 RDI: ffffae998380f6e8
Jul 19 18:34:58 ayasumi kernel: RBP: ffff9aa30b96b800 R08: ffffae998380f908 R09: ffff9aa305225ca8
Jul 19 18:34:58 ayasumi kernel: R10: ffff9aa305886340 R11: ffffae9980902fff R12: ffff9aa31a338508
Jul 19 18:34:58 ayasumi kernel: R13: ffffae998380f6c0 R14: 0000000000000008 R15: ffffae998380f6e0
Jul 19 18:34:58 ayasumi kernel: FS: 00007f83face8200(0000) GS:ffff9aa7d6700000(0000) knlGS:0000000000000000
Jul 19 18:34:58 ayasumi kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 19 18:34:58 ayasumi kernel: CR2: 0000000000000020 CR3: 000000010ae84000 CR4: 0000000000750ee0
Jul 19 18:34:58 ayasumi kernel: PKRU: 55555554
Jul 19 18:34:58 ayasumi kernel: Call Trace:
Jul 19 18:34:58 ayasumi kernel:
Jul 19 18:34:58 ayasumi kernel: ? __die_body.cold+0x1a/0x1f
Jul 19 18:34:58 ayasumi kernel: ? page_fault_oops+0x15a/0x2d0
Jul 19 18:34:58 ayasumi kernel: ? nvkm_timer_wait_test+0x21/0x80 [nouveau f460d3c4199058cc72ce99550335a19c68719c5e]
Jul 19 18:34:58 ayasumi kernel: ? exc_page_fault+0x7c/0x180
Jul 19 18:34:58 ayasumi kernel: ? asm_exc_page_fault+0x26/0x30
Jul 19 18:34:58 ayasumi kernel: ? nvif_object_mthd+0xbc/0x200 [nouveau f460d3c4199058cc72ce99550335a19c68719c5e]
Jul 19 18:34:58 ayasumi kernel: ? nvif_object_mthd+0x142/0x200 [nouveau f460d3c4199058cc72ce99550335a19c68719c5e]
Jul 19 18:34:58 ayasumi kernel: nvif_conn_hpd_status+0x39/0xf0 [nouveau f460d3c4199058cc72ce99550335a19c68719c5e]
Jul 19 18:34:58 ayasumi kernel: nouveau_dp_detect+0x86/0x410 [nouveau f460d3c4199058cc72ce99550335a19c68719c5e]
Jul 19 18:34:58 ayasumi kernel: nouveau_connector_detect+0xa4/0x560 [nouveau f460d3c4199058cc72ce99550335a19c68719c5e]
Jul 19 18:34:58 ayasumi kernel: drm_helper_probe_detect+0x88/0xb0
Jul 19 18:34:58 ayasumi kernel: drm_helper_probe_single_connector_modes+0x353/0x530
Jul 19 18:34:58 ayasumi kernel: ? __kmem_cache_alloc_node+0x1a5/0x2d0
Jul 19 18:34:58 ayasumi kernel: drm_client_modeset_probe+0x247/0x14e0
Jul 19 18:34:58 ayasumi kernel: ? nouveau_cli_init+0x377/0x430 [nouveau f460d3c4199058cc72ce99550335a19c68719c5e]
Jul 19 18:34:58 ayasumi kernel: ? __pm_runtime_suspend+0x6e/0x100
Jul 19 18:34:58 ayasumi kernel: __drm_fb_helper_initial_config_and_unlock+0x44/0x4e0
Jul 19 18:34:58 ayasumi kernel: ? drm_client_init+0x11a/0x180
Jul 19 18:34:58 ayasumi kernel: nouveau_fbcon_init+0x14e/0x1c0 [nouveau f460d3c4199058cc72ce99550335a19c68719c5e]
Jul 19 18:34:58 ayasumi kernel: nouveau_drm_device_init+0x1fc/0x790 [nouveau f460d3c4199058cc72ce99550335a19c68719c5e]
Jul 19 18:34:58 ayasumi kernel: ? pci_update_current_state+0x72/0xb0
Jul 19 18:34:58 ayasumi kernel: nouveau_drm_probe+0x12c/0x1f0 [nouveau f460d3c4199058cc72ce99550335a19c68719c5e]
Jul 19 18:34:58 ayasumi kernel: local_pci_probe+0x45/0x80
Jul 19 18:34:58 ayasumi kernel: pci_device_probe+0xc1/0x250
Jul 19 18:34:58 ayasumi kernel: ? sysfs_do_create_link_sd+0x6e/0xe0
Jul 19 18:34:58 ayasumi kernel: really_probe+0xde/0x380
Jul 19 18:34:58 ayasumi kernel: ? pm_runtime_barrier+0x54/0x90
Jul 19 18:34:58 ayasumi kernel: __driver_probe_device+0x78/0x120
Jul 19 18:34:58 ayasumi kernel: driver_probe_device+0x1f/0x90
Jul 19 18:34:58 ayasumi kernel: __driver_attach+0xd2/0x1c0
Jul 19 18:34:58 ayasumi kernel: ? __device_attach_driver+0x110/0x110
Jul 19 18:34:58 ayasumi kernel: bus_for_each_dev+0x8b/0xd0
Jul 19 18:34:58 ayasumi kernel: bus_add_driver+0x1b2/0x200
Jul 19 18:34:58 ayasumi kernel: driver_register+0x8d/0xe0
Jul 19 18:34:58 ayasumi kernel: ? 0xffffffffc2531000
Jul 19 18:34:58 ayasumi kernel: do_one_initcall+0x5d/0x230
Jul 19 18:34:58 ayasumi kernel: do_init_module+0x4a/0x1e0
Jul 19 18:34:58 ayasumi kernel: __do_sys_init_module+0x17f/0x1b0
Jul 19 18:34:58 ayasumi kernel: do_syscall_64+0x60/0x90
Jul 19 18:34:58 ayasumi kernel: ? exc_page_fault+0x7c/0x180
Jul 19 18:34:58 ayasumi kernel: entry_SYSCALL_64_after_hwframe+0x63/0xcd
Jul 19 18:34:58 ayasumi kernel: RIP: 0033:0x7f83fb721f9e
Jul 19 18:34:58 ayasumi kernel: Code: 48 8b 0d bd ed 0c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca b8 af 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 8a ed 0c 00 f7 d8 64 89 01 48
Jul 19 18:34:58 ayasumi kernel: RSP: 002b:00007ffd10966ed8 EFLAGS: 00000246 ORIG_RAX: 00000000000000af
Jul 19 18:34:58 ayasumi kernel: RAX: ffffffffffffffda RBX: 000055628eec90e0 RCX: 00007f83fb721f9e
Jul 19 18:34:58 ayasumi kernel: RDX: 00007f83fbc81343 RSI: 00000000005591d8 RDI: 00007f83f8d1e010
Jul 19 18:34:58 ayasumi kernel: RBP: 00007f83fbc81343 R08: 000000000057a000 R09: 0000000000000000
Jul 19 18:34:58 ayasumi kernel: R10: 000000000003e1c1 R11: 0000000000000246 R12: 0000000000020000
Jul 19 18:34:58 ayasumi kernel: R13: 000055628eed5960 R14: 000055628eec90e0 R15: 000055628ee8fa60
Jul 19 18:34:58 ayasumi kernel:
Jul 19 18:34:58 ayasumi kernel: Modules linked in: amdgpu(+) mac80211(+) snd_acp_pci snd_intel_dspcfg kvm_amd(+) snd_pci_acp6x libarc4 asus_nb_wmi snd_intel_sdw_acpi nouveau(+) kvm snd_hda_coec snd_pci_acp5x asus_wmi gpu_sched irqbypass cfg80211 mxm_wmi snd_hda_core drm_buddy snd>
Jul 19 18:34:58 ayasumi kernel: CR2: 0000000000000020
Jul 19 18:34:58 ayasumi kernel: ---[ end trace 0000000000000000 ]---
Jul 19 18:34:58 ayasumi kernel: RIP: 0010:nvif_object_mthd+0xbc/0x200 [nouveau]
Jul 19 18:34:58 ayasumi kernel: Code: 1e c8 41 8d 56 20 49 8b 44 24 08 83 fa 17 0f 86 33 01 00 00 4c 39 e0 0f 84 e8 00 00 00 4c 89 63 10 31 c9 48 89 de c6 43 06 ff <48> 8b 78 20 48 8b 40 38 48 8b 40 28 e8 e3 87 5b c8 48 8b 3c 24 4c
Jul 19 18:34:58 ayasumi kernel: RSP: 0018:ffffae998380f6b8 EFLAGS: 00010246
Jul 19 18:34:58 ayasumi kernel: RAX: 0000000000000000 RBX: ffffae998380f6c0 RCX: 0000000000000000
Jul 19 18:34:58 ayasumi kernel: RDX: 0000000000000028 RSI: ffffae998380f6c0 RDI: ffffae998380f6e8
Jul 19 18:34:58 ayasumi kernel: RBP: ffff9aa30b96b800 R08: ffffae998380f908 R09: ffff9aa305225ca8
Jul 19 18:34:58 ayasumi kernel: R10: ffff9aa305886340 R11: ffffae9980902fff R12: ffff9aa31a338508
Jul 19 18:34:58 ayasumi kernel: R13: ffffae998380f6c0 R14: 0000000000000008 R15: ffffae998380f6e0
Jul 19 18:34:58 ayasumi kernel: FS: 00007f83face8200(0000) GS:ffff9aa7d6700000(0000) knlGS:0000000000000000
Jul 19 18:34:58 ayasumi kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 19 18:34:58 ayasumi kernel: CR2: 0000000000000020 CR3: 000000010ae84000 CR4: 0000000000750ee0
Jul 19 18:34:58 ayasumi kernel: PKRU: 55555554
Jul 19 18:34:58 ayasumi kernel: note: (udev-worker)[366] exited with irqs disabled