Bug report: 455.23.04 - Kernel Panic due to NULL pointer dereference

Crash still happening with the patched drivers at 455.23.04: Page allocation failure in kernel module at random points
I can confirm it has happened at least 2 more times with the patched drivers. I just hard crashed with these drivers and could neither ssh in nor switch to a TTY.

I recently did a full update, in good faith, to nvidia-dkms 455.45.01-1 on Arch Linux 5.9.12-zen1-1-zen. As of 2020/12/08 it’s been 75 days since the first official report on 2020/08/24 @ 455.23.04: Page allocation failure in kernel module at random points.

This issue is definitely still not fixed and still affecting people. The freezes today are not fully hard and I could go out to a TTY. In fact while typing this message I’ve had it freeze another 2 times both recoverable from a TTY.

I have included 3 bug reports:
0: right after my display froze and I went to TTY2 to run “sudo nvidia-bug-report.sh”
nvidia-bug-report0.log.gz (340.8 KB)

1: while typing up v1 of this post (good thing the website saves posts) I had the screen freeze again. I went to TTY2 to capture this, then ran “systemctl restart sddm”.
nvidia-bug-report1.log.gz (502.4 KB)

2: It froze again while typing this current post. This time I didn’t have to restart sddm; it just worked right after switching back to TTY7 from TTY2.
nvidia-bug-report2.log.gz (555.0 KB)

For some reason, once it happens it likes to keep happening. I’ve just experienced another 2 soft freezes on top of the 3 previous reports (recovered from TTY2 with no sddm restart) while finishing up this post’s formatting.

nvidia-bug-report3.log.gz (608.5 KB)

Maybe do something with the record $3B gross this quarter (enough to hire 15000*4 devs at 200k per year) and actually fix some stuff?

And to top off my post I’ve had 2 other soft freezes while typing out my napkin maths.
nvidia-bug-report4.log.gz (660.9 KB)

fix. your. drivers.

With 455.45.01, the freeze is triggered by watching DVB TV through a TV tuner in VLC with VDPAU enabled. Without VDPAU, e.g. using OpenGL video output in VLC, no freeze is observed. Many Bothans died of boredom while being forced to watch US daytime TV to bring you this information.

This bug report was gathered after a freeze and before a reboot. Even when running with the additional args (sudo nvidia-bug-report.sh --safe-mode --extra-system-data), a few lines of the bug report script had to be commented out for the report to complete.

nvidia-bug-report.log.gz (90.9 KB)

The GTX 760 at PCI 28:00.0 is being passed through via VFIO, so isn’t being handled by the nvidia driver.
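For anyone who wants to double-check which kernel driver a card is bound to, here is a minimal Python sketch. It relies on the sysfs convention that each PCI device directory has a `driver` symlink pointing at the bound driver. The function name `bound_driver` is made up for this example, and the full address `0000:28:00.0` assumes PCI domain 0000, which the post does not state.

```python
import os


def bound_driver(pci_address, sysfs_root="/sys/bus/pci/devices"):
    """Return the name of the kernel driver bound to a PCI device, or None.

    pci_address is the full address including the domain, e.g. "0000:28:00.0".
    """
    link = os.path.join(sysfs_root, pci_address, "driver")
    if not os.path.islink(link):
        # No driver bound, or no such device.
        return None
    # The symlink target ends in the driver name, e.g. .../drivers/vfio-pci
    return os.path.basename(os.readlink(link))


if __name__ == "__main__":
    # Assumed address: the 28:00.0 from the post with domain 0000 prepended.
    print(bound_driver("0000:28:00.0"))
```

On a passthrough setup like the one described, this would be expected to report `vfio-pci` rather than `nvidia` for the passed-through card.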

@yuannan I peeked at your nvidia-bug-report0.log.gz and it is not a NULL pointer dereference bug, it’s a page allocation failure bug, which leads me to suspect that maybe you didn’t apply the patch when you compiled the kernel module. Did DKMS display a message that it was applying the patch? Or better yet, look at the nvkms_alloc disassembly to verify that the allocation size is compared against 4096 (which is what the patch changed):

  1. Find the compiled nvidia-modeset kernel module. On my system it is here: /lib/modules/<kernel-version>/updates/dkms/nvidia-modeset.ko. Note that the kernel version must match the kernel you are running.
  2. Disassemble it with objdump -S nvidia-modeset.ko >nvidia-modeset.S
  3. In nvidia-modeset.S, search for the nvkms_alloc function.
  4. In its initial instructions, there will be a cmp $0x1000,%rdi. Here, 0x1000 is 4096, so the patch is applied. The %rdi register may be different, if the compiler generated the code differently in your case. If it says 0x20000 then the patch is not applied.
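The manual check in steps 2 to 4 can also be scripted. Below is a minimal Python sketch that scans objdump output for the nvkms_alloc function and reports the immediate of the first cmp it finds. The helper names `first_cmp_immediate` and `patch_status`, and the sample disassembly, are made up for illustration. It assumes the AT&T syntax that objdump prints by default; the real instructions in your module may look different.

```python
import re

# Matches a function header such as "0000000000000010 <nvkms_alloc>:"
HEADER = re.compile(r"^[0-9a-f]+ <([^>]+)>:\s*$")
# Matches "cmp $0x1000,%rdi" (any register, optional q suffix)
CMP_IMM = re.compile(r"\bcmpq?\s+\$0x([0-9a-f]+),")


def first_cmp_immediate(disassembly, function="nvkms_alloc"):
    """Return the immediate of the first cmp inside `function`, or None."""
    in_function = False
    for line in disassembly.splitlines():
        header = HEADER.match(line.strip())
        if header:
            if in_function:
                break  # reached the next function without finding a cmp
            in_function = header.group(1) == function
            continue
        if in_function:
            cmp_match = CMP_IMM.search(line)
            if cmp_match:
                return int(cmp_match.group(1), 16)
    return None


def patch_status(immediate):
    """Interpret the immediate using the two values named in the steps above."""
    if immediate == 0x1000:
        return "patch applied"
    if immediate == 0x20000:
        return "patch not applied"
    return "could not tell"


# Hypothetical excerpt, only to show the expected input shape.
SAMPLE = """
0000000000000000 <nvkms_free>:
   0:   e9 00 00 00 00          jmp    5 <nvkms_free+0x5>

0000000000000010 <nvkms_alloc>:
  10:   48 81 ff 00 10 00 00    cmp    $0x1000,%rdi
  17:   77 07                   ja     20 <nvkms_alloc+0x10>
"""

if __name__ == "__main__":
    print(patch_status(first_cmp_immediate(SAMPLE)))
```

To run it against a real module, read the nvidia-modeset.S file produced in step 2 and pass its contents to `first_cmp_immediate` instead of `SAMPLE`.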

I am now hitting this bug with 455.45.01, with the patch from the other thread applied and confirmed in the kernel module as Lastique outlined above. It is triggered by watching Kodi with VDPAU. It also happened with the previous 450.80.02 driver.

I have attached my bug report log but, as others have said, the script simply would not complete without commenting out certain lines. Also, I have most debug and coredump functionality disabled in my kernel, so the log may not be much help.

What I do find interesting is that I was using 450.80.02 with kernel 5.9.0 for a couple of months without hitting the bug once. Then, 1 day after installing a slew of updates to my system (excluding the kernel and nvidia, which I had left unchanged at that point), I first hit this bug. That led me to think it wasn’t directly related to either of those but to some other package I had updated. I could not see any likely candidates, though, and since I am no expert this could all have been coincidental.

Anyway, this bug now recurs for me with both 450.80.02 and 455.45.01, with and without the patch, on any 5.9.x kernel. I will have to restore a backup and/or go back to the LTS kernel and do some more testing. If there is any other way to help diagnose and get this resolved I would be happy to hear any suggestion.

nvidia-bug-report.log.gz (68.8 KB)

nvidia-bug-report.sh hangs with every option I tried, but it generates some logs anyway.
nvidia-bug-report.log.gz (50.7 KB) nvidia-bug-report2.log.gz (1.2 KB)

Just had my system crash because of what I believe to be this issue. Also had a similar crash yesterday. Both times I had my browser (Brave, a Chromium derivative) open and I was unable to switch into a TTY. Had to reboot the system by raising the elephant (SysRQ). My card is a GeForce GTX 1060 3GB from Gigabyte.

This is the output of uname -a:

Linux jonasdesktop 5.9.14-arch1-1 #1 SMP PREEMPT Sat, 12 Dec 2020 14:37:12 +0000 x86_64 GNU/Linux

Here’s the error message from the kernel:

Dez 20 11:31:10 jonasdesktop kernel: BUG: kernel NULL pointer dereference, address: 0000000000000020
Dez 20 11:31:10 jonasdesktop kernel: #PF: supervisor read access in kernel mode
Dez 20 11:31:10 jonasdesktop kernel: #PF: error_code(0x0000) - not-present page
Dez 20 11:31:10 jonasdesktop kernel: PGD 800000064ff43067 P4D 800000064ff43067 PUD 0 
Dez 20 11:31:10 jonasdesktop kernel: Oops: 0000 [#1] PREEMPT SMP PTI
Dez 20 11:31:10 jonasdesktop kernel: CPU: 0 PID: 571 Comm: irq/127-nvidia Tainted: P           OE     5.9.14-arch1-1 #1
Dez 20 11:31:10 jonasdesktop kernel: Hardware name: MSI MS-7982/B150M PRO-VDH (MS-7982), BIOS 3.H0 07/10/2018
Dez 20 11:31:10 jonasdesktop kernel: RIP: 0010:_nv027527rm+0x9/0x90 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel: Code: 90 ff e8 ea b0 00 00 31 c0 48 83 c4 08 c3 31 c0 eb bf 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 83 ec 08 48 85 ff 74 57 <48> 8b 17 31 c0 48 85 d2 75 0e eb 2b 0f 1f 00 48 8b 52 10 48 85 d2
Dez 20 11:31:10 jonasdesktop kernel: RSP: 0000:ffffa52f40b07be0 EFLAGS: 00010202
Dez 20 11:31:10 jonasdesktop kernel: RAX: 0000000000000020 RBX: 0000000000000020 RCX: 0000000000000010
Dez 20 11:31:10 jonasdesktop kernel: RDX: ffff9220953e2808 RSI: ffffffffffffffff RDI: 0000000000000020
Dez 20 11:31:10 jonasdesktop kernel: RBP: ffff9220a927d940 R08: ffffffffc276d530 R09: ffff9220a927d920
Dez 20 11:31:10 jonasdesktop kernel: R10: ffffffffc13b8820 R11: ffff9220d01ab808 R12: 0000000000000020
Dez 20 11:31:10 jonasdesktop kernel: R13: 0000000000000000 R14: ffff9220a927daa8 R15: ffff9220a927dbb0
Dez 20 11:31:10 jonasdesktop kernel: FS:  0000000000000000(0000) GS:ffff9220d5c00000(0000) knlGS:0000000000000000
Dez 20 11:31:10 jonasdesktop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dez 20 11:31:10 jonasdesktop kernel: CR2: 0000000000000020 CR3: 000000064876c005 CR4: 00000000003706f0
Dez 20 11:31:10 jonasdesktop kernel: Call Trace:
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv029950rm+0x1b/0x90 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv025474rm+0x18/0x60 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv011691rm+0x13d/0x1c0 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv000083rm+0x12f/0x1a0 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv011619rm+0xff/0x180 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv018449rm+0x1af/0x210 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv018389rm+0xd9a/0xe90 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv018390rm+0xde/0x260 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv018356rm+0x72/0xc0 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv018370rm+0x235/0x2d0 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv026076rm+0x10/0x10 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv018403rm+0xac/0xe0 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv027734rm+0x820/0xdc0 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv007566rm+0x155/0x270 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv027742rm+0x8d/0x180 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? _nv000712rm+0xa9/0x200 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? disable_irq_nosync+0x10/0x10
Dez 20 11:31:10 jonasdesktop kernel:  ? rm_isr_bh+0x1c/0x60 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? nvidia_isr_kthread_bh+0x1b/0x40 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? irq_thread_fn+0x20/0x60
Dez 20 11:31:10 jonasdesktop kernel:  ? irq_thread+0xf5/0x1a0
Dez 20 11:31:10 jonasdesktop kernel:  ? irq_finalize_oneshot.part.0+0xe0/0xe0
Dez 20 11:31:10 jonasdesktop kernel:  ? irq_thread_check_affinity+0xd0/0xd0
Dez 20 11:31:10 jonasdesktop kernel:  ? kthread+0x142/0x160
Dez 20 11:31:10 jonasdesktop kernel:  ? __kthread_bind_mask+0x60/0x60
Dez 20 11:31:10 jonasdesktop kernel:  ? ret_from_fork+0x22/0x30
Dez 20 11:31:10 jonasdesktop kernel:  ? rm_isr_bh+0x1c/0x60 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? nvidia_isr_kthread_bh+0x1b/0x40 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel:  ? irq_thread_fn+0x20/0x60
Dez 20 11:31:10 jonasdesktop kernel:  ? irq_thread+0xf5/0x1a0
Dez 20 11:31:10 jonasdesktop kernel:  ? irq_finalize_oneshot.part.0+0xe0/0xe0
Dez 20 11:31:10 jonasdesktop kernel:  ? irq_thread_check_affinity+0xd0/0xd0
Dez 20 11:31:10 jonasdesktop kernel:  ? kthread+0x142/0x160
Dez 20 11:31:10 jonasdesktop kernel:  ? __kthread_bind_mask+0x60/0x60
Dez 20 11:31:10 jonasdesktop kernel:  ? ret_from_fork+0x22/0x30
Dez 20 11:31:10 jonasdesktop kernel: Modules linked in: rfcomm veth xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c br_netfilter bridge stp>
Dez 20 11:31:10 jonasdesktop kernel:  mdio_devres glue_helper rapl snd_hda_core ecdh_generic intel_cstate of_mdio fixed_phy intel_uncore snd_hwdep rfkill pcspkr drm_kms_helper ecc i2c_i801 libphy i2c_smbus snd_pcm cec tpm_crb intel_lpss_pci snd_timer rc_core snd sysco>
Dez 20 11:31:10 jonasdesktop kernel: CR2: 0000000000000020
Dez 20 11:31:10 jonasdesktop kernel: ---[ end trace 27edec6ea959a89f ]---
Dez 20 11:31:10 jonasdesktop kernel: RIP: 0010:_nv027527rm+0x9/0x90 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel: Code: 90 ff e8 ea b0 00 00 31 c0 48 83 c4 08 c3 31 c0 eb bf 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 83 ec 08 48 85 ff 74 57 <48> 8b 17 31 c0 48 85 d2 75 0e eb 2b 0f 1f 00 48 8b 52 10 48 85 d2
Dez 20 11:31:10 jonasdesktop kernel: RSP: 0000:ffffa52f40b07be0 EFLAGS: 00010202
Dez 20 11:31:10 jonasdesktop kernel: RAX: 0000000000000020 RBX: 0000000000000020 RCX: 0000000000000010
Dez 20 11:31:10 jonasdesktop kernel: RDX: ffff9220953e2808 RSI: ffffffffffffffff RDI: 0000000000000020
Dez 20 11:31:10 jonasdesktop kernel: RBP: ffff9220a927d940 R08: ffffffffc276d530 R09: ffff9220a927d920
Dez 20 11:31:10 jonasdesktop kernel: R10: ffffffffc13b8820 R11: ffff9220d01ab808 R12: 0000000000000020
Dez 20 11:31:10 jonasdesktop kernel: R13: 0000000000000000 R14: ffff9220a927daa8 R15: ffff9220a927dbb0
Dez 20 11:31:10 jonasdesktop kernel: FS:  0000000000000000(0000) GS:ffff9220d5c00000(0000) knlGS:0000000000000000
Dez 20 11:31:10 jonasdesktop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dez 20 11:31:10 jonasdesktop kernel: CR2: 0000000000000020 CR3: 000000064876c005 CR4: 00000000003706f0
Dez 20 11:31:10 jonasdesktop kernel: BUG: kernel NULL pointer dereference, address: 0000000000000930
Dez 20 11:31:10 jonasdesktop kernel: #PF: supervisor write access in kernel mode
Dez 20 11:31:10 jonasdesktop kernel: #PF: error_code(0x0002) - not-present page
Dez 20 11:31:10 jonasdesktop kernel: PGD 800000064ff43067 P4D 800000064ff43067 PUD 0
Dez 20 11:31:10 jonasdesktop kernel: Oops: 0002 [#2] PREEMPT SMP PTI
Dez 20 11:31:10 jonasdesktop kernel: CPU: 0 PID: 571 Comm: irq/127-nvidia Tainted: P      D    OE     5.9.14-arch1-1 #1
Dez 20 11:31:10 jonasdesktop kernel: Hardware name: MSI MS-7982/B150M PRO-VDH (MS-7982), BIOS 3.H0 07/10/2018
Dez 20 11:31:10 jonasdesktop kernel: RIP: 0010:mutex_lock+0x10/0x20
Dez 20 11:31:10 jonasdesktop kernel: Code: 03 31 c0 c3 eb d4 0f 1f 40 00 0f 1f 44 00 00 be 02 00 00 00 e9 61 fa ff ff 90 0f 1f 44 00 00 31 c0 65 48 8b 14 25 c0 7b 01 00 <f0> 48 0f b1 17 75 01 c3 eb d6 66 0f 1f 44 00 00 0f 1f 44 00 00 41
Dez 20 11:31:10 jonasdesktop kernel: RSP: 0000:ffffa52f40b07e30 EFLAGS: 00010246
Dez 20 11:31:10 jonasdesktop kernel: RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
Dez 20 11:31:10 jonasdesktop kernel: RDX: ffff9220cd2b0000 RSI: 0000000000000000 RDI: 0000000000000930
Dez 20 11:31:10 jonasdesktop kernel: RBP: 0000000000000930 R08: 000000000000000f R09: 0000000000000000
Dez 20 11:31:10 jonasdesktop kernel: R10: ffff9220a9c5d800 R11: ffffa52f40b07801 R12: ffff9220cd2b07cc
Dez 20 11:31:10 jonasdesktop kernel: R13: 0000000000000000 R14: 0000000000000001 R15: ffff9220cd2b0000
Dez 20 11:31:10 jonasdesktop kernel: FS:  0000000000000000(0000) GS:ffff9220d5c00000(0000) knlGS:0000000000000000
Dez 20 11:31:10 jonasdesktop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dez 20 11:31:10 jonasdesktop kernel: CR2: 0000000000000930 CR3: 000000064876c005 CR4: 00000000003706f0
Dez 20 11:31:10 jonasdesktop kernel: Call Trace:
Dez 20 11:31:10 jonasdesktop kernel:  perf_event_exit_task+0x30/0x440
Dez 20 11:31:10 jonasdesktop kernel:  ? put_cpu_partial+0x92/0x140
Dez 20 11:31:10 jonasdesktop kernel:  ? kfree+0x40f/0x440
Dez 20 11:31:10 jonasdesktop kernel:  do_exit+0x37f/0xaa0
Dez 20 11:31:10 jonasdesktop kernel:  ? task_work_run+0x5c/0x90
Dez 20 11:31:10 jonasdesktop kernel:  ? do_exit+0x36f/0xaa0
Dez 20 11:31:10 jonasdesktop kernel:  ? kthread+0x142/0x160
Dez 20 11:31:10 jonasdesktop kernel:  ? rewind_stack_do_exit+0x17/0x17
Dez 20 11:31:10 jonasdesktop kernel: Modules linked in: rfcomm veth xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c br_netfilter bridge stp>
Dez 20 11:31:10 jonasdesktop kernel:  mdio_devres glue_helper rapl snd_hda_core ecdh_generic intel_cstate of_mdio fixed_phy intel_uncore snd_hwdep rfkill pcspkr drm_kms_helper ecc i2c_i801 libphy i2c_smbus snd_pcm cec tpm_crb intel_lpss_pci snd_timer rc_core snd sysco>
Dez 20 11:31:10 jonasdesktop kernel: CR2: 0000000000000930
Dez 20 11:31:10 jonasdesktop kernel: ---[ end trace 27edec6ea959a8a0 ]---
Dez 20 11:31:10 jonasdesktop kernel: RIP: 0010:_nv027527rm+0x9/0x90 [nvidia]
Dez 20 11:31:10 jonasdesktop kernel: Code: 90 ff e8 ea b0 00 00 31 c0 48 83 c4 08 c3 31 c0 eb bf 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 83 ec 08 48 85 ff 74 57 <48> 8b 17 31 c0 48 85 d2 75 0e eb 2b 0f 1f 00 48 8b 52 10 48 85 d2
Dez 20 11:31:10 jonasdesktop kernel: RSP: 0000:ffffa52f40b07be0 EFLAGS: 00010202
Dez 20 11:31:10 jonasdesktop kernel: RAX: 0000000000000020 RBX: 0000000000000020 RCX: 0000000000000010
Dez 20 11:31:10 jonasdesktop kernel: RDX: ffff9220953e2808 RSI: ffffffffffffffff RDI: 0000000000000020
Dez 20 11:31:10 jonasdesktop kernel: RBP: ffff9220a927d940 R08: ffffffffc276d530 R09: ffff9220a927d920
Dez 20 11:31:10 jonasdesktop kernel: R10: ffffffffc13b8820 R11: ffff9220d01ab808 R12: 0000000000000020
Dez 20 11:31:10 jonasdesktop kernel: R13: 0000000000000000 R14: ffff9220a927daa8 R15: ffff9220a927dbb0
Dez 20 11:31:10 jonasdesktop kernel: FS:  0000000000000000(0000) GS:ffff9220d5c00000(0000) knlGS:0000000000000000
Dez 20 11:31:10 jonasdesktop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dez 20 11:31:10 jonasdesktop kernel: CR2: 0000000000000930 CR3: 000000064876c005 CR4: 00000000003706f0
Dez 20 11:31:10 jonasdesktop kernel: Fixing recursive fault but reboot is needed!
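For readers less used to oops output, here is a short Python sketch of how the Code: line above can be read. The byte the kernel wraps in angle brackets is the first byte of the faulting instruction. The helper name `split_code_line` is made up for this example; the byte string is copied from the log above.

```python
def split_code_line(code):
    """Split an oops 'Code:' byte string at the <xx> marker.

    Returns (bytes before the faulting instruction, bytes from it onwards).
    """
    before, after = [], []
    seen_marker = False
    for token in code.split():
        if token.startswith("<") and token.endswith(">"):
            seen_marker = True
            token = token[1:-1]
        (after if seen_marker else before).append(int(token, 16))
    return bytes(before), bytes(after)


CODE = (
    "90 ff e8 ea b0 00 00 31 c0 48 83 c4 08 c3 31 c0 eb bf 66 2e 0f 1f 84 "
    "00 00 00 00 00 0f 1f 44 00 00 48 83 ec 08 48 85 ff 74 57 <48> 8b 17 "
    "31 c0 48 85 d2 75 0e eb 2b 0f 1f 00 48 8b 52 10 48 85 d2"
)

before, after = split_code_line(CODE)

# RIP is _nv027527rm+0x9, so the function starts 9 bytes before the marker.
prologue = before[-9:]

if __name__ == "__main__":
    print("function start:", prologue.hex(" "))
    print("faulting bytes:", after[:3].hex(" "))
```

Decoded by hand, the 9 bytes before the marker are `sub $0x8,%rsp`, `test %rdi,%rdi` and a `je`, and the marked instruction `48 8b 17` is `mov (%rdi),%rdx`. One reading of this is that the function checks its argument for NULL before loading through it, but RDI is 0x20 rather than 0, so the check passes and the load faults, which matches CR2 = 0x20. That is only an interpretation of the bytes, not something confirmed by NVIDIA.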

I am also attaching the bug report log, but the error message is not included since I created the file after the reboot and the script seems to only include the events since the last boot.
nvidia-bug-report.log.gz (297.2 KB)

EDIT: Code block contained the wrong error message. (Just noticed that I have many occurrences of the NVIDIA driver causing NULL pointer dereference errors in my log - the oldest one is from October 18th)
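Since the oops is missing from a bug report generated after a reboot, the kernel messages have to come from the journal instead; on systems with a persistent journal, `journalctl -k -b -1` prints the kernel log of the previous boot. Below is a minimal Python sketch for picking the driver's NULL pointer dereferences out of such text. The function name `nvidia_null_derefs` is made up for this example, and the sample lines are copied from the log above. It assumes each oops header is followed by its RIP line, as in that log.

```python
import re

OOPS = re.compile(r"BUG: kernel NULL pointer dereference, address: ([0-9a-f]+)")
# The module name in brackets is only present when RIP is inside a module.
RIP = re.compile(r"RIP: [0-9a-f]+:(\S+)(?: \[(\w+)\])?")


def nvidia_null_derefs(lines):
    """Return (log prefix, fault address, symbol) for oopses with RIP in [nvidia]."""
    found = []
    pending = None
    for line in lines:
        oops = OOPS.search(line)
        if oops:
            prefix = line.split(" kernel:")[0]
            pending = (prefix, int(oops.group(1), 16))
            continue
        rip = RIP.search(line)
        if rip and pending is not None:
            if rip.group(2) == "nvidia":
                found.append(pending + (rip.group(1),))
            pending = None  # only the first RIP after the header counts
    return found


SAMPLE = [
    "Dez 20 11:31:10 jonasdesktop kernel: BUG: kernel NULL pointer dereference, address: 0000000000000020",
    "Dez 20 11:31:10 jonasdesktop kernel: RIP: 0010:_nv027527rm+0x9/0x90 [nvidia]",
    "Dez 20 11:31:10 jonasdesktop kernel: BUG: kernel NULL pointer dereference, address: 0000000000000930",
    "Dez 20 11:31:10 jonasdesktop kernel: RIP: 0010:mutex_lock+0x10/0x20",
]

if __name__ == "__main__":
    for prefix, address, symbol in nvidia_null_derefs(SAMPLE):
        print(prefix, hex(address), symbol)
```

The follow-up fault in mutex_lock is skipped on purpose, because its RIP is not inside the nvidia module.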

I have the same problem. My kernel version is 5.9.11-3-MANJARO. Driver and card version:

 mhwd -l -d
--------------------------------------------------------------------------------
> PCI Device: /devices/pci0000:00/0000:00:01.0/0000:01:00.0 (0300:10de:1401)
  Display controller nVidia Corporation GM206 [GeForce GTX 960]
--------------------------------------------------------------------------------
  > INSTALLED:

   NAME:	video-nvidia-455xx
   ATTACHED:	PCI
   VERSION:	2020.10.04
   INFO:	Closed source NVIDIA drivers for linux.
   PRIORITY:	10
   FREEDRIVER:	false
   DEPENDS:	-
   CONFLICTS:	video*nvidia-* 
   CLASSIDS:	0300 0302 
   VENDORIDS:	10de 

This happened twice. Both times I was in a video conference in Chrome using Google Meet and switched tabs during the call.
My journal log: journal.log (10.8 KB)

I’m on Nvidia 460.27.04, kernel version 5.10.1.
Here’s the error. This time there’s a lot more.

Dec 21 12:18:56 randys-arch-desktop kernel: BUG: kernel NULL pointer dereference, address: 0000000000000020
Dec 21 12:18:56 randys-arch-desktop kernel: #PF: supervisor read access in kernel mode
Dec 21 12:18:56 randys-arch-desktop kernel: #PF: error_code(0x0000) - not-present page
Dec 21 12:18:56 randys-arch-desktop kernel: PGD 800000019b66e067 P4D 800000019b66e067 PUD 0 
Dec 21 12:18:56 randys-arch-desktop kernel: Oops: 0000 [#1] PREEMPT SMP PTI
Dec 21 12:18:56 randys-arch-desktop kernel: CPU: 6 PID: 209 Comm: irq/137-nvidia Tainted: P     U     OE     5.10.1-103-tkg-pds #1
Dec 21 12:18:56 randys-arch-desktop kernel: Hardware name: System manufacturer System Product Name/PRIME Z270-A, BIOS 1302 03/15/2018
Dec 21 12:18:56 randys-arch-desktop kernel: RIP: 0010:_nv028347rm+0x9/0x90 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel: Code: 8e ff e8 8a af 00 00 31 c0 48 83 c4 08 c3 31 c0 eb bf 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 83 ec 08 48 85 ff 74 57 <48> 8b 17 31 c0 48 85 d2 75 0e eb 2b 0f 1f 00 48 8b 52 10 48 85 d2
Dec 21 12:18:56 randys-arch-desktop kernel: RSP: 0018:ffffa3ae406fbc20 EFLAGS: 00010202
Dec 21 12:18:56 randys-arch-desktop kernel: RAX: 0000000000000020 RBX: 0000000000000020 RCX: 0000000000000010
Dec 21 12:18:56 randys-arch-desktop kernel: RDX: ffff9c4af0657608 RSI: ffffffffffffffff RDI: 0000000000000020
Dec 21 12:18:56 randys-arch-desktop kernel: RBP: ffff9c46896359f0 R08: ffffffffc1b53860 R09: ffff9c46896359d0
Dec 21 12:18:56 randys-arch-desktop kernel: R10: ffff9c468b37c008 R11: ffff9c468b37d098 R12: 0000000000000020
Dec 21 12:18:56 randys-arch-desktop kernel: R13: 0000000000000000 R14: ffff9c4689635b58 R15: ffff9c4689635c98
Dec 21 12:18:56 randys-arch-desktop kernel: FS:  0000000000000000(0000) GS:ffff9c4dded80000(0000) knlGS:0000000000000000
Dec 21 12:18:56 randys-arch-desktop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dec 21 12:18:56 randys-arch-desktop kernel: CR2: 0000000000000020 CR3: 000000016cc1a005 CR4: 00000000003706e0
Dec 21 12:18:56 randys-arch-desktop kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Dec 21 12:18:56 randys-arch-desktop kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Dec 21 12:18:56 randys-arch-desktop kernel: Call Trace:
Dec 21 12:18:56 randys-arch-desktop kernel:  ? _nv030607rm+0x1b/0x90 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? _nv026284rm+0x18/0x60 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? _nv012950rm+0x13d/0x1c0 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? _nv000081rm+0x12f/0x1a0 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? _nv037576rm+0xc3/0x350 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? _nv037575rm+0x63/0x80 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? _nv012877rm+0x78/0xd0 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? _nv012877rm+0x1a/0xd0 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? _nv025428rm+0x251/0x3e0 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? _nv025377rm+0x1f/0xf0 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? _nv016675rm+0xd3/0x3c0 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? _nv028552rm+0xb23/0xdc0 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? _nv028560rm+0x15d/0x400 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? _nv000708rm+0xa9/0x240 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? disable_irq_nosync+0x10/0x10
Dec 21 12:18:56 randys-arch-desktop kernel:  ? rm_isr_bh+0x1c/0x60 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? nvidia_isr_kthread_bh+0x1b/0x40 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel:  ? irq_thread_fn+0x20/0x60
Dec 21 12:18:56 randys-arch-desktop kernel:  ? irq_thread+0xf5/0x1a0
Dec 21 12:18:56 randys-arch-desktop kernel:  ? irq_finalize_oneshot.part.0+0xf0/0xf0
Dec 21 12:18:56 randys-arch-desktop kernel:  ? irq_thread_check_affinity+0xd0/0xd0
Dec 21 12:18:56 randys-arch-desktop kernel:  ? kthread+0x143/0x160
Dec 21 12:18:56 randys-arch-desktop kernel:  ? __kthread_bind_mask+0x60/0x60
Dec 21 12:18:56 randys-arch-desktop kernel:  ? ret_from_fork+0x22/0x30
Dec 21 12:18:56 randys-arch-desktop kernel: Modules linked in: bnep xt_connmark xt_mark iptable_raw xt_CHECKSUM ip6table_mangle ip6table_nat nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_MASQUERADE xt_state iptable_mangle nf_nat_h323 nf_conntrack_h323 nf_nat_pptp nf_conntrack_pptp nf_nat_tftp nf_conntrack_tftp nf_nat_sip nf_conntrack_sip nf_nat_irc nf_conntrack_irc ebtable_filter ebtables iptable_nat br_netfilter overlay bridge stp llc wireguard curve25519_x86_64 libchacha20poly1305 chacha_x86_>
Dec 21 12:18:56 randys-arch-desktop kernel:  nf_nat nf_conntrack_ftp nf_conntrack snd_hda_codec_realtek nf_defrag_ipv6 snd_hda_codec_generic nf_defrag_ipv4 libcrc32c ledtrig_audio iptable_filter snd_usb_audio vmmon(OE) snd_usbmidi_lib snd_rawmidi vmw_vmci snd_seq_device joydev mc msr kvmgt snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg soundwire_intel soundwire_generic_allocation soundwire_cadence snd_hda_codec intel_rapl_msr raid1 intel_rapl_common snd_hda_core snd_hwdep hid_generic soundwire_bus i>
Dec 21 12:18:56 randys-arch-desktop kernel:  mac_hid rng_core vfio_mdev mdev vfio_iommu_type1 vfio i2c_algo_bit intel_gtt kvm irqbypass btusb btrtl btbcm btintel bluetooth ecdh_generic rfkill ecc crypto_user fuse ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 xhci_pci xhci_pci_renesas crc32c_intel xhci_hcd nvidia_drm(POE) drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops cec rc_core drm agpgart nvidia_uvm(POE) nvidia_modeset(POE) nvidia(POE)
Dec 21 12:18:56 randys-arch-desktop kernel: CR2: 0000000000000020
Dec 21 12:18:56 randys-arch-desktop kernel: ---[ end trace a98d3985a567011c ]---
Dec 21 12:18:56 randys-arch-desktop kernel: RIP: 0010:_nv028347rm+0x9/0x90 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel: Code: 8e ff e8 8a af 00 00 31 c0 48 83 c4 08 c3 31 c0 eb bf 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 83 ec 08 48 85 ff 74 57 <48> 8b 17 31 c0 48 85 d2 75 0e eb 2b 0f 1f 00 48 8b 52 10 48 85 d2
Dec 21 12:18:56 randys-arch-desktop kernel: RSP: 0018:ffffa3ae406fbc20 EFLAGS: 00010202
Dec 21 12:18:56 randys-arch-desktop kernel: RAX: 0000000000000020 RBX: 0000000000000020 RCX: 0000000000000010
Dec 21 12:18:56 randys-arch-desktop kernel: RDX: ffff9c4af0657608 RSI: ffffffffffffffff RDI: 0000000000000020
Dec 21 12:18:56 randys-arch-desktop kernel: RBP: ffff9c46896359f0 R08: ffffffffc1b53860 R09: ffff9c46896359d0
Dec 21 12:18:56 randys-arch-desktop kernel: R10: ffff9c468b37c008 R11: ffff9c468b37d098 R12: 0000000000000020
Dec 21 12:18:56 randys-arch-desktop kernel: R13: 0000000000000000 R14: ffff9c4689635b58 R15: ffff9c4689635c98
Dec 21 12:18:56 randys-arch-desktop kernel: FS:  0000000000000000(0000) GS:ffff9c4dded80000(0000) knlGS:0000000000000000
Dec 21 12:18:56 randys-arch-desktop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dec 21 12:18:56 randys-arch-desktop kernel: CR2: 0000000000000020 CR3: 000000016cc1a005 CR4: 00000000003706e0
Dec 21 12:18:56 randys-arch-desktop kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Dec 21 12:18:56 randys-arch-desktop kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Dec 21 12:18:56 randys-arch-desktop kernel: BUG: kernel NULL pointer dereference, address: 00000000000006d1
Dec 21 12:18:56 randys-arch-desktop kernel: #PF: supervisor write access in kernel mode
Dec 21 12:18:56 randys-arch-desktop kernel: #PF: error_code(0x0002) - not-present page
Dec 21 12:18:56 randys-arch-desktop kernel: PGD 800000019b66e067 P4D 800000019b66e067 PUD 0 
Dec 21 12:18:56 randys-arch-desktop kernel: Oops: 0002 [#2] PREEMPT SMP PTI
Dec 21 12:18:56 randys-arch-desktop kernel: CPU: 6 PID: 209 Comm: irq/137-nvidia Tainted: P     UD    OE     5.10.1-103-tkg-pds #1
Dec 21 12:18:56 randys-arch-desktop kernel: Hardware name: System manufacturer System Product Name/PRIME Z270-A, BIOS 1302 03/15/2018
Dec 21 12:18:56 randys-arch-desktop kernel: RIP: 0010:mutex_lock+0x10/0x20
Dec 21 12:18:56 randys-arch-desktop kernel: Code: 03 31 c0 c3 eb d4 0f 1f 40 00 0f 1f 44 00 00 be 02 00 00 00 e9 61 fa ff ff 90 0f 1f 44 00 00 31 c0 65 48 8b 14 25 c0 7b 01 00 <f0> 48 0f b1 17 75 01 c3 eb d6 66 0f 1f 44 00 00 0f 1f 44 00 00 41
Dec 21 12:18:56 randys-arch-desktop kernel: RSP: 0018:ffffa3ae406fbe30 EFLAGS: 00010246
Dec 21 12:18:56 randys-arch-desktop kernel: RAX: 0000000000000000 RBX: 0000000000000001 RCX: 0000000000000000
Dec 21 12:18:56 randys-arch-desktop kernel: RDX: ffff9c4682cb3880 RSI: 0000000000000000 RDI: 00000000000006d1
Dec 21 12:18:56 randys-arch-desktop kernel: RBP: 00000000000006d1 R08: 0000000000000000 R09: ffffa3ae406fb890
Dec 21 12:18:56 randys-arch-desktop kernel: R10: 0000000000000001 R11: 0000000000000001 R12: ffff9c4682cb3dec
Dec 21 12:18:56 randys-arch-desktop kernel: R13: 0000000000000009 R14: 0000000000000001 R15: 0000000000000001
Dec 21 12:18:56 randys-arch-desktop kernel: FS:  0000000000000000(0000) GS:ffff9c4dded80000(0000) knlGS:0000000000000000
Dec 21 12:18:56 randys-arch-desktop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dec 21 12:18:56 randys-arch-desktop kernel: CR2: 00000000000006d1 CR3: 000000016cc1a005 CR4: 00000000003706e0
Dec 21 12:18:56 randys-arch-desktop kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Dec 21 12:18:56 randys-arch-desktop kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Dec 21 12:18:56 randys-arch-desktop kernel: Call Trace:
Dec 21 12:18:56 randys-arch-desktop kernel:  perf_event_exit_task+0x30/0x460
Dec 21 12:18:56 randys-arch-desktop kernel:  ? preempt_schedule_thunk+0x16/0x18
Dec 21 12:18:56 randys-arch-desktop kernel:  do_exit+0x352/0xa40
Dec 21 12:18:56 randys-arch-desktop kernel:  ? task_work_run+0x5c/0x90
Dec 21 12:18:56 randys-arch-desktop kernel:  ? do_exit+0x342/0xa40
Dec 21 12:18:56 randys-arch-desktop kernel:  ? kthread+0x143/0x160
Dec 21 12:18:56 randys-arch-desktop kernel:  ? rewind_stack_do_exit+0x17/0x17
Dec 21 12:18:56 randys-arch-desktop kernel: Modules linked in: bnep xt_connmark xt_mark iptable_raw xt_CHECKSUM ip6table_mangle ip6table_nat nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_MASQUERADE xt_state iptable_mangle nf_nat_h323 nf_conntrack_h323 nf_nat_pptp nf_conntrack_pptp nf_nat_tftp nf_conntrack_tftp nf_nat_sip nf_conntrack_sip nf_nat_irc nf_conntrack_irc ebtable_filter ebtables iptable_nat br_netfilter overlay bridge stp llc wireguard curve25519_x86_64 libchacha20poly1305 chacha_x86_>
Dec 21 12:18:56 randys-arch-desktop kernel:  nf_nat nf_conntrack_ftp nf_conntrack snd_hda_codec_realtek nf_defrag_ipv6 snd_hda_codec_generic nf_defrag_ipv4 libcrc32c ledtrig_audio iptable_filter snd_usb_audio vmmon(OE) snd_usbmidi_lib snd_rawmidi vmw_vmci snd_seq_device joydev mc msr kvmgt snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg soundwire_intel soundwire_generic_allocation soundwire_cadence snd_hda_codec intel_rapl_msr raid1 intel_rapl_common snd_hda_core snd_hwdep hid_generic soundwire_bus i>
Dec 21 12:18:56 randys-arch-desktop kernel:  mac_hid rng_core vfio_mdev mdev vfio_iommu_type1 vfio i2c_algo_bit intel_gtt kvm irqbypass btusb btrtl btbcm btintel bluetooth ecdh_generic rfkill ecc crypto_user fuse ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 xhci_pci xhci_pci_renesas crc32c_intel xhci_hcd nvidia_drm(POE) drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops cec rc_core drm agpgart nvidia_uvm(POE) nvidia_modeset(POE) nvidia(POE)
Dec 21 12:18:56 randys-arch-desktop kernel: CR2: 00000000000006d1
Dec 21 12:18:56 randys-arch-desktop kernel: ---[ end trace a98d3985a567011d ]---
Dec 21 12:18:56 randys-arch-desktop kernel: RIP: 0010:_nv028347rm+0x9/0x90 [nvidia]
Dec 21 12:18:56 randys-arch-desktop kernel: Code: 8e ff e8 8a af 00 00 31 c0 48 83 c4 08 c3 31 c0 eb bf 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 83 ec 08 48 85 ff 74 57 <48> 8b 17 31 c0 48 85 d2 75 0e eb 2b 0f 1f 00 48 8b 52 10 48 85 d2
Dec 21 12:18:56 randys-arch-desktop kernel: RSP: 0018:ffffa3ae406fbc20 EFLAGS: 00010202
Dec 21 12:18:56 randys-arch-desktop kernel: RAX: 0000000000000020 RBX: 0000000000000020 RCX: 0000000000000010
Dec 21 12:18:56 randys-arch-desktop kernel: RDX: ffff9c4af0657608 RSI: ffffffffffffffff RDI: 0000000000000020
Dec 21 12:18:56 randys-arch-desktop kernel: RBP: ffff9c46896359f0 R08: ffffffffc1b53860 R09: ffff9c46896359d0
Dec 21 12:18:56 randys-arch-desktop kernel: R10: ffff9c468b37c008 R11: ffff9c468b37d098 R12: 0000000000000020
Dec 21 12:18:56 randys-arch-desktop kernel: R13: 0000000000000000 R14: ffff9c4689635b58 R15: ffff9c4689635c98
Dec 21 12:18:56 randys-arch-desktop kernel: FS:  0000000000000000(0000) GS:ffff9c4dded80000(0000) knlGS:0000000000000000
Dec 21 12:18:56 randys-arch-desktop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dec 21 12:18:56 randys-arch-desktop kernel: CR2: 00000000000006d1 CR3: 000000016cc1a005 CR4: 00000000003706e0
Dec 21 12:18:56 randys-arch-desktop kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Dec 21 12:18:56 randys-arch-desktop kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Dec 21 12:18:56 randys-arch-desktop kernel: Fixing recursive fault but reboot is needed!
Dec 21 12:19:01 randys-arch-desktop ananicy[722]: renice: firefox[2111/105435] 0 -> -3
Dec 21 12:19:01 randys-arch-desktop ananicy[722]: renice: firefox[2220/105423] 0 -> -3
Dec 21 12:19:11 randys-arch-desktop kernel: [UFW BLOCK] IN=enp0s31f6 OUT= MAC=60:45:cb:7f:97:e3:00:f6:20:67:31:5c:08:00 SRC=192.168.0.102 DST=192.168.0.200 LEN=549 TOS=0x00 PREC=0x00 TTL=64 ID=15771 DF PROTO=UDP SPT=49894 DPT=52800 LEN=529 
Dec 21 12:19:11 randys-arch-desktop ananicy[722]: renice: firefox[2111/105703] 0 -> -3
Dec 21 12:19:11 randys-arch-desktop ananicy[722]: renice: firefox[2271/105695] 0 -> -3
Dec 21 12:19:12 randys-arch-desktop kernel: [UFW BLOCK] IN=enp0s31f6 OUT= MAC=60:45:cb:7f:97:e3:00:f6:20:67:31:5c:08:00 SRC=192.168.0.102 DST=192.168.0.200 LEN=549 TOS=0x00 PREC=0x00 TTL=64 ID=15830 DF PROTO=UDP SPT=48051 DPT=52800 LEN=529 
Dec 21 12:19:13 randys-arch-desktop kernel: [UFW BLOCK] IN=enp0s31f6 OUT= MAC=60:45:cb:7f:97:e3:00:f6:20:67:31:5c:08:00 SRC=192.168.0.102 DST=192.168.0.200 LEN=549 TOS=0x00 PREC=0x00 TTL=64 ID=15871 DF PROTO=UDP SPT=60318 DPT=52800 LEN=529 
Dec 21 12:19:14 randys-arch-desktop kernel: [UFW BLOCK] IN=enp0s31f6 OUT= MAC=60:45:cb:7f:97:e3:00:f6:20:67:31:5c:08:00 SRC=192.168.0.102 DST=192.168.0.200 LEN=549 TOS=0x00 PREC=0x00 TTL=64 ID=15969 DF PROTO=UDP SPT=47029 DPT=52800 LEN=529 
Dec 21 12:19:16 randys-arch-desktop ananicy[722]: renice: firefox[2220/105824] 0 -> -3
Dec 21 12:19:16 randys-arch-desktop ananicy[722]: renice: spotify[23921/105828] 0 -> -3

Here’s a trace:

Dec 21 12:30:40 randys-arch-desktop kernel: WARNING: CPU: 0 PID: 123925 at /var/lib/dkms/nvidia/460.27.04/build/nvidia-drm/nvidia-drm-drv.c:530 nv_drm_master_set+0x22/0x30 [nvidia_drm]
Dec 21 12:30:40 randys-arch-desktop kernel: Modules linked in: bnep xt_connmark xt_mark iptable_raw xt_CHECKSUM ip6table_mangle ip6table_nat nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_MASQUERADE xt_state iptable_mangle nf_nat_h323 nf_conntrack_h323 nf_nat_pptp nf_conntrack_pptp nf_nat_tftp nf_conntrack_tftp nf_nat_sip nf_conntrack_sip nf_nat_irc nf_conntrack_irc ebtable_filter ebtables iptable_nat br_netfilter overlay bridge stp llc wireguard curve25519_x86_64 libchacha20poly1305 chacha_x86_>
Dec 21 12:30:40 randys-arch-desktop kernel:  nf_conntrack snd_hda_codec_realtek nf_defrag_ipv6 snd_hda_codec_generic nf_defrag_ipv4 libcrc32c ledtrig_audio iptable_filter snd_usb_audio vmmon(OE) snd_usbmidi_lib snd_rawmidi vmw_vmci snd_seq_device joydev mc msr kvmgt snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg soundwire_intel soundwire_generic_allocation soundwire_cadence snd_hda_codec intel_rapl_msr raid1 intel_rapl_common snd_hda_core snd_hwdep hid_generic soundwire_bus iTCO_wdt mei_hdcp eeepc_w>
Dec 21 12:30:40 randys-arch-desktop kernel:  vfio_iommu_type1 vfio i2c_algo_bit intel_gtt kvm irqbypass btusb btrtl btbcm btintel bluetooth ecdh_generic rfkill ecc crypto_user fuse ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 xhci_pci xhci_pci_renesas crc32c_intel xhci_hcd nvidia_drm(POE) drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops cec rc_core drm agpgart nvidia_uvm(POE) nvidia_modeset(POE) nvidia(POE) [last unloaded: hwmon_vid]
Dec 21 12:30:40 randys-arch-desktop kernel: CPU: 0 PID: 123925 Comm: Xorg.wrap Tainted: P     UD    OE     5.10.1-103-tkg-pds #1
Dec 21 12:30:40 randys-arch-desktop kernel: Hardware name: System manufacturer System Product Name/PRIME Z270-A, BIOS 1302 03/15/2018
Dec 21 12:30:40 randys-arch-desktop kernel: RIP: 0010:nv_drm_master_set+0x22/0x30 [nvidia_drm]
Dec 21 12:30:40 randys-arch-desktop kernel: Code: 04 d1 66 d4 0f 1f 40 00 0f 1f 44 00 00 48 8b 47 48 48 8b 78 20 48 8b 05 dc 5c 00 00 48 8b 40 28 e8 f3 3a a1 d4 84 c0 74 01 c3 <0f> 0b c3 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 41 57 80
Dec 21 12:30:40 randys-arch-desktop kernel: RSP: 0018:ffffa3ae46d4fbe0 EFLAGS: 00010246
Dec 21 12:30:40 randys-arch-desktop kernel: RAX: 0000000000000000 RBX: ffff9c47aec76800 RCX: 0000000000000000
Dec 21 12:30:40 randys-arch-desktop kernel: RDX: 0000000000000001 RSI: 0000000000000296 RDI: 00000000ffffffff
Dec 21 12:30:40 randys-arch-desktop kernel: RBP: ffff9c46cef58f00 R08: 0000000000000008 R09: ffffa3ae46d4fbc8
Dec 21 12:30:40 randys-arch-desktop kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffff9c46877eb800
Dec 21 12:30:40 randys-arch-desktop kernel: R13: 0000000000000000 R14: ffff9c46877eb800 R15: 000000008b481028
Dec 21 12:30:40 randys-arch-desktop kernel: FS:  00007f6cda452580(0000) GS:ffff9c4ddec00000(0000) knlGS:0000000000000000
Dec 21 12:30:40 randys-arch-desktop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dec 21 12:30:40 randys-arch-desktop kernel: CR2: 00007f6cda2dabc0 CR3: 0000000135988003 CR4: 00000000003706f0
Dec 21 12:30:40 randys-arch-desktop kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Dec 21 12:30:40 randys-arch-desktop kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Dec 21 12:30:40 randys-arch-desktop kernel: Call Trace:
Dec 21 12:30:40 randys-arch-desktop kernel:  drm_new_set_master+0x79/0x100 [drm]
Dec 21 12:30:40 randys-arch-desktop kernel:  drm_master_open+0x67/0x90 [drm]
Dec 21 12:30:40 randys-arch-desktop kernel:  drm_open+0xf5/0x240 [drm]
Dec 21 12:30:40 randys-arch-desktop kernel:  drm_stub_open+0xab/0x130 [drm]
Dec 21 12:30:40 randys-arch-desktop kernel:  chrdev_open+0xca/0x240
Dec 21 12:30:40 randys-arch-desktop kernel:  ? cdev_device_add+0x90/0x90
Dec 21 12:30:40 randys-arch-desktop kernel:  do_dentry_open+0x14e/0x380
Dec 21 12:30:40 randys-arch-desktop kernel:  path_openat+0xbb3/0x1030
Dec 21 12:30:40 randys-arch-desktop kernel:  ? page_add_file_rmap+0x13/0x1c0
Dec 21 12:30:40 randys-arch-desktop kernel:  ? alloc_set_pte+0xec/0x670
Dec 21 12:30:40 randys-arch-desktop kernel:  do_filp_open+0xa9/0x150
Dec 21 12:30:40 randys-arch-desktop kernel:  do_sys_openat2+0xb1/0x160
Dec 21 12:30:40 randys-arch-desktop kernel:  __x64_sys_openat+0x54/0x90
Dec 21 12:30:40 randys-arch-desktop kernel:  do_syscall_64+0x33/0x40
Dec 21 12:30:40 randys-arch-desktop kernel:  entry_SYSCALL_64_after_hwframe+0x44/0xa9
Dec 21 12:30:40 randys-arch-desktop kernel: RIP: 0033:0x7f6cda378c1b
Dec 21 12:30:40 randys-arch-desktop kernel: Code: 25 00 00 41 00 3d 00 00 41 00 74 4b 64 8b 04 25 18 00 00 00 85 c0 75 67 44 89 e2 48 89 ee bf 9c ff ff ff b8 01 01 00 00 0f 05 <48> 3d 00 f0 ff ff 0f 87 91 00 00 00 48 8b 4c 24 28 64 48 2b 0c 25
Dec 21 12:30:40 randys-arch-desktop kernel: RSP: 002b:00007ffdc832b020 EFLAGS: 00000246 ORIG_RAX: 0000000000000101
Dec 21 12:30:40 randys-arch-desktop kernel: RAX: ffffffffffffffda RBX: 00007ffdc832b110 RCX: 00007f6cda378c1b
Dec 21 12:30:40 randys-arch-desktop kernel: RDX: 0000000000000002 RSI: 00007ffdc832b110 RDI: 00000000ffffff9c
Dec 21 12:30:40 randys-arch-desktop kernel: RBP: 00007ffdc832b110 R08: 0000000000000000 R09: 00007ffdc832af30
Dec 21 12:30:40 randys-arch-desktop kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000002
Dec 21 12:30:40 randys-arch-desktop kernel: R13: 0000000000000000 R14: 0000000000000000 R15: 00007ffdc832b0c0
Dec 21 12:30:40 randys-arch-desktop kernel: ---[ end trace a98d3985a567011e ]---

Hope that helps! Thank you all!

Also, the bug report script hangs.

____________________________________________

Start of NVIDIA bug report log file.  Please include this file, along
with a detailed description of your problem, when reporting a graphics
driver bug via the NVIDIA Linux forum (see forums.developer.nvidia.com)
or by sending email to 'linux-bugs@nvidia.com'.

nvidia-bug-report.sh Version: 29373640

Date: Mon 21 Dec 2020 12:27:51 PM PST
uname: Linux randys-arch-desktop 5.10.1-103-tkg-pds #1 TKG SMP PREEMPT Thu, 17 Dec 2020 07:31:08 +0000 x86_64 GNU/Linux
command line flags: --safe-mode --extra-system-data

____________________________________________

*** /sys/bus/pci/devices/0000:01:00.0/power/control
*** ls: -rw-r--r-- 1 root root 4096 2020-12-21 12:23:35.837043428 -0800 /sys/bus/pci/devices/0000:01:00.0/power/control
on

____________________________________________

*** /sys/bus/pci/devices/0000:01:00.0/power/runtime_status
*** ls: -r--r--r-- 1 root root 4096 2020-12-21 12:23:35.839710076 -0800 /sys/bus/pci/devices/0000:01:00.0/power/runtime_status
active

____________________________________________

*** /sys/bus/pci/devices/0000:01:00.0/power/runtime_usage
*** ls: -r--r--r-- 1 root root 4096 2020-12-21 12:23:35.841043400 -0800 /sys/bus/pci/devices/0000:01:00.0/power/runtime_usage
2

____________________________________________

*** /sys/bus/pci/devices/0000:01:00.1/power/control
*** ls: -rw-r--r-- 1 root root 4096 2020-12-21 12:23:35.843710048 -0800 /sys/bus/pci/devices/0000:01:00.1/power/control
auto

____________________________________________

*** /sys/bus/pci/devices/0000:01:00.1/power/runtime_status
*** ls: -r--r--r-- 1 root root 4096 2020-12-21 12:23:35.846376696 -0800 /sys/bus/pci/devices/0000:01:00.1/power/runtime_status
suspended

____________________________________________

*** /sys/bus/pci/devices/0000:01:00.1/power/runtime_usage
*** ls: -r--r--r-- 1 root root 4096 2020-12-21 12:23:35.847710020 -0800 /sys/bus/pci/devices/0000:01:00.1/power/runtime_usage
0

----

Pretty sure it’s useless though.

@amrits
Wasn’t there an issue that was fixed after someone shipped their computer to Nvidia?

We could do that.

Whatever it might have been before, it isn’t anymore: CUDA toolkit 11.2.0 requires Linux driver 460.27.04, CUDA toolkit 11.1.1 requires Linux driver 455.32.00, and CUDA toolkit 11.1.0 requires Linux driver 455.23.05. All of those driver versions are affected.
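
For anyone scripting around this, here is a minimal sketch of the toolkit/driver pairs listed above as a lookup table. The numbers are copied from this post and have not been checked against NVIDIA's release notes; the names `REQUIRED_DRIVER`, `driver_branch` and `toolkits_on_branch_at_least` are made up for illustration.

```python
# CUDA toolkit -> required Linux driver, exactly as quoted in the post above.
# Assumption: these pairs are taken on trust from the thread, not verified.
REQUIRED_DRIVER = {
    "11.2.0": "460.27.04",
    "11.1.1": "455.32.00",
    "11.1.0": "455.23.05",
}


def driver_branch(version):
    """Return the major branch of a driver version, e.g. '455.32.00' -> 455."""
    return int(version.split(".")[0])


def toolkits_on_branch_at_least(min_branch):
    """List toolkits whose required driver is on branch `min_branch` or newer."""
    return sorted(
        toolkit
        for toolkit, driver in REQUIRED_DRIVER.items()
        if driver_branch(driver) >= min_branch
    )
```

Calling `toolkits_on_branch_at_least(455)` returns all three toolkits, which is the point of the post: every listed CUDA 11.1+ toolkit needs a driver from a branch reported as affected.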

Following my reply a couple of weeks ago, I moved back to the LTS kernel and still encounter this bug.

Linux 5.4.85 compiled with a default config
Nvidia 460.27.04 beta driver
GTX 980
8700k
Asus Z370-F board

Reproduced while watching Kodi, and sometimes when interacting with a Chrome window.

I have attached another bug report log in case it can help.

I think I will now go back to the older 440.100 driver to test with this LTS kernel.
Unless anybody knows of a kernel/driver combination that is unaffected?

Hopefully a fix is in the pipeline for this bug.

nvidia-bug-report.log.gz (182.2 KB)

This bug does not depend on the kernel being used; it occurs with any kernel release. This is a driver issue, so don’t waste your time trying to find a working kernel. 440.100 works reliably with any supported kernel, just use it until the issue is fixed.

I am using 450.80.02-r1 on Gentoo with no issues. Anything starting with 455 or 460 causes the issue.

Confirm.

I first encountered this on 450.80.02, and there are other reports on the 450.xx series in this thread, so I believe that series is also affected.
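
The per-branch reports so far in this thread can be summarized in a small table. This is only a sketch of what posters here have said, not an official NVIDIA statement; `REPORTED_STATUS` and `reported_status` are hypothetical names.

```python
# Status of each driver branch as reported by posters in this thread.
# "disputed" means posters disagree with each other.
REPORTED_STATUS = {
    440: "works",     # 440.100 reported as reliable
    450: "disputed",  # 450.80.02: one poster has no issues, another hit the bug
    455: "affected",
    460: "affected",
}


def reported_status(driver_version):
    """Map a version string such as '455.45.01' to the status reported here."""
    branch = int(driver_version.split(".")[0])
    return REPORTED_STATUS.get(branch, "unknown")
```

For example, `reported_status("450.80.02-r1")` gives `"disputed"`, and any branch nobody has mentioned comes back as `"unknown"`.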

440.100 works reliably with any supported kernel, just use it until the issue is fixed.

Thank you, will do.

After upgrading to 455.45.01 on the same Gentoo system with kernel 5.4.80, I got this:

[30472.171728] BUG: kernel NULL pointer dereference, address: 0000000000000020
[30472.171732] #PF: supervisor read access in kernel mode
[30472.171734] #PF: error_code(0x0000) - not-present page
[30472.171735] PGD 751a50067 P4D 751a50067 PUD 0 
[30472.171738] Oops: 0000 [#1] PREEMPT SMP NOPTI
[30472.171740] CPU: 0 PID: 4799 Comm: irq/69-nvidia Tainted: P           O      5.4.80-gentoo-r1-x86_64 #1
[30472.171742] Hardware name: System manufacturer System Product Name/M4A89TD PRO USB3, BIOS 3029    09/07/2012
[30472.172052] RIP: 0010:_nv027527rm+0x9/0x90 [nvidia]
[30472.172055] Code: 90 ff e8 ea b0 00 00 31 c0 48 83 c4 08 c3 31 c0 eb bf 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 83 ec 08 48 85 ff 74 57 <48> 8b 17 31 c0 48 85 d2 75 0e eb 2b 0f 1f 00 48 8b 52 10 48 85 d2
[30472.172057] RSP: 0018:ffffb3ae80fe3be0 EFLAGS: 00010202
[30472.172058] RAX: 0000000000000020 RBX: 0000000000000020 RCX: 0000000000000010
[30472.172059] RDX: ffffa19e47a56a88 RSI: ffffffffffffffff RDI: 0000000000000020
[30472.172060] RBP: ffffa19ecf232c90 R08: ffffffffc1decc70 R09: ffffa19ecf232c70
[30472.172061] R10: ffffffffc0a36c20 R11: ffffa19ecffcd008 R12: 0000000000000020
[30472.172062] R13: 0000000000000000 R14: ffffa19ecf232df8 R15: ffffa19ecf232f38
[30472.172063] FS:  0000000000000000(0000) GS:ffffa19ed7a00000(0000) knlGS:0000000000000000
[30472.172064] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[30472.172065] CR2: 0000000000000020 CR3: 000000061e42a000 CR4: 00000000000006f0
[30472.172066] Call Trace:
[30472.172286]  ? _nv029950rm+0x1b/0x90 [nvidia]
[30472.172432]  ? _nv025474rm+0x18/0x60 [nvidia]
[30472.172578]  ? _nv011691rm+0x13d/0x1c0 [nvidia]
[30472.172739]  ? _nv000083rm+0x12f/0x1a0 [nvidia]
[30472.172930]  ? _nv030071rm+0xb9/0x330 [nvidia]
[30472.173120]  ? _nv030070rm+0x61/0x80 [nvidia]
[30472.173310]  ? _nv030070rm+0x37/0x80 [nvidia]
[30472.173496]  ? _nv011623rm+0x428/0x460 [nvidia]
[30472.173660]  ? _nv024757rm+0x251/0x3e0 [nvidia]
[30472.173845]  ? _nv024705rm+0x1f/0xf0 [nvidia]
[30472.174029]  ? _nv015452rm+0xcb/0x370 [nvidia]
[30472.174172]  ? _nv026076rm+0x10/0x10 [nvidia]
[30472.174358]  ? _nv027734rm+0x273/0xdc0 [nvidia]
[30472.174545]  ? _nv007566rm+0x155/0x270 [nvidia]
[30472.174732]  ? _nv027742rm+0x8d/0x180 [nvidia]
[30472.174869]  ? _nv000712rm+0xa9/0x200 [nvidia]
[30472.174873]  ? irq_forced_thread_fn+0x70/0x70
[30472.175010]  ? rm_isr_bh+0x1c/0x60 [nvidia]
[30472.175142]  ? nvidia_isr_kthread_bh+0x16/0x4d0 [nvidia]
[30472.175144]  ? irq_thread_fn+0x1b/0x60
[30472.175145]  ? irq_thread+0xd7/0x160
[30472.175147]  ? wake_threads_waitq+0x30/0x30
[30472.175148]  ? irq_thread_dtor+0x80/0x80
[30472.175151]  ? kthread+0x125/0x150
[30472.175152]  ? kthread_create_worker_on_cpu+0x60/0x60
[30472.175155]  ? ret_from_fork+0x22/0x40
[30472.175156] Modules linked in: fuse rfcomm cmac algif_hash algif_skcipher af_alg bnep ipv6 btusb uvcvideo btrtl btbcm videobuf2_vmalloc btintel videobuf2_memops videobuf2_v4l2 bluetooth videobuf2_common ecdh_generic ax88179_178a videodev usbnet rfkill ch341 ecc usbserial dm_mod hid_logitech_hidpp nvidia_drm(PO) hid_logitech_dj joydev hid_plantronics snd_usb_audio snd_usbmidi_lib snd_rawmidi snd_seq_device mc usbhid nvidia_modeset(PO) snd_hda_codec_hdmi wmi_bmof amd64_edac_mod kvm_amd ccp kvm irqbypass snd_hda_codec_realtek snd_hda_codec_generic pcspkr ledtrig_audio k10temp snd_hda_intel snd_intel_nhlt i2c_piix4 nvidia(PO) snd_hda_codec ohci_pci snd_hda_core ohci_hcd snd_hwdep snd_pcm snd_timer firewire_ohci snd r8168(O) ata_generic firewire_core asus_atk0110 soundcore pata_acpi hwmon wmi acpi_cpufreq button nvme ehci_pci xhci_pci ahci pata_jmicron ehci_hcd xhci_hcd libahci nvme_core usbcore libata
[30472.175187] CR2: 0000000000000020
[30472.175190] ---[ end trace a7777cbb950f32a9 ]---
[30472.175340] RIP: 0010:_nv027527rm+0x9/0x90 [nvidia]
[30472.175342] Code: 90 ff e8 ea b0 00 00 31 c0 48 83 c4 08 c3 31 c0 eb bf 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 83 ec 08 48 85 ff 74 57 <48> 8b 17 31 c0 48 85 d2 75 0e eb 2b 0f 1f 00 48 8b 52 10 48 85 d2
[30472.175343] RSP: 0018:ffffb3ae80fe3be0 EFLAGS: 00010202
[30472.175345] RAX: 0000000000000020 RBX: 0000000000000020 RCX: 0000000000000010
[30472.175346] RDX: ffffa19e47a56a88 RSI: ffffffffffffffff RDI: 0000000000000020
[30472.175347] RBP: ffffa19ecf232c90 R08: ffffffffc1decc70 R09: ffffa19ecf232c70
[30472.175348] R10: ffffffffc0a36c20 R11: ffffa19ecffcd008 R12: 0000000000000020
[30472.175348] R13: 0000000000000000 R14: ffffa19ecf232df8 R15: ffffa19ecf232f38
[30472.175350] FS:  0000000000000000(0000) GS:ffffa19ed7a00000(0000) knlGS:0000000000000000
[30472.175351] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[30472.175352] CR2: 0000000000000020 CR3: 000000061e42a000 CR4: 00000000000006f0
[30472.175412] BUG: kernel NULL pointer dereference, address: 0000000000000000
[30472.175413] #PF: supervisor instruction fetch in kernel mode
[30472.175414] #PF: error_code(0x0010) - not-present page
[30472.175415] PGD 751a50067 P4D 751a50067 PUD 0 
[30472.175417] Oops: 0010 [#2] PREEMPT SMP NOPTI
[30472.175419] CPU: 0 PID: 4799 Comm: irq/69-nvidia Tainted: P      D    O      5.4.80-gentoo-r1-x86_64 #1
[30472.175420] Hardware name: System manufacturer System Product Name/M4A89TD PRO USB3, BIOS 3029    09/07/2012
[30472.175421] RIP: 0010:0x0
[30472.175424] Code: Bad RIP value.
[30472.175424] RSP: 0018:ffffb3ae80fe3e98 EFLAGS: 00010282
[30472.175425] RAX: 0000000000000000 RBX: 0000000000000001 RCX: 0000000000000000
[30472.175426] RDX: ffffb3ae80fe3ec8 RSI: 0000000000000000 RDI: ffffb3ae80fe3ec8
[30472.175427] RBP: ffffa19ea88588d0 R08: ffffa19ecc8ba1b0 R09: 0000000000000000
[30472.175428] R10: 0000000000000046 R11: ffffb3ae80fe393d R12: ffffa19ea8858240
[30472.175429] R13: ffffffffa17698b0 R14: 0000000000000000 R15: ffffa19ea885890c
[30472.175430] FS:  0000000000000000(0000) GS:ffffa19ed7a00000(0000) knlGS:0000000000000000
[30472.175431] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[30472.175432] CR2: ffffffffffffffd6 CR3: 000000061e42a000 CR4: 00000000000006f0
[30472.175432] Call Trace:
[30472.175435]  task_work_run+0x8e/0xb0
[30472.175437]  do_exit+0x34a/0xac0
[30472.175439]  ? irq_thread_dtor+0x80/0x80
[30472.175441]  ? kthread+0x125/0x150
[30472.175443]  rewind_stack_do_exit+0x17/0x20
[30472.175444] RIP: 0000:0x0
[30472.175446] Code: Bad RIP value.
[30472.175447] RSP: 0000:0000000000000000 EFLAGS: 00000000 ORIG_RAX: 0000000000000000
[30472.175448] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
[30472.175449] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[30472.175449] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[30472.175450] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[30472.175451] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[30472.175452] Modules linked in: fuse rfcomm cmac algif_hash algif_skcipher af_alg bnep ipv6 btusb uvcvideo btrtl btbcm videobuf2_vmalloc btintel videobuf2_memops videobuf2_v4l2 bluetooth videobuf2_common ecdh_generic ax88179_178a videodev usbnet rfkill ch341 ecc usbserial dm_mod hid_logitech_hidpp nvidia_drm(PO) hid_logitech_dj joydev hid_plantronics snd_usb_audio snd_usbmidi_lib snd_rawmidi snd_seq_device mc usbhid nvidia_modeset(PO) snd_hda_codec_hdmi wmi_bmof amd64_edac_mod kvm_amd ccp kvm irqbypass snd_hda_codec_realtek snd_hda_codec_generic pcspkr ledtrig_audio k10temp snd_hda_intel snd_intel_nhlt i2c_piix4 nvidia(PO) snd_hda_codec ohci_pci snd_hda_core ohci_hcd snd_hwdep snd_pcm snd_timer firewire_ohci snd r8168(O) ata_generic firewire_core asus_atk0110 soundcore pata_acpi hwmon wmi acpi_cpufreq button nvme ehci_pci xhci_pci ahci pata_jmicron ehci_hcd xhci_hcd libahci nvme_core usbcore libata
[30472.175471] CR2: 0000000000000000
[30472.175472] ---[ end trace a7777cbb950f32aa ]---
[30472.175627] RIP: 0010:_nv027527rm+0x9/0x90 [nvidia]
[30472.175629] Code: 90 ff e8 ea b0 00 00 31 c0 48 83 c4 08 c3 31 c0 eb bf 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 83 ec 08 48 85 ff 74 57 <48> 8b 17 31 c0 48 85 d2 75 0e eb 2b 0f 1f 00 48 8b 52 10 48 85 d2
[30472.175630] RSP: 0018:ffffb3ae80fe3be0 EFLAGS: 00010202
[30472.175632] RAX: 0000000000000020 RBX: 0000000000000020 RCX: 0000000000000010
[30472.175632] RDX: ffffa19e47a56a88 RSI: ffffffffffffffff RDI: 0000000000000020
[30472.175633] RBP: ffffa19ecf232c90 R08: ffffffffc1decc70 R09: ffffa19ecf232c70
[30472.175634] R10: ffffffffc0a36c20 R11: ffffa19ecffcd008 R12: 0000000000000020
[30472.175635] R13: 0000000000000000 R14: ffffa19ecf232df8 R15: ffffa19ecf232f38
[30472.175636] FS:  0000000000000000(0000) GS:ffffa19ed7a00000(0000) knlGS:0000000000000000
[30472.175637] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[30472.175638] CR2: ffffffffffffffd6 CR3: 000000061e42a000 CR4: 00000000000006f0
[30472.175639] Fixing recursive fault but reboot is needed!

This is nearly identical to my previous crash with 455.28. It also happened fairly soon after a reboot, and there were no other error messages (about allocation or faults) in the kernel log at that time.
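
To compare crashes like this without reading the dumps by eye, something along these lines could pull out the fault address and the faulting symbol of each oops. It is a rough sketch that assumes the message formats seen in the log above ("BUG: kernel NULL pointer dereference, address: ..." followed by a kernel-mode "RIP: 0010:..." line); the function name `summarize_oops` is made up.

```python
import re

# Patterns modeled on the log lines above; other kernels may word these differently.
OOPS_RE = re.compile(r"BUG: kernel NULL pointer dereference, address: ([0-9a-f]+)")
RIP_RE = re.compile(r"RIP: 0010:(\S+)")  # 0010 is the kernel code segment in these logs


def summarize_oops(log_text):
    """Return a list of (fault_address, rip_symbol) tuples, one per oops found."""
    lines = log_text.splitlines()
    results = []
    for i, line in enumerate(lines):
        match = OOPS_RE.search(line)
        if not match:
            continue
        rip = None
        # The first RIP line after the BUG line belongs to this oops.
        for later in lines[i + 1:]:
            rip_match = RIP_RE.search(later)
            if rip_match:
                rip = rip_match.group(1)
                break
        results.append((int(match.group(1), 16), rip))
    return results


# Four lines copied from the log above.
SAMPLE = """\
[30472.171728] BUG: kernel NULL pointer dereference, address: 0000000000000020
[30472.172052] RIP: 0010:_nv027527rm+0x9/0x90 [nvidia]
[30472.175412] BUG: kernel NULL pointer dereference, address: 0000000000000000
[30472.175421] RIP: 0010:0x0
"""
```

Running `summarize_oops` over two saved logs and comparing the tuples shows quickly whether the fault address and faulting symbol really match between crashes.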

I got the same while playing Wasteland 3 on Gentoo, with the same driver version 455.45.01-r1 and kernel version 5.4.80-gentoo-r1.

Same here, and this bug clearly shows up more and more often with the latest version of the driver.

I won’t send any logs because they are the same as all the logs here, with exactly the same errors.

The crashes began with a Chromium-based browser open (the result is the same whether or not it is active, but the crashes seem more frequent when I am using it).

I downgraded to 440.xx; 455.xx is really unusable for now.

Kernel 5.4.80-02 | Manjaro | GTX 860M

The errors may look similar; however, there is significant variety in the functions that appear in the stack traces. I was surprised how similar my two logs are while others are all over the place.

Also, since function names in the proprietary blob are obfuscated, they may reveal valuable information if resolved to the names in the original source.
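
One way to put a number on "similar" versus "all over the place" is to extract the function names from each call trace and compare the sets. The sketch below assumes dmesg-style lines with a bracketed timestamp, as in the Gentoo log above; `trace_functions` and `jaccard` are names made up for illustration. Since the `_nvNNNNNNrm` names are obfuscated, treat a comparison between different driver builds with caution: whether those names stay stable across builds is not something this thread establishes.

```python
import re

# Matches frames such as "]  ? _nv029950rm+0x1b/0x90 [nvidia]" or
# "]  task_work_run+0x8e/0xb0". RIP lines are skipped because the symbol
# there follows "0010:" rather than whitespace.
FRAME_RE = re.compile(r"\]\s+(?:\? )?([A-Za-z_][\w.]*)\+0x[0-9a-f]+/0x[0-9a-f]+")


def trace_functions(log_text):
    """Return function names from stack-frame lines, in order of appearance."""
    names = []
    for line in log_text.splitlines():
        match = FRAME_RE.search(line)
        if match:
            names.append(match.group(1))
    return names


def jaccard(names_a, names_b):
    """Jaccard similarity of two name lists: shared names / all names."""
    set_a, set_b = set(names_a), set(names_b)
    if not set_a and not set_b:
        return 1.0
    return len(set_a & set_b) / len(set_a | set_b)
```

Feed `trace_functions` the text of two saved logs and pass the results to `jaccard`: a value near 1.0 means the traces go through the same functions, a value near 0.0 means they barely overlap.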

And in three months, Nvidia hasn’t solved this bug (and what about Nvidia’s tests to reproduce it?).

I’m sorry, but we get no news day after day, week after week. And why can’t Nvidia reproduce this bug with all the information here?

Whatever, I have exactly (exactly) the same log as you.