[BUG] Nvidia 440.64 + kernel 5.5.6/stable -- boot trace; WAS Nvidia 440.59 + kernel 5.5.1/stable -- boot trace

You’re absolutely correct. Guess I really should’ve noticed that the changes are quite ancient… Guess I’ll have to blame it on being lazy and somewhat tired at the moment. Sorry about that.

Note, I did not test the patch. It can as well be it because of some other change in the future from that one.

Feels unlikely at this point, looking back on it with a clear head; but I guess removing it from the post was unnecessary. Figured I’d not waste anyone’s time trying. Still, I re-added it.

Did you use https://github.com/torvalds/linux/commits/master/drivers/gpu/drm/drm_atomic_helper.c
there is a fixit https://github.com/torvalds/linux/commit/8de679abc8ae81463d3fec495a21a6ca0a65bade#diff-4b72d79074d15bfc203b0655cccd9a6b which was a bug in the kernel and was fixed in 5.6-rc1

I worked off my local source repository when I copied the relevant parts. Can’t honestly say which version I had checked out when I did, but likely v5.5.3.

However, the only thing the snippet above changes are the nv_drm_atomic_helper_disable_all function, and the addition of the two new macros in “kernel/nvidia-drm/nvidia-drm-helper.h”. The macros comes from “include/drm/drm_atomic.h” in the kernel source, but those will only get used if they’re not found, otherwise they get defined to the kernel versions.

I have a transcoding with signals in 1080i at the input, the output is in 1080p, I need the output to be 1080i, it has the version 418.87.00 do not work, URGENT I need a solution, thank you very much

pls open a new issue instead of hijacking one

inxi -Gx
Graphics:  Device-1: Intel UHD Graphics 630 vendor: Lenovo driver: i915 v: kernel bus ID: 00:02.0 
           Device-2: NVIDIA GP107M [GeForce GTX 1050 3 GB Max-Q] vendor: Lenovo driver: nvidia v: 440.59 bus ID: 01:00.0 
           Display: server: Fedora Project X.org 1.20.6 driver: modesetting,nvidia unloaded: fbdev,nouveau,vesa 
           resolution: 1920x1080~60Hz 
           OpenGL: renderer: Mesa DRI Intel UHD Graphics 630 (Coffeelake 3x8 GT2) v: 4.5 Mesa 19.2.8 direct render: Yes
uname -rm
5.5.5-200.fc31.x86_64 x86_64
[   15.486705] ------------[ cut here ]------------
[   15.486707] refcount_t: underflow; use-after-free.
[   15.486724] WARNING: CPU: 7 PID: 1935 at lib/refcount.c:28 refcount_warn_saturate+0xa6/0xf0
[   15.486725] Modules linked in: cmac bnep sunrpc vfat fat snd_sof_pci snd_sof_intel_byt snd_sof_intel_ipc snd_sof_xtensa_dsp snd_sof_intel_hda_common x86_pkg_temp_thermal snd_soc_hdac_hda intel_powerclamp snd_sof_intel_hda snd_sof snd_soc_skl snd_soc_sst_ipc coretemp snd_soc_sst_dsp snd_hda_ext_core kvm_intel nvidia_drm(POE) snd_soc_acpi_intel_match nvidia_modeset(POE) snd_soc_acpi snd_hda_codec_realtek snd_hda_codec_hdmi snd_soc_core nvidia_uvm(OE) kvm snd_compress snd_hda_codec_generic ledtrig_audio ac97_bus irqbypass snd_pcm_dmaengine snd_hda_intel btusb uvcvideo snd_intel_dspcfg mei_hdcp iTCO_wdt 8821ce(OE) iTCO_vendor_support crct10dif_pclmul crc32_pclmul btrtl videobuf2_vmalloc intel_rapl_msr snd_hda_codec btbcm videobuf2_memops ghash_clmulni_intel intel_cstate snd_hda_core btintel videobuf2_v4l2 videobuf2_common snd_hwdep intel_uncore bluetooth intel_rapl_perf snd_seq videodev nvidia(POE) snd_seq_device pcspkr intel_wmi_thunderbolt wmi_bmof ecdh_generic cfg80211 snd_pcm i2c_i801 mc
[   15.486759]  ecc snd_timer mei_me snd processor_thermal_device mei joydev intel_rapl_common idma64 ipmi_msghandler soundcore intel_soc_dts_iosf intel_pch_thermal ideapad_laptop int3403_thermal sparse_keymap int340x_thermal_zone int3400_thermal acpi_thermal_rel acpi_pad acpi_tad vboxnetadp(OE) vboxnetflt(OE) binfmt_misc vboxdrv(OE) ip_tables rfkill i915 i2c_algo_bit hid_rmi drm_kms_helper rmi_core drm crc32c_intel nvme serio_raw nvme_core r8169 i2c_hid pinctrl_cannonlake video wmi pinctrl_intel fuse [last unloaded: ipmi_devintf]
[   15.486780] CPU: 7 PID: 1935 Comm: Xorg Tainted: P           OE     5.5.5-200.fc31.x86_64 #1
[   15.486781] Hardware name: LENOVO 81LK/LNVNB161216, BIOS BGCN24WW 08/19/2019
[   15.486783] RIP: 0010:refcount_warn_saturate+0xa6/0xf0
[   15.486785] Code: 05 ee 0e 2e 01 01 e8 ab 95 bc ff 0f 0b c3 80 3d dc 0e 2e 01 00 75 95 48 c7 c7 70 8f 3c 85 c6 05 cc 0e 2e 01 01 e8 8c 95 bc ff <0f> 0b c3 80 3d bb 0e 2e 01 00 0f 85 72 ff ff ff 48 c7 c7 c8 8f 3c
[   15.486786] RSP: 0018:ffffb931c3f0bd80 EFLAGS: 00010282
[   15.486787] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000007
[   15.486789] RDX: 0000000000000007 RSI: 0000000000000092 RDI: ffff8fac665d9cc0
[   15.486790] RBP: ffff8fac3af80ce8 R08: 0000000000000446 R09: 0000000000000003
[   15.486791] R10: 0000000000000000 R11: 0000000000000001 R12: ffff8fac5dba2ae8
[   15.486792] R13: ffff8fac5dba2800 R14: 0000000000000008 R15: 0000000000000000
[   15.486793] FS:  00007f5d68ae0f00(0000) GS:ffff8fac665c0000(0000) knlGS:0000000000000000
[   15.486795] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   15.486796] CR2: 000055590f53d6f8 CR3: 0000000237d6a004 CR4: 00000000003606e0
[   15.486797] Call Trace:
[   15.486804]  nv_drm_atomic_helper_disable_all+0xec/0x290 [nvidia_drm]
[   15.486809]  nv_drm_master_drop+0x22/0x60 [nvidia_drm]
[   15.486829]  drm_drop_master+0x1e/0x30 [drm]
[   15.486846]  drm_master_release+0x9f/0xb0 [drm]
[   15.486863]  drm_file_free.part.0+0x21d/0x270 [drm]
[   15.486879]  drm_release+0xa7/0xe0 [drm]
[   15.486883]  __fput+0xc1/0x250
[   15.486887]  task_work_run+0x8a/0xb0
[   15.486891]  exit_to_usermode_loop+0x102/0x130
[   15.486894]  do_syscall_64+0x1a4/0x1c0
[   15.486897]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[   15.486899] RIP: 0033:0x7f5d6903f8e7
[   15.486901] Code: 64 89 02 48 c7 c0 ff ff ff ff eb bb 0f 1f 80 00 00 00 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 03 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 41 c3 48 83 ec 18 89 7c 24 0c e8 e3 fb ff ff
[   15.486902] RSP: 002b:00007ffd8f53b658 EFLAGS: 00000246 ORIG_RAX: 0000000000000003
[   15.486904] RAX: 0000000000000000 RBX: 000055590f517340 RCX: 00007f5d6903f8e7
[   15.486905] RDX: 0000000000000000 RSI: 000055590f5174b0 RDI: 000000000000000c
[   15.486906] RBP: 000000000000000c R08: 0000000000000000 R09: 000055590f519df0
[   15.486907] R10: fffffffffffff206 R11: 0000000000000246 R12: 000055590f5174b0
[   15.486908] R13: 000055590f517380 R14: 0000000000000000 R15: 0000000000000000
[   15.486922] ---[ end trace 83c6332780b02ed3 ]---
uname -rm
	5.5.6-27.geca1eba-default x86_64

nvidia-settings -v
	nvidia-settings:  version 440.64
	  The NVIDIA X Server Settings tool.

dmesg
	...
	[   26.643102] ------------[ cut here ]------------
	[   26.643103] refcount_t: underflow; use-after-free.
	[   26.643116] WARNING: CPU: 3 PID: 3043 at lib/refcount.c:28 refcount_warn_saturate+0xa6/0xf0
	[   26.643117] Modules linked in: nvidia_uvm(OE) ipmi_devintf iscsi_ibft iscsi_boot_sysfs vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) dmi_sysfs bluetooth ecdh_generic ecc cachefiles fscache squashfs loop sch_fq_codel edac_mce_amd kvm_amd kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel eeepc_wmi asus_wmi crypto_simd battery sparse_keymap cryptd rfkill glue_helper video wmi_bmof sp5100_tco pcspkr i2c_piix4 snd_hda_codec_realtek ccp i2c_dev acpi_cpufreq raid10 uas usb_storage md_mod hid_generic usbhid nvidia_drm(POE) nvidia_modeset(POE) tcp_bbr snd_usb_audio nvidia(POE) snd_usbmidi_lib mc snd_rawmidi snd_seq_device snd_hda_codec_via snd_hda_codec_hdmi snd_hda_codec_generic snd_hda_intel ipmi_msghandler ledtrig_audio snd_intel_dspcfg mxm_wmi sg snd_hda_codec drm_kms_helper nct6775 snd_hwdep hwmon_vid syscopyarea sysfillrect snd_hda_core sysimgblt msr fb_sys_fops crc32c_intel xhci_pci snd_pcm mpt3sas drm xhci_hcd snd_timer k10temp snd raid_class igb scsi_transport_sas
	[   26.643150]  soundcore r8169 sr_mod usbcore cdrom realtek dca libphy i2c_algo_bit wmi pinctrl_amd button sunrpc nbd dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua efivarfs
	[   26.643160] CPU: 3 PID: 3043 Comm: X Tainted: P           OE     5.5.6-27.geca1eba-default #1 openSUSE Tumbleweed (unreleased)
	[   26.643161] Hardware name: System manufacturer System Product Name/PRIME X570-PRO, BIOS 1405 11/19/2019
	[   26.643163] RIP: 0010:refcount_warn_saturate+0xa6/0xf0
	[   26.643165] Code: 05 b2 55 05 01 01 e8 7b 04 bb ff 0f 0b c3 80 3d a0 55 05 01 00 75 95 48 c7 c7 48 95 12 b1 c6 05 90 55 05 01 01 e8 5c 04 bb ff <0f> 0b c3 80 3d 7f 55 05 01 00 0f 85 72 ff ff ff 48 c7 c7 a0 95 12
	[   26.643165] RSP: 0018:ffffa271c3b63d88 EFLAGS: 00010282
	[   26.643167] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000007
	[   26.643167] RDX: 0000000000000007 RSI: 0000000000000092 RDI: ffff92911eadbdd0
	[   26.643168] RBP: ffff929116ffcce8 R08: 00000000000006b5 R09: 0000000000000003
	[   26.643169] R10: 0000000000000000 R11: 0000000000000001 R12: ffff928247071ae8
	[   26.643169] R13: ffff928247071800 R14: 0000000000000004 R15: 0000000000000000
	[   26.643171] FS:  00007f808dbebec0(0000) GS:ffff92911eac0000(0000) knlGS:0000000000000000
	[   26.643172] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
	[   26.643172] CR2: 00007f8817ffd488 CR3: 0000000f98ae4000 CR4: 0000000000340ee0
	[   26.643173] Call Trace:
	[   26.643180]  nv_drm_atomic_helper_disable_all+0xec/0x290 [nvidia_drm]
	[   26.643183]  nv_drm_master_drop+0x22/0x60 [nvidia_drm]
	[   26.643195]  drm_drop_master+0x1e/0x30 [drm]
	[   26.643206]  drm_master_release+0x9f/0xb0 [drm]
	[   26.643217]  drm_file_free.part.0+0x1fe/0x260 [drm]
	[   26.643227]  drm_release+0x9a/0xd0 [drm]
	[   26.643230]  __fput+0xc1/0x250
	[   26.643232]  task_work_run+0xa1/0xc0
	[   26.643235]  exit_to_usermode_loop+0x10c/0x130
	[   26.643238]  do_syscall_64+0x1fa/0x240
	[   26.643240]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
	[   26.643242] RIP: 0033:0x7f808b85bf24
	[   26.643243] Code: 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b7 0f 1f 80 00 00 00 00 8b 05 ca c4 20 00 48 63 ff 85 c0 75 13 b8 03 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 44 f3 c3 66 90 48 83 ec 18 48 89 7c 24 08 e8
	[   26.643243] RSP: 002b:00007fff56bba8a8 EFLAGS: 00000246 ORIG_RAX: 0000000000000003
	[   26.643245] RAX: 0000000000000000 RBX: 000000000000000a RCX: 00007f808b85bf24
	[   26.643245] RDX: 0000555717805020 RSI: 0000000000000002 RDI: 000000000000000a
	[   26.643246] RBP: 000055571784a600 R08: 0000555717862650 R09: 00007f808b5cb470
	[   26.643246] R10: 0000000000000079 R11: 0000000000000246 R12: 000055571784a4f0
	[   26.643247] R13: 000055571784a3c0 R14: 0000000000000000 R15: 0000000000000000
	[   26.643249] ---[ end trace 51017c11887d48a8 ]---

make you wonder: has anyone from Nvidia dev even looked at these reports? afaict, they’ve not commented. clearly, not addressed.

Happens for me too

[   14.718249] ------------[ cut here ]------------
[   14.718946] refcount_t: underflow; use-after-free.
[   14.719634] WARNING: CPU: 2 PID: 1184 at lib/refcount.c:28 refcount_warn_saturate+0xa6/0xf0
[   14.720326] Modules linked in: ip6t_REJECT nf_reject_ipv6 ip6t_rpfilter ipt_REJECT nf_reject_ipv4 xt_conntrack ebtable_nat ebtable_broute ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat iptable_mangle iptable_raw iptable_security nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c ip_set nfnetlink ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter bnep sunrpc vfat fat bcache crc64 intel_rapl_msr intel_rapl_common x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm nvidia_drm(POE) nvidia_modeset(POE) irqbypass nvidia_uvm(OE) iTCO_wdt iTCO_vendor_support mei_hdcp mei_wdt crct10dif_pclmul crc32_pclmul ppdev raid1 ghash_clmulni_intel intel_cstate intel_uncore intel_rapl_perf pcspkr snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi btusb btrtl snd_hda_intel btbcm btintel snd_intel_dspcfg snd_hda_codec bluetooth lpc_ich nvidia(POE) i2c_i801 snd_hda_core ses enclosure snd_hwdep scsi_transport_sas snd_seq
[   14.720347]  ecdh_generic joydev snd_seq_device usblp ecc snd_pcm ipmi_devintf mei_me ipmi_msghandler mei snd_timer snd parport_pc parport soundcore tpm_infineon ip_tables rfkill hid_logitech_hidpp uas usb_storage i915 mxm_wmi crc32c_intel i2c_algo_bit drm_kms_helper drm e1000e wmi video hid_logitech_dj fuse
[   14.726334] CPU: 2 PID: 1184 Comm: Xorg Tainted: P           OE     5.5.6-201.fc31.x86_64 #1
[   14.727290] Hardware name: MSI MS-7830/CSM-Q87M-E43 (MS-7830), BIOS V10.3 05/30/2014
[   14.728249] RIP: 0010:refcount_warn_saturate+0xa6/0xf0
[   14.729200] Code: 05 fe 09 2e 01 01 e8 bb 92 bc ff 0f 0b c3 80 3d ec 09 2e 01 00 75 95 48 c7 c7 08 95 3c b2 c6 05 dc 09 2e 01 01 e8 9c 92 bc ff <0f> 0b c3 80 3d cb 09 2e 01 00 0f 85 72 ff ff ff 48 c7 c7 60 95 3c
[   14.730215] RSP: 0018:ffffa90a83e7fd80 EFLAGS: 00010282
[   14.731230] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000007
[   14.732253] RDX: 0000000000000007 RSI: 0000000000000092 RDI: ffff96b30eb19cc0
[   14.733264] RBP: ffff96b2ef56ace8 R08: 00000000000004c6 R09: ffffa90a9091521c
[   14.734272] R10: 0000000000aaaaaa R11: 0000000000000000 R12: ffff96b3077112e8
[   14.735281] R13: ffff96b307711000 R14: 0000000000000008 R15: 0000000000000000
[   14.736295] FS:  00007f122d9f0f00(0000) GS:ffff96b30eb00000(0000) knlGS:0000000000000000
[   14.737322] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   14.738352] CR2: 00005622cacfe138 CR3: 00000008062ee002 CR4: 00000000001606e0
[   14.739389] Call Trace:
[   14.740432]  nv_drm_atomic_helper_disable_all+0xec/0x290 [nvidia_drm]
[   14.741488]  nv_drm_master_drop+0x22/0x60 [nvidia_drm]
[   14.742553]  drm_drop_master+0x1e/0x30 [drm]
[   14.743610]  drm_master_release+0x9f/0xb0 [drm]
[   14.744670]  drm_file_free.part.0+0x21d/0x270 [drm]
[   14.745730]  drm_release+0xa7/0xe0 [drm]
[   14.746783]  __fput+0xc1/0x250
[   14.747835]  task_work_run+0x8a/0xb0
[   14.748885]  exit_to_usermode_loop+0x102/0x130
[   14.749931]  do_syscall_64+0x1a4/0x1c0
[   14.750973]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[   14.752020] RIP: 0033:0x7f122df4f8e7
[   14.753062] Code: 64 89 02 48 c7 c0 ff ff ff ff eb bb 0f 1f 80 00 00 00 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 03 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 41 c3 48 83 ec 18 89 7c 24 0c e8 e3 fb ff ff
[   14.754177] RSP: 002b:00007fff1bdb9a08 EFLAGS: 00000246 ORIG_RAX: 0000000000000003
[   14.755301] RAX: 0000000000000000 RBX: 00005622cacfc5d0 RCX: 00007f122df4f8e7
[   14.756423] RDX: 0000000000000000 RSI: 00005622cacfc740 RDI: 000000000000000c
[   14.757541] RBP: 000000000000000c R08: 0000000000000000 R09: 00005622cacfd110
[   14.758666] R10: fffffffffffff206 R11: 0000000000000246 R12: 00005622cacfc740
[   14.759791] R13: 00005622cacfc610 R14: 0000000000000000 R15: 0000000000000000
[   14.760917] ---[ end trace d9e61cc6624f267d ]---

EDIT: driver version 440.59

@aplattner

can we get some sort of official comment from Nvidia on kernel 5.5.x fix/support?
are we just to write-off the entire release cycle, and hope/wait for 5.6.x?

We have the same crash on Fedora31 with latest drivers from Fusion repo
https://ask.fedoraproject.org/t/kernel-tainted-after-running-updates/5487/7

The crash was introduced with 5.5.5 upgrade and it’s still here on 5.5.6

[ 7.503213] CPU: 1 PID: 414 Comm: plymouthd Tainted: P OE 5.5.6-201.fc31.x86_64 #1
[ 7.503218] Hardware name: Micro-Star International Co., Ltd. PS42 Modern 8RC/MS-14B2, BIOS E14B2IMS.106 12/06/2018
[ 7.503230] RIP: 0010:refcount_warn_saturate+0xa6/0xf0
[ 7.503239] Code: 05 fe 09 2e 01 01 e8 bb 92 bc ff 0f 0b c3 80 3d ec 09 2e 01 00 75 95 48 c7 c7 08 95 3c bb c6 05 dc 09 2e 01 01 e8 9c 92 bc ff <0f> 0b c3 80 3d cb 09 2e 01 00 0f 85 72 ff ff ff 48 c7 c7 60 95 3c
[ 7.503244] RSP: 0018:ffffb290407cbcb8 EFLAGS: 00010286
[ 7.503250] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000007
[ 7.503254] RDX: 0000000000000007 RSI: 0000000000000086 RDI: ffff97732ec59cc0
[ 7.503258] RBP: ffff9773150874e8 R08: 0000000000000382 R09: 0000000000000003
[ 7.503262] R10: 0000000000000000 R11: 0000000000000001 R12: ffff9773257282e8
[ 7.503265] R13: ffff977325728000 R14: 0000000000000000 R15: ffff977325750a00
[ 7.503272] FS: 00007fe4f52e9f00(0000) GS:ffff97732ec40000(0000) knlGS:0000000000000000
[ 7.503276] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 7.503280] CR2: 00007f7ed9315a30 CR3: 0000000464a1e001 CR4: 00000000003606e0
[ 7.503284] Call Trace:
[ 7.503313] nv_drm_atomic_helper_disable_all+0xec/0x290 [nvidia_drm]
[ 7.503333] nv_drm_master_drop+0x22/0x60 [nvidia_drm]
[ 7.503396] drm_drop_master+0x1e/0x30 [drm]
[ 7.503452] drm_dropmaster_ioctl+0x4c/0x90 [drm]
[ 7.503506] ? drm_setmaster_ioctl+0xb0/0xb0 [drm]
[ 7.503565] drm_ioctl_kernel+0xaa/0xf0 [drm]
[ 7.503631] drm_ioctl+0x208/0x390 [drm]
[ 7.503686] ? drm_setmaster_ioctl+0xb0/0xb0 [drm]
[ 7.503701] ? do_filp_open+0xa5/0x100
[ 7.503718] do_vfs_ioctl+0x461/0x6d0
[ 7.503743] ksys_ioctl+0x5e/0x90
[ 7.503756] __x64_sys_ioctl+0x16/0x20
[ 7.503769] do_syscall_64+0x5b/0x1c0
[ 7.503785] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 7.503794] RIP: 0033:0x7fe4f55a738b
[ 7.503802] Code: 0f 1e fa 48 8b 05 fd 9a 0c 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 0f 1f 44 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d cd 9a 0c 00 f7 d8 64 89 01 48
[ 7.503806] RSP: 002b:00007ffc4d2ede78 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[ 7.503813] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007fe4f55a738b
[ 7.503817] RDX: 0000000000000000 RSI: 000000000000641f RDI: 000000000000000b
[ 7.503821] RBP: 000000000000641f R08: 0000555981b9bd50 R09: 00007fe4f56ba380
[ 7.503824] R10: 0000000000000000 R11: 0000000000000246 R12: 0000555981b9bd80
[ 7.503828] R13: 000000000000000b R14: 0000000000000000 R15: 0000000000000000
[ 7.503839] —[ end trace fe605e9abea0643f ]—

We can’t report bugs when a kernel is tainted so let’s hope Nvidia guys could fix it in the future.
The good news that this crash is happening only once during initial boot. Everything is working fine after the initial boot.

This very crash is related to plymouthd

~/# dnf list installed akmod-nvidia
Installed Packages
akmod-nvidia.x86_64 3:440.59-1.fc31 @rpmfusion-nonfree-updates

And we got crash in nv_drm_atomic_helper_disable_all function during plymouth init.

To be clear … do you see this^ (or similar) crash if plymouth is DISabled?

Sorry for confusing message, it was copy-pasted from another forum we were talking a little bit different things. Sure plymouth is not a reason. Without plymouth it is crashing with Xorg.
Call-trace is pretty much the same,

[    9.571753] CPU: 4 PID: 1183 Comm: Xorg Tainted: P           OE     5.5.6-201.fc31.x86_64 #1
[    9.571754] Hardware name: Micro-Star International Co., Ltd. PS42 Modern 8RC/MS-14B2, BIOS E14B2IMS.106 12/06/2018
[    9.571756] RIP: 0010:refcount_warn_saturate+0xa6/0xf0
[    9.571758] Code: 05 fe 09 2e 01 01 e8 bb 92 bc ff 0f 0b c3 80 3d ec 09 2e 01 00 75 95 48 c7 c7 08 95 3c 8f c6 05 dc 09 2e 01 01 e8 9c 92 bc ff <0f> 0b c3 80 3d cb 09 2e 01 00 0f 85 72 ff ff ff 48 c7 c7 60 95 3c
[    9.571759] RSP: 0018:ffffaa120174bd80 EFLAGS: 00010282
[    9.571760] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000007
[    9.571761] RDX: 0000000000000007 RSI: 0000000000000092 RDI: ffff8d706ed19cc0
[    9.571762] RBP: ffff8d7066050ce8 R08: 00000000000003a8 R09: 0000000000000003
[    9.571763] R10: 0000000000000000 R11: 0000000000000001 R12: ffff8d7066086ae8
[    9.571764] R13: ffff8d7066086800 R14: 0000000000000000 R15: dead000000000100
[    9.571765] FS:  00007fdea078bf00(0000) GS:ffff8d706ed00000(0000) knlGS:0000000000000000
[    9.571766] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    9.571767] CR2: 00005561797269e8 CR3: 0000000413ce6003 CR4: 00000000003606e0
[    9.571768] Call Trace:
[    9.571776]  nv_drm_atomic_helper_disable_all+0xec/0x290 [nvidia_drm]
[    9.571781]  nv_drm_master_drop+0x22/0x60 [nvidia_drm]
[    9.571801]  drm_drop_master+0x1e/0x30 [drm]
[    9.571816]  drm_master_release+0x9f/0xb0 [drm]
[    9.571831]  drm_file_free.part.0+0x21d/0x270 [drm]
[    9.571847]  drm_release+0xa7/0xe0 [drm]
[    9.571851]  __fput+0xc1/0x250
[    9.571854]  task_work_run+0x8a/0xb0
[    9.571857]  exit_to_usermode_loop+0x102/0x130
[    9.571860]  do_syscall_64+0x1a4/0x1c0
[    9.571864]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[    9.571866] RIP: 0033:0x7fdea0cea8e7
[    9.571868] Code: 64 89 02 48 c7 c0 ff ff ff ff eb bb 0f 1f 80 00 00 00 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 03 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 41 c3 48 83 ec 18 89 7c 24 0c e8 e3 fb ff ff
[    9.571869] RSP: 002b:00007ffeced06898 EFLAGS: 00000246 ORIG_RAX: 0000000000000003
[    9.571870] RAX: 0000000000000000 RBX: 00005561797134f0 RCX: 00007fdea0cea8e7
[    9.571871] RDX: 0000556179712ce0 RSI: 0000556179713660 RDI: 000000000000000c
[    9.571872] RBP: 000000000000000c R08: 0000000000000006 R09: 0000556179713ea0
[    9.571873] R10: 0000000000000000 R11: 0000000000000246 R12: 0000556179713660
[    9.571873] R13: 0000556179713530 R14: 0000000000000000 R15: 0000000000000000
[    9.571876] ---[ end trace 2c19b3c5d8948c7b ]---

Got a new kernel and lots of new nvidia stuff from rpmfusion this morning, but it apparently had nothing to do with this, still get the same call trace when X first starts. Here’s the latest stuff I’m now running:

Linux tomh 5.5.7-200.fc31.x86_64 #1 SMP Fri Feb 28 17:18:37 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux

xorg-x11-drv-nvidia-libs-440.64-1.fc31.x86_64
xorg-x11-drv-nvidia-cuda-libs-440.64-1.fc31.i686
kmod-nvidia-5.5.5-200.fc31.x86_64-440.59-1.fc31.x86_64
kmod-nvidia-5.5.6-201.fc31.x86_64-440.64-1.fc31.x86_64
akmod-nvidia-440.64-1.fc31.x86_64
nvidia-settings-440.64-1.fc31.x86_64
kmod-nvidia-5.5.7-200.fc31.x86_64-440.64-1.fc31.x86_64
xorg-x11-drv-nvidia-440.64-1.fc31.x86_64
xorg-x11-drv-nvidia-kmodsrc-440.64-1.fc31.x86_64
xorg-x11-drv-nvidia-cuda-440.64-1.fc31.x86_64
xorg-x11-drv-nvidia-cuda-libs-440.64-1.fc31.x86_64
nvidia-persistenced-440.64-1.fc31.x86_64
xorg-x11-drv-nvidia-libs-440.64-1.fc31.i686

Edit: Right, the one user I had test a couple of changes, first claimed this had fixed it…and then, now, later said it was back… And now another claim that it did nothing, so I’m removing the changeset as it’s pretty conclusive that it doesn’t change anything. Sorry for the noise.

Bug in redhat tracker
https://bugzilla.redhat.com/show_bug.cgi?id=1806257

same here, but with Vulkan drivers 440.66.03 and kernel 5.5.10

└───╼  modinfo nvidia
filename:       /lib/modules/5.5.10-arch1-1/kernel/drivers/video/nvidia.ko.xz
alias:          char-major-195-*
version:        440.66.03
supported:      external
license:        NVIDIA
srcversion:     DC0048D50541FC60098682F
alias:          pci:v000010DEd*sv*sd*bc03sc02i00*
alias:          pci:v000010DEd*sv*sd*bc03sc00i00*
depends:        ipmi_msghandler
retpoline:      Y
name:           nvidia
vermagic:       5.5.10-arch1-1 SMP preempt mod_unload 

[   41.924407] ------------[ cut here ]------------
[   41.924409] refcount_t: underflow; use-after-free.
[   41.924426] WARNING: CPU: 25 PID: 1324 at lib/refcount.c:28 refcount_warn_saturate+0xa6/0xf0
[   41.924427] Modules linked in: tun mousedev input_leds pktcdvd hid_generic intel_rapl_msr ucsi_ccg typec_ucsi nvidia_drm(POE) iTCO_wdt nvidia_modeset(POE) typec iTCO_vendor_support cmac algif_hash xt_MASQUERADE algif_skcipher eeepc_wmi iptable_nat usbhid asus_wmi af_alg nf_nat battery hid bnep wmi_bmof sparse_key
map mxm_wmi nvidia(POE) nf_conntrack nct7904 nf_defrag_ipv6 nf_defrag_ipv4 nct6775 libcrc32c msr hwmon_vid iptable_filter intel_rapl_common sb_edac x86_pkg_temp_thermal coretemp kvm_intel kvm snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio snd_hda_intel snd_intel_dspcfg crct10dif_pclmul crc32_pclmul snd_hd
a_codec ghash_clmulni_intel snd_hda_core nls_iso8859_1 snd_hwdep nls_cp437 snd_pcm aesni_intel vfat crypto_simd drm_kms_helper cryptd fat igb glue_helper snd_timer ipmi_devintf mei_me intel_cstate ipmi_msghandler snd intel_uncore syscopyarea sr_mod sysfillrect i2c_algo_bit sysimgblt intel_rapl_perf pcspkr cdrom i2c_
i801 sd_mod mei lpc_ich dca soundcore i2c_nvidia_gpu
[   41.924466]  fb_sys_fops btusb btrtl btbcm btintel bluetooth wmi acpi_power_meter ecdh_generic rfkill ecc evdev mac_hid vhba(OE) uinput fuse eeprom sg br_netfilter bridge drm stp llc crypto_user agpgart ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 ahci crc32c_intel libahci libata xhci_pci ehci_pci sc
si_mod xhci_hcd ehci_hcd vfio_pci irqbypass vfio_virqfd vfio_iommu_type1 vfio
[   41.924485] CPU: 25 PID: 1324 Comm: Xorg.wrap Tainted: P           OE     5.5.10-arch1-1 #1
[   41.924486] Hardware name: ASUSTeK COMPUTER INC. Z10PE-D8 WS/Z10PE-D8 WS, BIOS 3703 04/13/2018
[   41.924488] RIP: 0010:refcount_warn_saturate+0xa6/0xf0
[   41.924490] Code: 05 79 ec 09 01 01 e8 8b 65 c1 ff 0f 0b c3 80 3d 67 ec 09 01 00 75 95 48 c7 c7 78 3b 34 b6 c6 05 57 ec 09 01 01 e8 6c 65 c1 ff <0f> 0b c3 80 3d 46 ec 09 01 00 0f 85 72 ff ff ff 48 c7 c7 d0 3b 34
[   41.924491] RSP: 0018:ffffa66e0a44fd90 EFLAGS: 00010286
[   41.924492] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
[   41.924493] RDX: 0000000000000001 RSI: 0000000000000002 RDI: 00000000ffffffff
[   41.924493] RBP: ffff8d3cdb4ddce8 R08: 00000009c2e3b7db R09: ffff8d44fff6f680
[   41.924494] R10: 00000000000006e2 R11: 00000000000593f4 R12: ffff8d44db8fc2e8
[   41.924495] R13: ffff8d44db8fc000 R14: 0000000000000008 R15: 0000000000000000
[   41.924496] FS:  00007f2db8523540(0000) GS:ffff8d3cdfb40000(0000) knlGS:0000000000000000
[   41.924497] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   41.924498] CR2: 00007f2db83ac490 CR3: 00000006f8732003 CR4: 00000000003606e0
[   41.924499] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   41.924499] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   41.924500] Call Trace:
[   41.924508]  nv_drm_atomic_helper_disable_all+0xec/0x290 [nvidia_drm]
[   41.924515]  nv_drm_master_drop+0x22/0x60 [nvidia_drm]
[   41.924547]  drm_drop_master+0x1e/0x30 [drm]
[   41.924557]  drm_master_release+0x9f/0xb0 [drm]
[   41.924567]  drm_file_free.part.0+0x1fe/0x260 [drm]
[   41.924577]  drm_release+0x9a/0xd0 [drm]
[   41.924581]  __fput+0xae/0x230
[   41.924585]  task_work_run+0x93/0xb0
[   41.924589]  exit_to_usermode_loop+0xda/0x100
[   41.924592]  do_syscall_64+0x11f/0x150
[   41.924595]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[   41.924597] RIP: 0033:0x7f2db844bc37
[   41.924599] Code: ff ff e8 dc e4 01 00 66 2e 0f 1f 84 00 00 00 00 00 66 90 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 03 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 41 c3 48 83 ec 18 89 7c 24 0c e8 d3 4e f9 ff
[   41.924600] RSP: 002b:00007ffd3142dc78 EFLAGS: 00000246 ORIG_RAX: 0000000000000003
[   41.924601] RAX: 0000000000000000 RBX: 0000000000000001 RCX: 00007f2db844bc37
[   41.924602] RDX: 00007ffd3142dc90 RSI: 00000000c04064a0 RDI: 0000000000000003
[   41.924602] RBP: 00007ffd3142dce0 R08: 0000000000000000 R09: 00007ffd3142db00
[   41.924603] R10: 000055e2c659d64b R11: 0000000000000246 R12: 0000000000000003
[   41.924603] R13: 0000000000000001 R14: 0000000000000000 R15: 00007ffd3142dc90
[   41.924606] ---[ end trace d7c78baae500c2f3 ]---

greetings

Same here.
System: Zen2 TRX40, RTX 2080, NVIDIA 440.64, Ubuntu 20.04, Kernel 5.6.0.

Because of this useless bug, DRM is NOT working…
Are NVIDIA developers on holiday?

[    8.803373] Call Trace:
[    8.803379]  nv_drm_atomic_helper_disable_all+0xed/0x290 [nvidia_drm]
[    8.803380]  nv_drm_master_drop+0x28/0x60 [nvidia_drm]
[    8.803388]  drm_drop_master+0x22/0x30 [drm]
[    8.803393]  drm_dropmaster_ioctl+0x51/0x90 [drm]
[    8.803399]  ? drm_setmaster_ioctl+0xb0/0xb0 [drm]
[    8.803405]  drm_ioctl_kernel+0xae/0xf0 [drm]
[    8.803411]  drm_ioctl+0x234/0x3d0 [drm]
[    8.803417]  ? drm_setmaster_ioctl+0xb0/0xb0 [drm]
[    8.803419]  ? putname+0x4a/0x50
[    8.803421]  ? do_sys_openat2+0x1a9/0x2a0
[    8.803422]  ksys_ioctl+0x9d/0xd0
[    8.803423]  __x64_sys_ioctl+0x1a/0x20
[    8.803425]  do_syscall_64+0x57/0x1b0
[    8.803427]  entry_SYSCALL_64_after_hwframe+0x44/0xa9

That particular warning (the underflow one) shouldn’t, to my knowledge, have any effect of the DRM subsystem.
What’s the problem you’re facing, exactly? Missing HW acceleration; decoding; general crashes; or the like? If that the case, I would recommend opening a new topic for that.
You can, at least, test direct rendering by installing whatever package on your distribution got glxinfo, and running glxinfo|grep 'direct rendering'

Lastly, any “holiday” they might be having is probably just due that pesky “flu” that got everyone all riled up at the moment.