The 565.57 driver builds fine but crashes at boot.
[ 5.424961] ------------[ cut here ]------------
[ 5.424967] WARNING: CPU: 4 PID: 1041 at drivers/gpu/drm/drm_file.c:312 drm_open_helper+0x132/0x150
[ 5.424971] Modules linked in: vfat fat hid_logitech_hidpp(+) nvidia_drm(OE+) iwldvm nvidia_modeset(OE) joydev mac80211 snd_sof_pci_intel_tgl snd_sof_pci_intel_cnl snd_sof_intel_hda_generic soundwire_intel soundwire_cadence snd_sof_intel_hda_common snd_soc_hdac_hda snd_sof_intel_hda_mlink snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp libarc4 snd_sof snd_sof_utils snd_soc_acpi_intel_match soundwire_generic_allocation iwlwifi snd_soc_acpi nvidia(OE) soundwire_bus cfg80211 snd_soc_avs snd_hda_codec_realtek snd_soc_hda_codec snd_hda_codec_generic snd_hda_ext_core snd_hda_scodec_component intel_rapl_msr snd_soc_core intel_rapl_common eeepc_wmi snd_hda_codec_hdmi asus_wmi sparse_keymap intel_uncore_frequency platform_profile intel_uncore_frequency_common snd_compress intel_tcc_cooling ac97_bus x86_pkg_temp_thermal intel_powerclamp snd_pcm_dmaengine coretemp snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi kvm_intel snd_hda_codec snd_hda_core snd_hwdep kvm snd_seq spi_nor mei_hdcp mei_pxp iTCO_wdt rapl snd_seq_device
[ 5.424997] intel_cstate intel_pmc_bxt mtd spd5118 iTCO_vendor_support snd_pcm mei_me intel_uncore nzxt_kraken3 snd_timer uas spi_intel_pci wmi_bmof igc snd drm_ttm_helper mei usb_storage spi_intel rfkill i2c_i801 soundcore ttm i2c_smbus idma64 hid_logitech_dj intel_pmc_core intel_vsec pmt_telemetry pmt_class acpi_pad acpi_tad fuse loop nfnetlink zram crct10dif_pclmul crc32_pclmul nvme crc32c_intel polyval_clmulni polyval_generic ghash_clmulni_intel nvme_core sha512_ssse3 sha256_ssse3 sha1_ssse3 nvme_auth vmd video wmi pinctrl_alderlake pkcs8_key_parser i2c_dev
[ 5.425018] CPU: 4 UID: 0 PID: 1041 Comm: ddcutil Tainted: G OE ------- --- 6.12.0-0.rc4.39.fc42.x86_64 #1
[ 5.425020] Tainted: [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
[ 5.425021] Hardware name: ASUS System Product Name/PRIME Z690-A, BIOS 3802 08/08/2024
[ 5.425022] RIP: 0010:drm_open_helper+0x132/0x150
[ 5.425023] Code: 54 67 00 31 c0 48 83 c4 08 5b 5d 41 5c 41 5d 41 5e 41 5f c3 cc cc cc cc 48 89 df 89 44 24 04 e8 d4 f9 ff ff 8b 44 24 04 eb db <0f> 0b b8 ea ff ff ff eb d2 b8 ea ff ff ff eb cb b8 f0 ff ff ff eb
[ 5.425024] RSP: 0018:ffffa73240c377e0 EFLAGS: 00010246
[ 5.425025] RAX: ffffffffc1a0f380 RBX: ffff8ec1ac5f96e8 RCX: ffffffff9eb351f0
[ 5.425026] RDX: 0000000000000000 RSI: ffff8ec1ac5f96e8 RDI: ffff8ec19b5af500
[ 5.425027] RBP: ffff8ec19b5af500 R08: 0000000000035f68 R09: 0000000000000014
[ 5.425027] R10: 0000000000000000 R11: ffff8ec8df0217c0 R12: ffff8ec186449800
[ 5.425028] R13: ffffffffc1a0f380 R14: 00000000ffffffed R15: ffff8ec18335b8d0
[ 5.425029] FS: 00007f7fc1c6fb80(0000) GS:ffff8ec8df000000(0000) knlGS:0000000000000000
[ 5.425029] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 5.425030] CR2: 00007f98c13510b0 CR3: 000000011b5bc000 CR4: 0000000000f52ef0
[ 5.425031] PKRU: 55555554
[ 5.425031] Call Trace:
[ 5.425032] <TASK>
[ 5.425033] ? drm_open_helper+0x132/0x150
[ 5.425034] ? __warn.cold+0x93/0xfa
[ 5.425037] ? drm_open_helper+0x132/0x150
[ 5.425040] ? report_bug+0xff/0x140
[ 5.425041] ? handle_bug+0x58/0x90
[ 5.425043] ? exc_invalid_op+0x17/0x70
[ 5.425043] ? asm_exc_invalid_op+0x1a/0x20
[ 5.425047] ? drm_open_helper+0x132/0x150
[ 5.425048] drm_open+0x73/0x110
[ 5.425049] drm_stub_open+0x9b/0xd0
[ 5.425051] chrdev_open+0xb2/0x230
[ 5.425053] ? __pfx_chrdev_open+0x10/0x10
[ 5.425054] do_dentry_open+0x14c/0x4a0
[ 5.425056] vfs_open+0x2e/0xe0
[ 5.425057] path_openat+0x82e/0x12e0
[ 5.425059] ? drm_fb_memcpy+0x142/0x270
[ 5.425061] do_filp_open+0xc4/0x170
[ 5.425063] do_sys_openat2+0xae/0xe0
[ 5.425064] __x64_sys_openat+0x55/0xa0
[ 5.425066] do_syscall_64+0x82/0x160
[ 5.425068] ? drm_atomic_state_default_clear+0x1c0/0x2e0
[ 5.425070] ? drm_modeset_drop_locks+0x52/0x70
[ 5.425071] ? drm_client_modeset_commit_atomic+0x1c3/0x250
[ 5.425072] ? __memcg_slab_free_hook+0x11a/0x170
[ 5.425074] ? drm_fb_helper_lastclose+0x49/0x80
[ 5.425075] ? __x64_sys_close+0x3c/0x80
[ 5.425076] ? kmem_cache_free+0x3fa/0x450
[ 5.425077] ? syscall_exit_to_user_mode_prepare+0x149/0x170
[ 5.425079] ? syscall_exit_to_user_mode+0x10/0x210
[ 5.425081] ? do_syscall_64+0x8e/0x160
[ 5.425082] ? __memcg_slab_free_hook+0x11a/0x170
[ 5.425083] ? __x64_sys_close+0x3c/0x80
[ 5.425084] ? kmem_cache_free+0x3fa/0x450
[ 5.425085] ? syscall_exit_to_user_mode_prepare+0x149/0x170
[ 5.425086] ? syscall_exit_to_user_mode+0x10/0x210
[ 5.425087] ? do_syscall_64+0x8e/0x160
[ 5.425089] ? syscall_exit_to_user_mode_prepare+0x149/0x170
[ 5.425090] ? syscall_exit_to_user_mode+0x10/0x210
[ 5.425091] ? do_syscall_64+0x8e/0x160
[ 5.425092] ? exc_page_fault+0x7e/0x180
[ 5.425094] entry_SYSCALL_64_after_hwframe+0x76/0x7e
[ 5.425095] RIP: 0033:0x7f7fc1d8c2d6
[ 5.425105] Code: 89 df e8 8d c1 00 00 8b 93 08 03 00 00 59 5e 48 83 f8 fc 75 15 83 e2 39 83 fa 08 75 0d e8 32 ff ff ff 66 90 48 8b 45 10 0f 05 <48> 8b 5d f8 c9 c3 0f 1f 40 00 f3 0f 1e fa 55 48 89 e5 48 83 ec 08
[ 5.425105] RSP: 002b:00007ffe35bfb1c0 EFLAGS: 00000202 ORIG_RAX: 0000000000000101
[ 5.425106] RAX: ffffffffffffffda RBX: 00007f7fc1c6fb80 RCX: 00007f7fc1d8c2d6
[ 5.425107] RDX: 0000000000000002 RSI: 0000558f96fd7ee0 RDI: ffffffffffffff9c
[ 5.425108] RBP: 00007ffe35bfb1d0 R08: 0000000000000000 R09: 0000000000000000
[ 5.425108] R10: 0000000000000000 R11: 0000000000000202 R12: 0000000000000000
[ 5.425109] R13: 0000558f96fd7980 R14: 0000558f96cac4a8 R15: 0000000000000010
[ 5.425110] </TASK>
[ 5.425110] ---[ end trace 0000000000000000 ]---
Hi, please see the following issue:
opened 05:09PM - 30 Sep 24 UTC
bug
NV-Triaged
### NVIDIA Open GPU Kernel Modules Version
ed4be649623435ebb04f5e93f859bf46d977… daa4
### Please confirm this issue does not happen with the proprietary driver (of the same version). This issue tracker is only for bugs specific to the open kernel driver.
- [ ] I confirm that this does not happen with the proprietary driver package.
### Operating System and Version
CachyOS (ArchLinux)
### Kernel Release
6.12.0rc1
### Please confirm you are running a stable release kernel (e.g. not a -rc). We do not accept bug reports for unreleased kernels.
- [ ] I am running on a stable kernel release.
### Hardware: GPU
GPU 0: NVIDIA GeForce RTX 4070 SUPER (UUID: GPU-8c5baf85-cb1f-fe26-95d5-ff3fd51249bb)
### Describe the bug
Since the 6.12.0rc1 release, the kernel DRM helper crashes with the 560.35.03 driver.
The following patches, extracted from the 550.120 release, were pulled in to make the driver compatible with 6.12:
- drm_fbdev fixup for 6.11+: https://github.com/CachyOS/kernel-patches/blob/master/6.12/misc/nvidia/0004-6.11-Add-fix-for-fbdev.patch
- drm_output_poll_changed check for 6.12: https://github.com/CachyOS/kernel-patches/blob/master/6.12/misc/nvidia/0005-6.12-drm_outpull_pill-changed-check.patch
An additional patch is needed for the module to compile (required since commit https://github.com/torvalds/linux/commit/32f51ead3d7771cdec29f75e08d50a76d2c6253d ):
```diff
diff --git a/kernel-open/nvidia-uvm/uvm_hmm.c b/kernel-open/nvidia-uvm/uvm_hmm.c
index 93e64424..dc64184e 100644
--- a/kernel-open/nvidia-uvm/uvm_hmm.c
+++ b/kernel-open/nvidia-uvm/uvm_hmm.c
@@ -2694,7 +2694,7 @@ static NV_STATUS dmamap_src_sysmem_pages(uvm_va_block_t *va_block,
             continue;
         }
 
-        if (PageSwapCache(src_page)) {
+        if (folio_test_swapcache(page_folio(src_page))) {
             // TODO: Bug 4050579: Remove this when swap cached pages can be
             // migrated.
             status = NV_WARN_MISMATCHED_TARGET;
```
With these patches, the DKMS compilation succeeds and the driver works fine with the 6.11.x kernel.
Booting into 6.12.0rc1, however, makes the driver crash at drm_open_helper, and no graphical interface is available anymore. The tty works fine.
The following is visible in the dmesg log:
```
[ 5.090174] Console: switching to colour frame buffer device 240x67
[ 5.090176] nvidia 0000:01:00.0: [drm] fb0: nvidia-drmdrmfb frame buffer device
[ 5.096243] ------------[ cut here ]------------
[ 5.096244] WARNING: CPU: 0 PID: 453 at drivers/gpu/drm/drm_file.c:312 drm_open_helper+0x135/0x150
[ 5.096249] Modules linked in: nvidia_uvm(OE) nvidia_drm(OE) drm_ttm_helper btrfs ttm blake2b_generic nvidia_modeset(OE) libcrc32c crc32c_generic xor hid_generic raid6_pq nvme nvme_core crc32c_intel video sha256_ssse3 usbhid nvme_auth wmi nvidia(OE)
[ 5.096255] CPU: 0 UID: 0 PID: 453 Comm: plymouthd Tainted: G OE 6.12.0-rc1-1-cachyos-rc #1 12df37afa12b373ced2670803975698fbda2ce5d
[ 5.096257] Tainted: [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
[ 5.096257] Hardware name: ASRock X670E Pro RS/X670E Pro RS, BIOS 3.08 09/18/2024
[ 5.096258] RIP: 0010:drm_open_helper+0x135/0x150
[ 5.096259] Code: 5d 41 5c c3 cc cc cc cc 48 89 df e8 c5 82 fe ff 85 c0 0f 84 7a ff ff ff 48 89 df 89 44 24 0c e8 c1 f9 ff ff 8b 44 24 0c eb d1 <0f> 0b b8 ea ff ff ff eb c8 b8 ea ff ff ff eb c1 b8 f0 ff ff ff eb
[ 5.096260] RSP: 0018:ffffa643409ffb20 EFLAGS: 00010246
[ 5.096261] RAX: ffffffffc15df380 RBX: ffff89f744740f28 RCX: 0000000000000000
[ 5.096262] RDX: ffff89f755ee0000 RSI: ffff89f744740f28 RDI: ffff89f74df1cd80
[ 5.096262] RBP: ffff89f74df1cd80 R08: 0000000000000006 R09: ffff89f740213cd0
[ 5.096263] R10: 00000000000000e2 R11: 0000000000000002 R12: ffff89f75735a000
[ 5.096263] R13: ffffffffc15df380 R14: 00000000ffffffed R15: ffffa643409ffe1c
[ 5.096264] FS: 00007f6b595ce480(0000) GS:ffff8a065ce00000(0000) knlGS:0000000000000000
[ 5.096264] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 5.096265] CR2: 000055da04c46558 CR3: 000000010d18c000 CR4: 0000000000f50ef0
[ 5.096265] PKRU: 55555554
[ 5.096266] Call Trace:
[ 5.096267] <TASK>
[ 5.096267] ? drm_open_helper+0x135/0x150
[ 5.096268] ? __warn.cold+0xad/0x116
[ 5.096270] ? drm_open_helper+0x135/0x150
[ 5.096272] ? report_bug+0x127/0x170
[ 5.096273] ? handle_bug+0x58/0x90
[ 5.096275] ? exc_invalid_op+0x1b/0x80
[ 5.096276] ? asm_exc_invalid_op+0x1a/0x20
[ 5.096279] ? drm_open_helper+0x135/0x150
[ 5.096279] drm_open+0x81/0x110
[ 5.096280] drm_stub_open+0xaf/0x100
[ 5.096282] chrdev_open+0xc5/0x260
[ 5.096285] ? __pfx_chrdev_open+0x10/0x10
[ 5.096286] do_dentry_open+0x14b/0x490
[ 5.096287] vfs_open+0x30/0xe0
[ 5.096289] path_openat+0x84d/0x1320
[ 5.096290] ? __alloc_pages_noprof+0x183/0x350
[ 5.096292] do_filp_open+0xd2/0x180
[ 5.096293] do_sys_openat2+0xca/0x100
[ 5.096294] __x64_sys_openat+0x55/0xa0
[ 5.096295] do_syscall_64+0x82/0x190
[ 5.096296] ? handle_mm_fault+0x1d9/0x2e0
[ 5.096297] ? do_user_addr_fault+0x38d/0x6c0
[ 5.096299] entry_SYSCALL_64_after_hwframe+0x76/0x7e
[ 5.096300] RIP: 0033:0x7f6b59899ae5
[ 5.096301] Code: 75 53 89 f0 f7 d0 a9 00 00 41 00 74 48 80 3d d1 b5 0d 00 00 74 6c 45 89 e2 89 da 48 89 ee bf 9c ff ff ff b8 01 01 00 00 0f 05 <48> 3d 00 f0 ff ff 0f 87 8f 00 00 00 48 8b 54 24 28 64 48 2b 14 25
[ 5.096302] RSP: 002b:00007fffbdc08760 EFLAGS: 00000202 ORIG_RAX: 0000000000000101
[ 5.096303] RAX: ffffffffffffffda RBX: 0000000000000002 RCX: 00007f6b59899ae5
[ 5.096303] RDX: 0000000000000002 RSI: 000055da04c42a40 RDI: 00000000ffffff9c
[ 5.096303] RBP: 000055da04c42a40 R08: 0000000000000000 R09: 0000000000000007
[ 5.096304] R10: 0000000000000000 R11: 0000000000000202 R12: 0000000000000000
[ 5.096304] R13: 00007f6b599a1a50 R14: 000000000000000b R15: 000055da04c43e30
[ 5.096305] </TASK>
[ 5.096305] ---[ end trace 0000000000000000 ]---
[ 5.173332] systemd-journald[355]: Received SIGTERM from PID 1 (systemd).
```
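When comparing splats like this one across reports, the WARNING header line can be taken apart mechanically to check whether two reports hit the same source location. The following is a minimal Python sketch, not part of the original report. The regular expression and the helper names `find_warnings` and `same_site` are illustrative assumptions, and it only handles the header format seen in this log.

```python
import re

# Matches the header line of a kernel WARNING splat, for example:
#   WARNING: CPU: 0 PID: 453 at drivers/gpu/drm/drm_file.c:312 drm_open_helper+0x135/0x150
# This pattern is an assumption based on the log format shown in this report.
WARNING_RE = re.compile(
    r"WARNING: CPU: (?P<cpu>\d+) PID: (?P<pid>\d+) at "
    r"(?P<file>[^\s:]+):(?P<line>\d+) "
    r"(?P<func>[\w.]+)\+(?P<offset>0x[0-9a-fA-F]+)/(?P<size>0x[0-9a-fA-F]+)"
)


def find_warnings(dmesg_text):
    """Return one dict per WARNING header line found in dmesg_text."""
    results = []
    for match in WARNING_RE.finditer(dmesg_text):
        info = match.groupdict()
        # Convert the numeric fields so they can be compared directly.
        info["cpu"] = int(info["cpu"])
        info["pid"] = int(info["pid"])
        info["line"] = int(info["line"])
        info["offset"] = int(info["offset"], 16)
        info["size"] = int(info["size"], 16)
        results.append(info)
    return results


def same_site(a, b):
    """True if two parsed warnings come from the same file, line and function."""
    return (a["file"], a["line"], a["func"]) == (b["file"], b["line"], b["func"])
```

Feeding it the output of `dmesg` from two machines and calling `same_site` on the results is enough to tell whether both warnings originate from the same check, even when the instruction offsets differ between kernel builds.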
### To Reproduce
1. Compile the 6.12.0rc1 kernel
2. Apply the above-mentioned patches to 560.35.03
3. Compile the module and boot into the new kernel
### Bug Incidence
Always
### nvidia-bug-report.log.gz
[nvidia-bug-report.log.gz](https://github.com/user-attachments/files/17193814/nvidia-bug-report.log.gz)
### More Info
_No response_
We are aware of the issue and have an internal fix for it. The fix will ship in the next beta driver release, which will support Linux kernel 6.12 when it is released.
We do not support release candidate kernels with our releases.
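Since release candidate kernels are out of scope, it can help to check the running kernel before filing a report. The snippet below is a rough Python heuristic, offered as a sketch only: the function name `is_rc_kernel` and the pattern are assumptions, and release strings are distribution-specific, so it may misclassify unusual naming schemes.

```python
import re

# Heuristic: "rc" followed by digits, preceded by the start of the string,
# a '.', a '-' or a digit, and not followed by another letter or digit.
# This pattern is an assumption, not an official rule.
RC_RE = re.compile(r"(?:^|[.\-\d])rc\d+(?![A-Za-z0-9])")


def is_rc_kernel(release):
    """Return True if the kernel release string looks like an -rc build."""
    return RC_RE.search(release) is not None
```

For the running system, the release string can be obtained with `platform.release()` from the Python standard library (the same value as `uname -r`) and passed to `is_rc_kernel`.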