Nv driver bug?

[pnip@cgpu102 127.0.0.1-2021-07-21-04:42:59]$ sudo crash /usr/lib/debug/lib/modules/3.10.0-1160.15.2.el7.x86_64/vmlinux vmcore
crash 7.2.3-11.el7_9.1
Copyright (C) 2002-2017 Red Hat, Inc.
Copyright (C) 2004, 2005, 2006, 2010 IBM Corporation
Copyright (C) 1999-2006 Hewlett-Packard Co
Copyright (C) 2005, 2006, 2011, 2012 Fujitsu Limited
Copyright (C) 2006, 2007 VA Linux Systems Japan K.K.
Copyright (C) 2005, 2011 NEC Corporation
Copyright (C) 1999, 2002, 2007 Silicon Graphics, Inc.
Copyright (C) 1999, 2000, 2001, 2002 Mission Critical Linux, Inc.
This program is free software, covered by the GNU General Public License,
and you are welcome to change it and/or distribute copies of it under
certain conditions. Enter “help copying” to see the conditions.
This program has absolutely no warranty. Enter “help warranty” for details.
GNU gdb (GDB) 7.6
Copyright (C) 2013 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later http://gnu.org/licenses/gpl.html
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type “show copying”
and “show warranty” for details.
This GDB was configured as “x86_64-unknown-linux-gnu”…
WARNING: kernel relocated [366MB]: patching 87300 gdb minimal_symbol values
KERNEL: /usr/lib/debug/lib/modules/3.10.0-1160.15.2.el7.x86_64/vmlinux
DUMPFILE: vmcore [PARTIAL DUMP]
CPUS: 72
DATE: Wed Jul 21 04:40:51 2021
UPTIME: 6 days, 17:52:24
LOAD AVERAGE: 2.92, 3.26, 3.35
TASKS: 1949
NODENAME: xxxxxxxx
RELEASE: 3.10.0-1160.15.2.el7.x86_64
VERSION: #1 SMP Thu Jan 21 16:15:07 EST 2021
MACHINE: x86_64 (2600 Mhz)
MEMORY: 382.7 GB
PANIC: “BUG: unable to handle kernel NULL pointer dereference at 0000000000000548”
PID: 0
COMMAND: “swapper/19”
TASK: ffff8b4d3cfc5280 (1 of 72) [THREAD_INFO: ffff8b4d3d7c0000]
CPU: 19
STATE: TASK_RUNNING (PANIC)
crash> bt
PID: 0 TASK: ffff8b4d3cfc5280 CPU: 19 COMMAND: “swapper/19”
#0 [ffff8b7acd8439b0] machine_kexec at ffffffff97e662c4
#1 [ffff8b7acd843a10] __crash_kexec at ffffffff97f227a2
#2 [ffff8b7acd843ae0] crash_kexec at ffffffff97f22890
#3 [ffff8b7acd843af8] oops_end at ffffffff9858c798
#4 [ffff8b7acd843b20] no_context at ffffffff97e75d14
#5 [ffff8b7acd843b70] __bad_area_nosemaphore at ffffffff97e75fe2
#6 [ffff8b7acd843bc0] bad_area_nosemaphore at ffffffff97e76104
#7 [ffff8b7acd843bd0] __do_page_fault at ffffffff9858f750
#8 [ffff8b7acd843c40] do_page_fault at ffffffff9858f975
#9 [ffff8b7acd843c70] page_fault at ffffffff9858b778
[exception RIP: _nv030802rm+160]
RIP: ffffffffc78bb690 RSP: ffff8b7acd843d28 RFLAGS: 00010046
RAX: 0000000000000000 RBX: ffff8b76e9f84008 RCX: ffff8b748af6adb0
RDX: ffffffffc82fcc7d RSI: 000000000000004f RDI: ffff8b4b0093b000
RBP: ffff8b748af6ad80 R8: 0000000000000020 R9: ffff8b748af6aed8
R10: ffffffffc758bea0 R11: 0000000000000005 R12: ffffffffc82fcc7d
R13: ffff8b748af6ae48 R14: 0000000000000000 R15: 000000000000004f
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
#10 [ffff8b7acd843d20] _nv030802rm at ffffffffc78bb67f [nvidia]
#11 [ffff8b7acd843d50] _nv029350rm at ffffffffc720fb8e [nvidia]
#12 [ffff8b7acd843d70] _nv030884rm at ffffffffc78c9b75 [nvidia]
#13 [ffff8b7acd843da0] _nv008372rm at ffffffffc71b8fff [nvidia]
#14 [ffff8b7acd843dd0] _nv034071rm at ffffffffc71b93a9 [nvidia]
#15 [ffff8b7acd843df0] _nv031020rm at ffffffffc72105a2 [nvidia]
#16 [ffff8b7acd843e20] rm_run_rc_callback at ffffffffc78c4824 [nvidia]
#17 [ffff8b7acd843e40] nvidia_rc_timer_callback at ffffffffc7166cac [nvidia]
#18 [ffff8b7acd843e58] nv_timer_callback_typed_data at ffffffffc716647d [nvidia]
#19 [ffff8b7acd843e68] call_timer_fn at ffffffff97eabcf8
#20 [ffff8b7acd843ea0] run_timer_softirq at ffffffff97eae38d
#21 [ffff8b7acd843f18] __do_softirq at ffffffff97ea4b35
#22 [ffff8b7acd843f88] call_softirq at ffffffff985984ec
#23 [ffff8b7acd843fa0] do_softirq at ffffffff97e2f715
#24 [ffff8b7acd843fc0] irq_exit at ffffffff97ea4eb5
#25 [ffff8b7acd843fd8] smp_apic_timer_interrupt at ffffffff98599a88
#26 [ffff8b7acd843ff0] apic_timer_interrupt at ffffffff98595fba
— —
#27 [ffff8b4d3d7c3db8] apic_timer_interrupt at ffffffff98595fba
[exception RIP: cpuidle_enter_state+87]
RIP: ffffffff983c74a7 RSP: ffff8b4d3d7c3e60 RFLAGS: 00000206
RAX: 000211ff9e9f6682 RBX: 00000000000159a0 RCX: 0000000000000018
RDX: 0000000225c17d03 RSI: ffff8b4d3d7c3fd8 RDI: 000211ff9e9f6682
RBP: ffff8b4d3d7c3e88 R8: 0000000000097d75 R9: 0000000000000018
R10: 000000000002439a R11: 7fffffffffffffff R12: 0000000000000001
R13: ffffffff97ec9f1d R14: ffff8b4d3d7c3e28 R15: 0000000000000087
ORIG_RAX: ffffffffffffff10 CS: 0010 SS: 0018
#28 [ffff8b4d3d7c3e90] cpuidle_idle_call at ffffffff983c75fe
#29 [ffff8b4d3d7c3ed0] arch_cpu_idle at ffffffff97e37c8e
#30 [ffff8b4d3d7c3ee0] cpu_startup_entry at ffffffff97f0142a
#31 [ffff8b4d3d7c3f28] start_secondary at ffffffff97e5a827
#32 [ffff8b4d3d7c3f50] start_cpu at ffffffff97e000d5
crash>
nvidia-bug-report.log.gz (4.7 MB)