Kernel panic has occurred when doing reboot stress test with Jetson Linux 35.1

Hi,

We flashed the Jetson Linux 35.1 image from sdk manager into ther Xavier NX devkit + P3668-0001 NX SOM (EMMC SKU). We used the NVP Model clock configuration tool for set the power mode to “20W-6 Core” and did the reboot stress test for testing system stabiblity, We found some kernel panic message in console log, that device was stucked and can’t boot into desktop environment. After few minutes, the device will auto reboot again… please help to check the logs and provide the siolution
jetpack502.log (1.2 MB)
.

[ 7.015734] **************************************
[ 7.015789] ------------[ cut here ]------------
[ 7.025012] kernel BUG at drivers/soc/tegra/cbb/tegra194-cbb.c:1896!
[ 7.031574] Internal error: Oops - BUG: 0 [#1] PREEMPT SMP
[ 7.036827] Modules linked in:
[ 7.039806] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 5.10.104-tegra #1
[ 7.046446] Hardware name: Unknown NVIDIA Jetson Xavier NX Developer Kit/NVIDIA Jetson Xavier NX Developer Kit, BIOS 1.0-d7fb19b 08/10/2022
[ 7.058958] pstate: 60400089 (nZCv daIf +PAN -UAO -TCO BTYPE=–)
[ 7.065020] pc : tegra194_cbb_err_isr+0x190/0x1b0
[ 7.069460] lr : tegra194_cbb_err_isr+0x114/0x1b0
[ 7.074183] sp : ffff800010003de0
[ 7.077859] x29: ffff800010003de0 x28: 0000000000000001
[ 7.083282] x27: 0000000000000005 x26: ffffab6ea41c3f58
[ 7.088446] x25: ffffab6ea4b1ce10 x24: 0000000000000001
[ 7.093886] x23: ffffab6ea44b7000 x22: ffffab6ea493ea00
[ 7.099208] x21: 000000000000000f x20: ffff0013c0e66f80
[ 7.104980] x19: ffffab6ea493ea00 x18: 0000000000000010
[ 7.110147] x17: ffffab6ea426d008 x16: 00000000134c32e4
[ 7.115741] x15: ffffab6ea47b2bf0 x14: 0720072007200720
[ 7.121253] x13: 0720072007200720 x12: 0720072007200720
[ 7.126766] x11: 0720072007200720 x10: 0720072007200720
[ 7.132022] x9 : ffff800010003cf0 x8 : 2a2a2a2a2a2a2a2a
[ 7.137533] x7 : 2a2a2a2a2a2a2a2a x6 : 000000002a6af221
[ 7.143303] x5 : 000000000000000c x4 : 00000000fffff2bd
[ 7.148729] x3 : 00000000ffffffff x2 : ffffab6ea2c5e090
[ 7.154071] x1 : ffffab6ea47b2680 x0 : 0000000100010001
[ 7.159403] Call trace:
[ 7.161859] tegra194_cbb_err_isr+0x190/0x1b0
[ 7.166145] __handle_irq_event_percpu+0x68/0x2a0
[ 7.170693] handle_irq_event_percpu+0x40/0xa0
[ 7.174897] handle_irq_event+0x50/0xf0
[ 7.178660] handle_fasteoi_irq+0xc0/0x170
[ 7.182686] generic_handle_irq+0x40/0x60
[ 7.186984] __handle_domain_irq+0x70/0xd0
[ 7.191169] efi_header_end+0xb0/0xf0
[ 7.194410] el1_irq+0xd0/0x180
[ 7.197644] cpuidle_enter_state+0xb8/0x410
[ 7.201584] cpuidle_enter+0x40/0x60
[ 7.205085] call_cpuidle+0x44/0x80
[ 7.208586] do_idle+0x208/0x270
[ 7.212255] cpu_startup_entry+0x2c/0x70
[ 7.215761] rest_init+0xdc/0xe8
[ 7.219275] arch_call_rest_init+0x18/0x20
[ 7.223199] start_kernel+0x514/0x54c
[ 7.226788] Code: a9425bf5 a9446bf9 a8c77bfd d65f03c0 (d4210000)
[ 7.233000] —[ end trace b9b85043b81cca5a ]—
[ 7.237717] Kernel panic - not syncing: Oops - BUG: Fatal exception in interrupt
[ 7.245244] SMP: stopping secondary CPUs
[ 7.248859] Kernel Offset: 0x2b6e92aa0000 from 0xffff800010000000
[ 7.255217] PHYS_OFFSET: 0xffffffed40000000
[ 7.259334] CPU features: 0x8240002,03802a30
[ 7.263533] Memory Limit: none
[ 7.266691] —[ end Kernel panic - not syncing: Oops - BUG: Fatal exception in interrupt ]—
jetpack502.log (273.6 KB)

Thanks

Seen the same error randomly with the same configuration (NX devkit eMMC with 35.1)

Please help to open your own topic to follow up. Thanks

Hi,

The post Can’t boot from EMMC when doing reboot stress test with Jetson Linux 35.1 - Jetson & Embedded Systems / Jetson Xavier NX - NVIDIA Developer Forums figure out the root cause as following:

There is a problem in HDMI driver and will cause panic. We are still checking.
And there is a mechanism to put board into recovery boot in rel-35.1. If kernel panic too many times, then it will happen.

[ 6.988564] tegradc 15200000.display: Bootloader disp_param detected. Detected mode: 8x4 (on 0x0mm) pclk=148350937