Warm reset causes CBB errors

Hi

A Jetson Xavier NX on JP5.0.2 sometimes reports a kernel panic during a system warm reset. It looks like a bug in tegra194-cbb.c, as shown in the log below. When it happens, you have to power the board off and back on; after that the system boots normally.

[ 1.298793] Initramfs unpacking failed: invalid magic at start of compressed archive
[ 1.336377] tegra186-dpaux-pinctrl 155f0000.dpaux: can not get clock
rm_rail_debugfs_init: /rm/vdd_cpu: failed
rm_rail_debugfs_init: /rm/vdd_cpu: failed
debugfs initialized
[ 23.062809] Camera-FW on t194-rce-safe started
TCU early console enabled.
[ 23.131297] Camera-FW on t194-rce-safe ready SHA1=d48f1e27 (crt 0.780 ms, total boot 69.298 ms)
[ 2.460642] tegra-hsp b950000.tegra-hsp: Try increasing MBOX_TX_QUEUE_LEN
[ 2.460992] tegra-hsp b950000.tegra-hsp: Try increasing MBOX_TX_QUEUE_LEN
[ 2.461194] tegra-hsp b950000.tegra-hsp: Try increasing MBOX_TX_QUEUE_LEN
[ 2.461498] tegra-hsp b950000.tegra-hsp: Try increasing MBOX_TX_QUEUE_LEN
[ 2.461643] tegra-hsp b950000.tegra-hsp: Try increasing MBOX_TX_QUEUE_LEN
[ 4.335948] tegradc 15200000.display: hdmi: can't get adpater for ddc bus 3
[ 4.454512] CPU:0, Error:cbb-noc@0x2300000,irq=15
[ 4.454659] **************************************
[ 4.454788] CPU:0, Error:cbb-noc
[ 4.454877] Error Logger : 0
[ 4.454962] ErrLog0 : 0x80030000
[ 4.455053] Transaction Type : RD - Read, Incrementing
[ 4.455185] Error Code : SLV
[ 4.455269] Error Source : Target
[ 4.455358] Error Description : Target error detected by CBB slave
[ 4.455526] AXI2APB_5 bridge error: RDFIFOF - Read Response FIFO Full interrupt
[ 4.455713] Packet header Lock : 0
[ 4.455803] Packet header Len1 : 3
[ 4.455909] NOC protocol version : version >= 2.7
[ 4.456028] ErrLog1 : 0x352424
[ 4.456129] ErrLog2 : 0x0
[ 4.456202] RouteId : 0x352424
[ 4.456288] InitFlow : ccroc_p2ps/I/ccroc_p2ps
[ 4.456423] Targflow : host1x_p2pm/T/host1x_p2pm
[ 4.456543] TargSubRange : 18
[ 4.456634] SeqId : 0
[ 4.456709] ErrLog3 : 0x4045c
[ 4.456790] ErrLog4 : 0x0
[ 4.456996] Address accessed : 0x15b4045c
[ 4.457507] ErrLog5 : 0x909f851
[ 4.461004] Non-Modify : 0x1
[ 4.464416] AXI ID : 0x12
[ 4.467310] Master ID : CCPLEX
[ 4.470635] Security Group(GRPSEC): 0x7e
[ 4.475095] Cache : 0x1 -- Bufferable
[ 4.479038] Protection : 0x2 -- Unprivileged, Non-Secure, Data Access
[ 4.485860] FALCONSEC : 0x0
[ 4.489009] Virtual Queuing Channel(VQC): 0x0
[ 4.493648] **************************************
[ 4.498668] kernel BUG at drivers/soc/tegra/cbb/tegra194-cbb.c:1896!
[ 4.505025] Internal error: Oops - BUG: 0 [#1] PREEMPT SMP
[ 4.510447] Modules linked in:
[ 4.513686] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 5.10.104-tegra #1
[ 4.520329] Hardware name: Unknown NVIDIA Jetson Xavier NX Developer Kit/NVIDIA Jetson Xavier NX Developer Kit, BIOS r35.0-8f953e493 09/02/2022
[ 4.533107] pstate: 60400089 (nZCv daIf +PAN -UAO -TCO BTYPE=--)
[ 4.538719] pc : tegra194_cbb_err_isr+0x188/0x1a0
[ 4.543694] lr : tegra194_cbb_err_isr+0x10c/0x1a0
[ 4.548416] sp : ffff800010003de0
[ 4.551572] x29: ffff800010003de0 x28: 0000000000000001
[ 4.557084] x27: 0000000000000005 x26: ffffbf0235b12290
[ 4.562509] x25: ffffbf023645ce10 x24: 0000000000000001
[ 4.568191] x23: ffffbf0235df7000 x22: ffffbf023627ea00
[ 4.573270] x21: 000000000000000f x20: ffff49dc40e67880
[ 4.579042] x19: ffffbf023627ea00 x18: 0000000000000010
[ 4.584468] x17: 0000000000006270 x16: 0000000000004350
[ 4.589547] x15: ffffbf02360f2bf0 x14: 0720072007200720
[ 4.595317] x13: 0720072007200720 x12: 0720072007200720
[ 4.600652] x11: 0720072007200720 x10: 0720072007200720
[ 4.605908] x9 : 0720072007200720 x8 : 07200720072a072a
[ 4.611678] x7 : 072a072a072a072a x6 : c0000000ffffefff
[ 4.616933] x5 : 0000000000057fa8 x4 : ffffbf0236107968
[ 4.622618] x3 : 00000000ffffffff x2 : ffffbf02345f7380
[ 4.627697] x1 : ffffbf02360f2680 x0 : 0000000100010001
[ 4.633291] Call trace:
[ 4.635489] tegra194_cbb_err_isr+0x188/0x1a0
[ 4.640033] __handle_irq_event_percpu+0x60/0x2a0
[ 4.644582] handle_irq_event_percpu+0x3c/0xa0
[ 4.649043] handle_irq_event+0x4c/0xf0
[ 4.652806] handle_fasteoi_irq+0xbc/0x170
[ 4.656833] generic_handle_irq+0x3c/0x60
[ 4.660634] __handle_domain_irq+0x6c/0xc0
[ 4.664800] efi_header_end+0xa8/0xf0
[ 4.668301] el1_irq+0xd0/0x180
[ 4.671292] cpuidle_enter_state+0xb4/0x400
[ 4.675731] cpuidle_enter+0x3c/0x50
[ 4.679232] call_cpuidle+0x40/0x70
[ 4.682748] do_idle+0x1fc/0x260
[ 4.686159] cpu_startup_entry+0x2c/0x70
[ 4.689909] rest_init+0xd8/0xe4
[ 4.693162] arch_call_rest_init+0x14/0x1c
[ 4.697087] start_kernel+0x50c/0x540
[ 4.700691] Code: a9425bf5 a9446bf9 a8c77bfd d65f03c0 (d4210000)
[ 4.706886] ---[ end trace 71baef3ab592dcd1 ]---
[ 4.711615] Kernel panic - not syncing: Oops - BUG: Fatal exception in interrupt
[ 4.719131] SMP: stopping secondary CPUs
[ 4.722746] Kernel Offset: 0x3f0224440000 from 0xffff800010000000
[ 4.729105] PHYS_OFFSET: 0xffffb624c0000000
[ 4.733219] CPU features: 0x8240002,03802a30
[ 4.737677] Memory Limit: none
[ 4.740576] ---[ end Kernel panic - not syncing: Oops - BUG: Fatal exception in interrupt ]---

It’s a known issue and is expected to be fixed in the next release.

Thanks

Hi Shane

Could you please share which release will fix this issue? JP5.1, right? Thanks.

I suppose so, yes.

Thanks