Can I automatically rebooting when startup fails?

Jetson AGX XAVIER
L4T 32.7.2

Hi.
Let me ask a question about rebooting when startup fails.

If I have “A/B Redundancy” enabled and one slot fails to start, is there any way to have it automatically restart and start in the other slot?

When a startup fails using this forum procedure, the following log is displayed and operation stops.

[0019.302] panic (caller 0xa0601238): die
[0019.306] HALT: spinning forever...

We have verified that after about 7 minutes the logs start flowing again and after 7 times it starts in recovery mode,
but we would like to have it automatically repeat the reboot upon startup failure and start from the other slot.

I don’t think it will be “start in recovery mode”. Are you sure you are using AGX Xavier devkit?

Yes.
When the log is restarted 7 times, it will start in recovery mode.
When rebooting by unplugging and plugging in the power cord 7 times, it starts from the other slot.

Hi,

Do you mean you only boot slot A and it didn’t go into slot B but directly went into recovery mode?

Yes.
If I wanted to start in slot B, I had to reboot manually.

Sorry that I don’t quite understand. Do you mean it can still boot from slot B but it will always go to recovery mode automatically?

Or something else?

Sorry for the explanation that is difficult to convey.

Let me explain using the Update Engine state machine flow.

When in S5 state, if the Retry_count reaches 0, it is said to start in slot B.
However, there was no automatic restart, and it would not start in slot B unless it was manually restarted.

So I would like to know if there is a way to do an automatic restart when in S5 state.

Could you share the error log in S5 state which causes you not able to switch to slot B automatically?

The log is shown below.

[ 5318.214108] watchdog: watchdog0: watchdog did not stop!
[ 5318.221469] systemd-shutdow: 38 output lines suppressed due to ratelimiting
[ 5319.286795] reboot: Restarting system
ÿäÿâShutdown state requested 1
Rebooting system ...
ÿâ
[0000.054] W> RATCHET: MB1 binary ratchet value 4 is too large than ratchet level 2 from HW fuses.
[0000.062] I> MB1 (prd-version: 1.5.1.9-t194-41334769-73a9b7ef)
[0000.067] I> Boot-mode: Coldboot
[0000.070] I> Chip revision : A02 
[0000.073] I> Bootrom patch version : 15 (correctly patched)
[0000.079] I> ATE fuse revision : 0x200
[0000.082] I> Ram repair fuse : 0x0
[0000.085] I> Ram Code : 0x2
[0000.088] I> rst_source : 0xb
[0000.090] I> rst_level : 0x1
[0000.094] I> Boot-device: eMMC
[0000.109] I> sdmmc DDR50 mode
[0000.113] I> Active Boot chain : 0
[0000.116] I> Boot-device: eMMC
[0000.120] W> MB1_PLATFORM_CONFIG: device prod data is empty in MB1 BCT.
[0000.126] I> Temperature = 42500
[0000.129] W> Skipping boost for clk: BPMP_CPU_NIC
[0000.133] W> Skipping boost for clk: BPMP_APB
[0000.137] W> Skipping boost for clk: AXI_CBB
[0000.141] W> Skipping boost for clk: AON_CPU_NIC
[0000.146] W> Skipping boost for clk: CAN1
[0000.149] W> Skipping boost for clk: CAN2
[0000.154] I> Boot-device: eMMC
[0000.156] I> Boot-device: eMMC
[0000.166] I> Sdmmc: HS400 mode enabled
[0000.170] I> ECC region[0]: Start:0x0, End:0x0
[0000.175] I> ECC region[1]: Start:0x0, End:0x0
[0000.179] I> ECC region[2]: Start:0x0, End:0x0
[0000.183] I> ECC region[3]: Start:0x0, End:0x0
[0000.187] I> ECC region[4]: Start:0x0, End:0x0
[0000.191] I> Non-ECC region[0]: Start:0x80000000, End:0x100000000
[0000.197] I> Non-ECC region[1]: Start:0x0, End:0x0
[0000.201] I> Non-ECC region[2]: Start:0x0, End:0x0
[0000.206] I> Non-ECC region[3]: Start:0x0, End:0x0
[0000.210] I> Non-ECC region[4]: Start:0x0, End:0x0
[0000.216] E> FAILED: Thermal config
[0000.223] E> FAILED: MEMIO rail config
[0000.241] I> Boot-device: eMMC
[0000.250] I> sdmmc bdev is already initialized
[0000.325] I> MB1 done

ÿýÿàmain enter
SPE VERSION #: R01.00.14 Created: Sep 19 2018 @ 11:03:21
HW Function test
Start Scheduler.
in late init
ÿâ
[0000.333] I> Welcome to MB2(TBoot-BPMP) (version: 00.00.2018.32-mobile-6fc80c72)
[0000.334] I> DMA Heap @ [0x526fa000 - 0x52ffa000]
[0000.335] I> Default Heap @ [0xd486400 - 0xd48a400]
[0000.335] E> DEVICE_PROD: Invalid value data = 70020000, size = 0.
[0000.341] W> device prod register failed
[0000.345] I> gpio framework initialized
[0000.349] I> tegrabl_gpio_driver_register: register 'nvidia,tegra194-gpio' driver
[0000.356] I> tegrabl_gpio_driver_register: register 'nvidia,tegra194-gpio-aon' driver
[0000.364] I> No valid sdcard_params in mb1_bct
[0000.368] I> Boot-device: eMMC
[0000.371] I> Boot_device: SDMMC_BOOT instance: 3
[0000.380] I> sdmmc-3 params source = boot args
[0000.381] I> sdmmc bdev is already initialized
[0000.384] I> sdmmc-3 params source = boot args
[0000.418] I> Found 17 partitions in SDMMC_BOOT (instance 3)
[0000.434] I> Found 43 partitions in SDMMC_USER (instance 3)
[0000.435] I> Active Boot chain : 0
[0000.441] I> parsing oem signed section of bpmp-fw header done
[0000.448] I> bpmp-fw binary init read from storage
[0000.449] I> oem authentication of bpmp-fw header done
[0000.451] I> bpmp-fw binary done read from storage
[0000.452] I> bpmp-fw: Authentication init Done
[0000.459] I> parsing oem signed section of cpubl header done
[0000.465] I> cpubl binary init read from storage
[0000.466] I> bpmp-fw: Authentication Finalize Done
[0000.467] I> oem authentication of cpubl header done
[0000.467] I> cpubl binary done read from storage
[0000.468] I> cpubl: Authentication init Done
[0000.476] I> parsing oem signed section of rce header done
[0000.483] I> rce binary init read from storage
[0000.483] I> Relocating BR-BCT
[0000.484] I> cpubl: Authentication Finalize Done
[0000.487] I> oem authentication of rce header done
[0000.491] I> rce binary done read from storage
[0000.496] I> rce: Authentication init Done
[0000.506] I> parsing oem signed section of ape header done
[0000.512] I> ape binary init read from storage
[0000.513] I> rce: Authentication Finalize Done
[0000.514] I> oem authentication of ape header done
[0000.518] I> ape binary done read from storage
[0000.522] I> ape: Authentication init Done
[0000.533] I> parsing oem signed section of tos header done
[0000.539] I> tos binary init read from storage
[0000.540] I> ape: Authentication Finalize Done
[0000.541] I> oem authentication of tos header done
[0000.545] I> tos binary done read from storage
[0000.549] I> tos: Authentication init Done
[0000.559] I> parsing oem signed section of bpmp-fw-dtb header done
[0000.566] I> bpmp-fw-dtb binary init read from storage
[0000.567] I> tos: Authentication Finalize Done
[0000.570] I> oem authentication of bpmp-fw-dtb header done
[0000.577] I> bpmp-fw-dtb binary done read from storage
[0000.579] I> bpmp-fw-dtb: Authentication init Done
[0000.590] I> parsing oem signed section of cpubl-dtb header done
[0000.596] I> cpubl-dtb binary init read from storage
[0000.597] I> bpmp-fw-dtb: Authentication Finalize Done
[0000.632] I> oem authentication of cpubl-dtb header done
[0000.633] I> cpubl-dtb binary done read from storage
[0000.633] I> cpubl-dtb: Authentication init Done
[0000.640] I> parsing oem signed section of eks header done
[0000.647] I> eks binary init read from storage
[0000.647] I> cpubl-dtb: Authentication Finalize Done
[0000.648] I> oem authentication of eks header done
[0000.648] I> eks binary done read from storage
[0000.649] I> eks: Authentication init Done
[0000.649] I> eks: Authentication Finalize Done
[0000.653] I> EKB detected (length: 0x410) @ VA:0x52700400
ÿäNOTICE:  BL31: v1.3(release):b5eeb33
NOTICE:  BL31: Built : 02:21:00, Apr 17 2022
ipc-unittest-main: 1519: Welcome to IPC unittest!!!
ipc-unittest-main: 1531: waiting forever
ipc-unittest-srv: 329: Init unittest services!!!
hwkey-agent: 41: hwkey-agent is running!!
hwkey-agent: 347: key_mgnt_processing .......
hwkey-agent: 255: Setting EKB key 0 to slot 14
hwkey-agent: 178: Init hweky-agent services!!
luks-srv: 40: luks-srv is running!!
luks-srv: 157: Init luks-srv IPC services!!
platform_bootstrap_epilog: trusty bootstrap complete
ÿâ

welcome to lk
calling constructors
initializing heap
creating bootstrap completion thread
top of bootstrap2()
initializing platform
bpmp: platform_init
tag is e73a758761f0c6d24a1e69a2ac6b5035
tag_show initialized
dt initialized
mail initialized
chipid initialized
fuse initialized
sku initialized
speedo initialized
ec_get_ec_list: found 45 ecs
ec initialized
ec_mrq initialized
vmon_populate_monitors: found 3 monitors
vmon initialized
adc initialized
fmon_populate_monitors: found 73 monitors
fmon initialized
fmon_mrq initialized
reset initialized
nvhs initialized
392 clocks registered
WARNING: pll_c4 has no dyn ramp
clk_mrq_init: mrq handler registered
clk initialized
nvlink initialized
io_dpd initialized
io_dpd initialized
thermal initialized
i2c5 controller initialized
initialized i2c mrq handling
i2c initialized
regulator initialized
avfs_clk_platform initialized
soctherm initialized
aotag initialized
powergate initialized
dvs initialized
pm initialized
pg_late initialized
strap initialized
tag initialized
emc initialized
clk_dt initialized
avfs_ccplex_platform initialized
tj_max: dt node not found
tj_init initialized
uphy_mrq_init: mrq handler registered
uphy_dt initialized
uphy initialized
safereg_init: period 80 ms
ec_late initialized
mrq initialized
ÿá
[0001.158] I> Welcome to Cboot
ÿâfmon_post initialized
ÿá[0001.158] I> Cboot Version: t194-38d30025
[0001.159] I> CPU-BL Params @ 0xf2820000
[0001.159] I>  0) Base:0x00000000 Size:0x00000000
[0001.162] I>  1) Base:0xf1100000 Size:0x00100000
[0001.167] I>  2) Base:0xf2000000 Size:0x00200000
[0001.171] I>  3) Base:0xf1200000 Size:0x00200000
[0001.176] I>  4) Base:0xf1000000 Size:0x00100000
[0001.180] I>  5) Base:0xf0f00000 Size:0x00100000
[0001.185] I>  6) Base:0xf3800000 Size:0x00400000
ÿâclk_set_parent failed for clk i2c2, parent pll_aon (-22)
clk_set_parent failed for clk i2c8, parent pll_aon (-22)
clk_dt_late initialized
machine_check initialized
pm_post initialized
dbells initialized
avfs_clk_platform_post initialized
dmce initialized
cvc initialized
ccplex_avfs_hw_init: nafll_cluster0: not monitored
ccplex_avfs_hw_init: nafll_cluster1: not monitored
ccplex_avfs_hw_init: nafll_cluster2: not monitored
ccplex_avfs_hw_init: nafll_cluster3: not monitored
avfs_clk_mach_post initialized
regulator_post initialized
rm initialized
sc7_diag initialized
thermal_test initialized
serial_late initialized
clk_post initialized
clk_dt_post initialized
mc_reg initialized
pg_post initialized
dyn_modules initialized
sku_debugfs initialized
speedo_debugfs initialized
adc_debugfs initialized
clk_debugfs initialized
ÿá[0001.189] I>  7) Base:0xf1c00000 Size:0x00400000
[0001.269] I>  8) Base:0xf0e00000 Size:0x00100000
[0001.273] I>  9) Base:0xf0d00000 Size:0x00100000
[0001.277] I> 10) Base:0xf3000000 Size:0x00800000
[0001.282] I> 11) Base:0x40000000 Size:0x00040000
[0001.286] I> 12) Base:0xf0c00000 Size:0x00100000
[0001.291] I> 13) Base:0x40046000 Size:0x00002000
[0001.295] I> 14) Base:0x40048000 Size:0x00002000
[0001.300] I> 15) Base:0xac000000 Size:0x00004000
[0001.304] I> 16) Base:0x4004a000 Size:0x00002000
[0001.309] I> 17) Base:0xf0b00000 Size:0x00100000
ÿâemc_debugfs initialized
dvs_debugfs initialized
fmon_debugfs initialized
vmon_debugfs initialized
pg_debugfs initialized
profile_fs initialized
debugfs_cons initialized
mail_fs initialized
profile initialized
cvc_debugfs initialized
dmce_debugfs initialized
ec_debugfs initialized
rm_debugfs initialized
soctherm_debug initialized
gr_reader initialized
mods initialized
dt_fs initialized
debugfs_mrq initialized
debug_mrq initialized
debug_safereg initialized
initializing target
calling apps_init()
starting app shell
entering main console loop
] ÿá[0001.313] I> 18) Base:0x4004c000 Size:0x00002000
[0001.368] I> 19) Base:0xf2200000 Size:0x00600000
[0001.372] I> 20) Base:0x4004e000 Size:0x00002000
[0001.377] I> 21) Base:0xf0ad0000 Size:0x0000c000
[0001.381] I> 22) Base:0x00000000 Size:0x00000000
[0001.386] I> 23) Base:0xf0ae0000 Size:0x00020000
[0001.390] I> 24) Base:0xf6000000 Size:0x02000000
[0001.395] I> 25) Base:0x40050000 Size:0x00002000
[0001.399] I> 26) Base:0x40040000 Size:0x00006000
[0001.404] I> 27) Base:0xf1800000 Size:0x00400000
[0001.408] I> 28) Base:0xf4c00000 Size:0x01400000
[0001.413] I> 29) Base:0xf1400000 Size:0x00400000
[0001.417] I> 30) Base:0x00000000 Size:0x00000000
[0001.422] I> 31) Base:0x00000000 Size:0x00000000
[0001.426] I> 32) Base:0xf8000000 Size:0x08000000
[0001.430] I> 33) Base:0x00000000 Size:0x00000000
[0001.435] I> 34) Base:0xf3c00000 Size:0x01000000
[0001.439] I> 35) Base:0xab000000 Size:0x01000000
[0001.444] I> 36) Base:0xa0000000 Size:0x0b000000
[0001.448] I> 37) Base:0xf2800000 Size:0x00800000
[0001.453] I> 38) Base:0x80000000 Size:0x20000000
[0001.457] I> 39) Base:0xb0000000 Size:0x08000000
[0001.462] I> 40) Base:0x00000000 Size:0x00000000
[0001.466] I> 41) Base:0x00000000 Size:0x00000000
[0001.471] I> 42) Base:0x00000000 Size:0x00000000
[0001.475] I> 43) Base:0x00000000 Size:0x00000000
[0001.480] I> 44) Base:0x00000000 Size:0x00000000
[0001.484] I> 45) Base:0x00000000 Size:0x00000000
[0001.488] GIC-SPI Target CPU: 0
[0001.491] Interrupts Init done
[0001.494] calling constructors
[0001.497] initializing heap
[0001.500] I> Heap: [0xa069ab80 ... 0xab000000]
[0001.504] initializing threads
[0001.507] initializing timers
[0001.510] creating bootstrap completion thread
[0001.514] top of bootstrap2()
[0001.517] CPU: MIDR: 0x4E0F0040, MPIDR: 0x80000000
[0001.522] initializing platform
[0001.525] E> DEVICE_PROD: Invalid value data = 0, size = 0.
[0001.530] W> device prod register failed
[0001.534] I> Bl_dtb @0xaaf00000
[0001.540] W> "plugin-manager" doesn't exist, creating
[0001.542] W> "ids" doesn't exist, creating
[0001.546] W> "connection" doesn't exist, creating
[0001.550] W> "configs" doesn't exist, creating
[0001.565] E> failed to read label property for node 227404: 13
[0001.568] E> failed to read reg property for node 227492: 13
[0001.570] E> failed to read label property for node 227544: 13
[0001.573] E> failed to read label property for node 227612: 13
[0001.579] E> failed to read label property for node 227648: 13
[0001.584] E> failed to read label property for node 227684: 13
[0001.589] E> failed to read reg property for node 227752: 13
[0001.595] E> failed to read label property for node 227804: 13
[0001.601] I> Find /i2c@3160000's alias i2c0
[0001.604] I> Reading eeprom i2c=0 address=0x50
[0001.633] I> Device at /i2c@3160000:0x50
[0001.633] I> Reading eeprom i2c=0 address=0x56
[0001.658] I> Device at /i2c@3160000:0x56
[0001.659] I> Find /i2c@3180000's alias i2c2
[0001.659] I> Reading eeprom i2c=2 address=0x54
[0001.661] E> I2C: slave not found in slaves.
[0001.661] E> I2C: Could not write 0 bytes to slave: 0x00a8 with repeat start true.
[0001.662] E> I2C_DEV: Failed to send register address 0x00000000.
[0001.663] E> I2C_DEV: Could not read 256 registers of size 1 from slave 0xa8 at 0x00000000 via instance 2.
[0001.672] E> eeprom: Failed to read I2C slave device
[0001.677] I> Eeprom read failed 0x3526070d
[0001.681] I> Reading eeprom i2c=2 address=0x57
[0001.685] E> I2C: slave not found in slaves.
[0001.689] E> I2C: Could not write 0 bytes to slave: 0x00ae with repeat start true.
[0001.697] E> I2C_DEV: Failed to send register address 0x00000000.
[0001.702] E> I2C_DEV: Could not read 256 registers of size 1 from slave 0xae at 0x00000000 via instance 2.
[0001.712] E> eeprom: Failed to read I2C slave device
[0001.717] I> Eeprom read failed 0x3526070d
[0001.721] I> Reading eeprom i2c=2 address=0x52
[0001.725] E> I2C: slave not found in slaves.
[0001.729] E> I2C: Could not write 0 bytes to slave: 0x00a4 with repeat start true.
[0001.737] E> I2C_DEV: Failed to send register address 0x00000000.
[0001.743] E> I2C_DEV: Could not read 256 registers of size 1 from slave 0xa4 at 0x00000000 via instance 2.
[0001.752] E> eeprom: Failed to read I2C slave device
[0001.757] I> Eeprom read failed 0x3526070d
[0001.762] I> Find /i2c@c240000's alias i2c1
[0001.765] I> Reading eeprom i2c=1 address=0x52
[0001.771] E> I2C: slave not found in slaves.
[0001.773] E> I2C: Could not write 0 bytes to slave: 0x00a4 with repeat start true.
[0001.781] E> I2C_DEV: Failed to send register address 0x00000000.
[0001.787] E> I2C_DEV: Could not read 256 registers of size 1 from slave 0xa4 at 0x00000000 via instance 1.
[0001.796] E> eeprom: Retry to read I2C slave device.
[0001.801] E> I2C: slave not found in slaves.
[0001.805] E> I2C: Could not write 0 bytes to slave: 0x00a4 with repeat start true.
[0001.813] E> I2C_DEV: Failed to send register address 0x00000000.
[0001.819] E> I2C_DEV: Could not read 256 registers of size 1 from slave 0xa4 at 0x00000000 via instance 1.
[0001.828] E> eeprom: Failed to read I2C slave device
[0001.833] I> Eeprom read failed 0x3526070d
[0001.837] I> Reading eeprom i2c=1 address=0x50
[0001.841] E> I2C: slave not found in slaves.
[0001.845] E> I2C: Could not write 0 bytes to slave: 0x00a0 with repeat start true.
[0001.853] E> I2C_DEV: Failed to send register address 0x00000000.
[0001.859] E> I2C_DEV: Could not read 256 registers of size 1 from slave 0xa0 at 0x00000000 via instance 1.
[0001.868] E> eeprom: Retry to read I2C slave device.
[0001.873] E> I2C: slave not found in slaves.
[0001.877] E> I2C: Could not write 0 bytes to slave: 0x00a0 with repeat start true.
[0001.885] E> I2C_DEV: Failed to send register address 0x00000000.
[0001.890] E> I2C_DEV: Could not read 256 registers of size 1 from slave 0xa0 at 0x00000000 via instance 1.
[0001.900] E> eeprom: Failed to read I2C slave device
[0001.905] I> Eeprom read failed 0x3526070d
[0001.909] I> create_pm_ids: id: 2888-0004-400-K, len: 15
[0001.914] I> config: mem-type:00,power-config:00,misc-config:00,modem-config:00,touch-config:00,display-config:00,, len: 93
[0001.925] I> create_pm_ids: id: 2822-0000-600-G, len: 15
[0001.930] I> config: mem-type:00,power-config:00,misc-config:00,modem-config:00,touch-config:00,display-config:00,, len: 93
[0001.941] I> Adding plugin-manager/ids/2888-0004-400=/i2c@3160000:module@0x50
[0001.949] W> "i2c@3160000" doesn't exist, creating
[0001.953] W> "module@0x50" doesn't exist, creating
[0001.958] I> Adding plugin-manager/ids/2822-0000-600=/i2c@3160000:module@0x56
[0001.965] W> "module@0x56" doesn't exist, creating
[0001.970] I> Adding plugin-manager/cvm
[0001.973] W> "chip-id" doesn't exist, creating
[0001.977] I> Adding plugin-manager/chip-id/A02
[0001.981] I> Plugin-manager override starting
[0001.986] I> node /plugin-manager/fragement-tegra-wdt-en matches
[0001.994] I> node /plugin-manager/fragement-soft-wdt matches
[0002.002] I> node /plugin-manager/fragment-pcie-c5-rp matches
[0002.006] I> node /plugin-manager/fragment-tegra-ufs-lane10 matches
[0002.018] I> Disable plugin-manager status in FDT
[0002.019] I> Plugin-manager override finished successfully
[0002.019] I> gpio framework initialized
[0002.023] I> tegrabl_gpio_driver_register: register 'nvidia,tegra194-gpio' driver
[0002.031] I> tegrabl_gpio_driver_register: register 'nvidia,tegra194-gpio-aon' driver
[0002.037] I> tegrabl_tca9539_init: i2c bus: 1, slave addr: 0x46
[0002.045] W> fetch_driver_phandle_from_dt: failed to get node with compatible ti,tca9539
[0002.052] W> fetch_driver_phandle_from_dt: failed to get node with compatible nxp,tca9539
[0002.059] W> tegrabl_tca9539_init: failed to fetch phandle from dt
[0002.065] I> tegrabl_tca9539_init: i2c bus: 1, slave addr: 0x44
[0002.072] W> fetch_driver_phandle_from_dt: failed to get node with compatible ti,tca9539
[0002.080] W> fetch_driver_phandle_from_dt: failed to get node with compatible nxp,tca9539
[0002.087] W> tegrabl_tca9539_init: failed to fetch phandle from dt
[0002.094] I> fixed regulator driver initialized
[0002.106] I> register 'maxim' power off handle
[0002.107] I> virtual i2c enabled
[0002.107] I> registered 'maxim,max20024' pmic
[0002.109] I> tegrabl_gpio_driver_register: register 'max20024-gpio' driver
[0002.115] I> Boot-device: eMMC
[0002.118] I> Boot_device: SDMMC_BOOT instance: 3
[0002.127] I> sdmmc-3 params source = boot args
[0002.127] I> create_pm_ids: id: 2888-0004-400-K, len: 15
[0002.132] I> config: mem-type:00,power-config:00,misc-config:00,modem-config:00,touch-config:00,display-config:00,, len: 93
[0002.143] I> create_pm_ids: id: 2822-0000-600-G, len: 15
[0002.148] I> config: mem-type:00,power-config:00,misc-config:00,modem-config:00,touch-config:00,display-config:00,, len: 93
[0002.159] I> sdmmc bdev is already initialized
[0002.164] I> sdmmc-3 params source = boot args
[0002.194] I> Found 17 partitions in SDMMC_BOOT (instance 3)
[0002.207] I> Found 43 partitions in SDMMC_USER (instance 3)
[0002.218] I> enabling 'vdd-hdmi-5v0' regulator
[0002.224] I> regulator 'vdd-hdmi-5v0' already enabled
[0002.224] I> hdmi cable connected
[0002.227] W> set volts not configured for 'vdd-1v0'
[0002.229] W> set volts not configured for 'vdd-1v8-hs'
[0002.230] I> retrieved tmds range from prod_list_hdmi_soc
[0002.235] E> invalid display type
[0002.239] E> invalid display type
[0002.240] E> cannot find any other nvdisp nodes
[0002.256] I> edid read success
[0002.268] I> edid read success
[0002.268] I> width = 640, height = 480, frequency = 25174825
[0002.269] I> width = 1920, height = 1080, frequency = 148500000
[0002.269] I> width = 1280, height = 720, frequency = 74250000
[0002.270] I> width = 640, height = 480, frequency = 25174825
[0002.270] I> width = 720, height = 480, frequency = 26973026
[0002.274] I> width = 720, height = 480, frequency = 26973026
[0002.279] I> width = 1280, height = 720, frequency = 74175824
[0002.285] I> width = 1920, height = 1080, frequency = 148351648
[0002.291] I> Best mode Width = 1920, Height = 1080, freq = 148351648
[0002.301] I> hdmi_enable, starting HDMI initialisation
[0002.306] I> hdmi_enable, HDMI initialisation complete
[0002.316] I> Load in CBoot Boot Options partition and parse it
[0002.323] E> Error -9 when finding node with path /boot-configuration
[0002.323] E> tegrabl_cbo_parse_info: "boot-configuration" not found in CBO file.
[0002.326] I> Using default boot order
[0002.330] I> boot-dev-order :-
[0002.333] I> 1.sd
[0002.334] I> 2.usb
[0002.336] I> 3.nvme
[0002.338] I> 4.emmc
[0002.340] I> 5.net
[0002.342] I> Hit any key to stop autoboot:	4	3	2	1
[0004.350] initializing target
[0004.350] calling apps_init()
[0004.351] starting app kernel_boot_app
[0004.370] I> found decompressor handler: lz4-legacy
[0004.371] I> decompressing BMP blob ...
[0004.382] I> Kernel type = Normal
[0004.383] I> ########## SD (0) boot ##########
[0004.383] I> No sdcard
[0004.383] I> sdcard-0 params source = DT-BL
[0004.384] E> Blockdev open: exit error
[0004.384] E> SD boot failed, err: 724238353
[0004.384] I> ########## USB (0) boot ##########
[0004.395] I> Validate XUSB-FW ...
[0004.396] I> T19x: Authenticate XUSB-FW (bin_type: 11), max size 0x28000
[0004.397] I> Encryption fuse is not ON
[0004.398] I> USB Firmware Version: 60.09 release
[0004.456] I> regulator of usb2-0 already enabled
[0004.465] I> regulator of usb2-1 already enabled
[0004.474] I> regulator of usb2-2 already enabled
[0004.485] I> enabling 'vdd-5v-sata' regulator
[0005.554] I> USB 2.0 port 4 new low-speed USB device detected
[0005.556] W> WARNING: event and command not matching, cmd_trb_ptr = 0xa4ad3000, cmd_ring.dma = 0xa4ad30c0
[0005.657] I> Start to enumerate device
[0005.658] W> handle_command_completion_event: WARNING: Command was not successfully completed (0x11)
[0008.666] W> xusbh_wait_irq: Timed out! status = 0x00
[0011.674] W> xusbh_wait_irq: Timed out! status = 0x00
[0011.675] E> Failed to enumerate USB device
[0011.675] E> failed to start xhci controller
[0011.675] E> Error in init of XUSB host driver, err: 7979000c
[0011.676] W> Failed to initialize device 5-0
[0011.676] E> USB boot failed, err: 2037973004
[0011.676] I> ########## NVME (0) boot ##########
[0011.681] I> Initializing nvme device instance 0
[0011.685] I> Initializing nvme controller
[0011.689] I> tegrabl_locate_pcie_ctrl_in_dt: found match at 0x14180000
[0011.695] I> vpcie3v3-supply not found
[0011.699] I> vpcie12v-supply not found
[0011.703] W> Failed to get nvidia,plat-gpios
[0011.707] I> tegrabl_pcie_soc_preinit: (0):
[0011.711] I> Unpowergate
[0011.713] I> tegrabl_car_clk_disable(0) ...
[0011.717] I> tegrabl_car_rst_set(CORE, 0) ...
[0011.721] I> tegrabl_car_rst_set(APB, 0) ...
[0011.725] I> tegrabl_car_clk_enable(0) ...
[0011.729] I> tegrabl_car_rst_clear(APB, 0) ...
[0011.734] I> tegrabl_set_ctrl_state(0)
[0011.737] I> CLR PCIE_APB:6
[0011.740] I> tegrabl_pcie_soc_init: (0):
[0011.744] I> APPL initialization ...
[0011.747] I> poweron phys
[0011.750] I> tegrabl_locate_pcie_ctrl_in_dt: found match at 0x14180000
[0011.756] I> tegrabl_power_on_phy: power on phy @0x3e30000
[0011.761] I> tegrabl_power_on_phy: power on phy @0x3e40000
[0011.767] I> tegrabl_power_on_phy: power on phy @0x3e50000
[0011.772] I> tegrabl_power_on_phy: power on phy @0x3e60000
[0012.877] C> Failed to link up controller-0
[0012.878] I> PCIe (0) Link is not UP
[0012.878] W> Failed tegrabl_pcie_soc_init(), error=0x12
[0012.879] I> Failed to initialize SoC Host PCIe controller
[0012.879] E> tegrabl_nvme_init: Failed tegrabl_pcie_init(0); error=0x12
[0012.880] W> tegrabl_nvme_bdev_open: Failed NVME INIT; error=0x80800612
[0012.883] W> Failed to open NVME-0, err = 80800612
[0012.888] W> Failed to initialize device 10-0
[0012.892] E> NVME (0) boot failed, err: 0x80800612
[0012.897] I> ########## NVME (1) boot ##########
[0012.901] I> Initializing nvme device instance 1
[0012.905] I> Initializing nvme controller
[0012.910] I> tegrabl_locate_pcie_ctrl_in_dt: found match at 0x14100000
[0012.916] I> vpcie3v3-supply not found
[0012.919] I> vpcie12v-supply not found
[0012.923] W> Failed to get nvidia,plat-gpios
[0012.927] I> tegrabl_pcie_soc_preinit: (1):
[0012.931] I> Unpowergate
[0012.934] I> tegrabl_car_clk_disable(1) ...
[0012.937] I> tegrabl_car_rst_set(CORE, 1) ...
[0012.942] I> tegrabl_car_rst_set(APB, 1) ...
[0012.946] I> tegrabl_car_clk_enable(1) ...
[0012.950] I> tegrabl_car_rst_clear(APB, 1) ...
[0012.954] I> tegrabl_set_ctrl_state(1)
[0012.958] I> CLR PCIE_APB:6
[0012.960] I> tegrabl_pcie_soc_init: (1):
[0012.964] I> APPL initialization ...
[0012.967] I> poweron phys
[0012.970] I> tegrabl_locate_pcie_ctrl_in_dt: found match at 0x14100000
[0012.976] I> tegrabl_power_on_phy: power on phy @0x3e10000
[0013.083] I> PCIe controller-1 link is up
[0013.083] I> tegra_pcie_info[1].cfg0_base = 0x30000000
[0013.084] I> tegra_pcie_info[1].cfg1_base = 0x30020000
[0013.084] I> tegra_pcie_info[1].atu_dma_base = 0x30040000
[0013.085] I> tegra_pcie_bus[1].mem = 0x30200000
[0013.085] I> Scanning busnr: 0 devfn: 0
[0013.085] I> PCIe IDs: 0x1ad210de
[0013.088] I> PCIe RID_CC: 0x60400a1
[0013.092] I> Scanning busnr: 1 devfn: 0
[0013.095] I> PCIe IDs: 0x91711b4b
[0013.099] I> PCIe RID_CC: 0x1060113
[0013.102] I> PCI Config: I/O=0x30100000, Memory=0x30200000
[0013.107] I> IO bar_num=0 bar=0x30100000
[0013.111] I> IO bar_num=1 bar=0x30100008
[0013.115] I> IO bar_num=2 bar=0x30100010
[0013.119] I> IO bar_num=3 bar=0x30100018
[0013.122] I> IO bar_num=4 bar=0x30100020
[0013.126] I> MEM bar_num=5 bar=0x30200000
[0013.130] I> Number of PCIe devices detected: 2
[0013.134] E> tegrabl_nvme_init: Failed tegrabl_pcie_get_dev(1); error=0x0
[0013.141] I> PCIe (1) link is UP
[0013.154] E> Link didn't transition to L2 state
[0013.185] W> tegrabl_nvme_bdev_open: Failed NVME INIT; error=0x80800612
[0013.185] W> Failed to open NVME-1, err = 80800612
[0013.186] W> Failed to initialize device 10-1
[0013.186] E> NVME (1) boot failed, err: 0x80800612
[0013.186] I> ########## NVME (2) boot ##########
[0013.187] I> Initializing nvme device instance 2
[0013.189] I> Initializing nvme controller
[0013.193] I> tegrabl_locate_pcie_ctrl_in_dt: found match at 0x14120000
[0013.199] I> vpcie3v3-supply not found
[0013.202] I> vpcie12v-supply not found
[0013.206] W> Failed to get nvidia,plat-gpios
[0013.210] I> tegrabl_pcie_soc_preinit: (2):
[0013.214] I> Unpowergate
[0013.217] I> tegrabl_car_clk_disable(2) ...
[0013.221] I> tegrabl_car_rst_set(CORE, 2) ...
[0013.225] I> tegrabl_car_rst_set(APB, 2) ...
[0013.229] I> tegrabl_car_clk_enable(2) ...
[0013.233] I> tegrabl_car_rst_clear(APB, 2) ...
[0013.237] I> tegrabl_set_ctrl_state(2)
[0013.241] I> CLR PCIE_APB:6
[0013.243] I> tegrabl_pcie_soc_init: (2):
[0013.247] I> APPL initialization ...
[0013.250] I> poweron phys
[0013.253] I> tegrabl_locate_pcie_ctrl_in_dt: found match at 0x14120000
[0013.259] I> tegrabl_power_on_phy: controller 2 not available
[0013.265] E> Failed to power on phy on controller-2
[0013.270] I> PCIe (2) Link is not UP
[0013.273] W> Failed tegrabl_pcie_soc_init(), error=0x1
[0013.278] I> Failed to initialize SoC Host PCIe controller
[0013.283] E> tegrabl_nvme_init: Failed tegrabl_pcie_init(2); error=0x1
[0013.290] W> tegrabl_nvme_bdev_open: Failed NVME INIT; error=0x80800601
[0013.296] W> Failed to open NVME-2, err = 80800601
[0013.301] W> Failed to initialize device 10-2
[0013.305] E> NVME (2) boot failed, err: 0x80800601
[0013.310] I> ########## NVME (3) boot ##########
[0013.314] I> Initializing nvme device instance 3
[0013.319] I> Initializing nvme controller
[0013.323] I> tegrabl_locate_pcie_ctrl_in_dt: found match at 0x14140000
[0013.329] I> vpcie3v3-supply not found
[0013.332] I> vpcie12v-supply not found
[0013.336] W> Failed to get nvidia,plat-gpios
[0013.340] I> tegrabl_pcie_soc_preinit: (3):
[0013.344] I> Unpowergate
[0013.347] I> tegrabl_car_clk_disable(3) ...
[0013.351] I> tegrabl_car_rst_set(CORE, 3) ...
[0013.355] I> tegrabl_car_rst_set(APB, 3) ...
[0013.359] I> tegrabl_car_clk_enable(3) ...
[0013.363] I> tegrabl_car_rst_clear(APB, 3) ...
[0013.367] I> tegrabl_set_ctrl_state(3)
[0013.371] I> CLR PCIE_APB:6
[0013.373] I> tegrabl_pcie_soc_init: (3):
[0013.377] I> APPL initialization ...
[0013.380] I> poweron phys
[0013.383] I> tegrabl_locate_pcie_ctrl_in_dt: found match at 0x14140000
[0013.389] I> tegrabl_power_on_phy: power on phy @0x3e80000
[0014.495] C> Failed to link up controller-3
[0014.495] I> PCIe (3) Link is not UP
[0014.496] W> Failed tegrabl_pcie_soc_init(), error=0x12
[0014.496] I> Failed to initialize SoC Host PCIe controller
[0014.497] E> tegrabl_nvme_init: Failed tegrabl_pcie_init(3); error=0x12
[0014.497] W> tegrabl_nvme_bdev_open: Failed NVME INIT; error=0x80800612
[0014.501] W> Failed to open NVME-3, err = 80800612
[0014.505] W> Failed to initialize device 10-3
[0014.509] E> NVME (3) boot failed, err: 0x80800612
[0014.514] I> ########## NVME (4) boot ##########
[0014.518] I> Initializing nvme device instance 4
[0014.523] I> Initializing nvme controller
[0014.527] I> tegrabl_locate_pcie_ctrl_in_dt: found match at 0x14160000
[0014.533] I> vpcie3v3-supply not found
[0014.537] I> vpcie12v-supply not found
[0014.540] W> Failed to get nvidia,plat-gpios
[0014.544] I> tegrabl_pcie_soc_preinit: (4):
[0014.548] I> Unpowergate
[0014.551] I> tegrabl_car_clk_disable(4) ...
[0014.555] I> tegrabl_car_rst_set(CORE, 4) ...
[0014.559] I> tegrabl_car_rst_set(APB, 4) ...
[0014.563] I> tegrabl_car_clk_enable(4) ...
[0014.567] I> tegrabl_car_rst_clear(APB, 4) ...
[0014.571] I> tegrabl_set_ctrl_state(4)
[0014.575] I> CLR PCIE_APB:6
[0014.578] I> tegrabl_pcie_soc_init: (4):
[0014.581] I> APPL initialization ...
[0014.585] I> poweron phys
[0014.588] I> tegrabl_locate_pcie_ctrl_in_dt: found match at 0x14160000
[0014.594] I> tegrabl_power_on_phy: controller 4 not available
[0014.599] E> Failed to power on phy on controller-4
[0014.604] I> PCIe (4) Link is not UP
[0014.608] W> Failed tegrabl_pcie_soc_init(), error=0x1
[0014.612] I> Failed to initialize SoC Host PCIe controller
[0014.618] E> tegrabl_nvme_init: Failed tegrabl_pcie_init(4); error=0x1
[0014.624] W> tegrabl_nvme_bdev_open: Failed NVME INIT; error=0x80800601
[0014.631] W> Failed to open NVME-4, err = 80800601
[0014.635] W> Failed to initialize device 10-4
[0014.639] E> NVME (4) boot failed, err: 0x80800601
[0014.644] I> ########## NVME (5) boot ##########
[0014.648] I> Initializing nvme device instance 5
[0014.653] I> Initializing nvme controller
[0014.657] I> tegrabl_locate_pcie_ctrl_in_dt: found match at 0x141a0000
[0014.663] I> i=0, reg_phandle=0x1b
[0014.669] I> reg_voltage=3300000
[0014.669] I> tegrabl_pcie_enable_regulators: regulator_set_voltage(0x1b, 3300000)
[0014.682] I> enabling 'vdd-3v3-pcie' regulator
[0014.683] I> i=1, reg_phandle=0x1c
[0014.687] I> reg_voltage=1200000
[0014.687] I> tegrabl_pcie_enable_regulators: regulator_set_voltage(0x1c, 1200000)
[0014.700] I> enabling 'vdd-12v-pcie' regulator
[0014.701] I> tegrabl_pcie_soc_preinit: (5):
[0014.703] I> Unpowergate
[0014.706] I> tegrabl_car_clk_disable(5) ...
[0014.710] I> tegrabl_car_rst_set(CORE, 5) ...
[0014.714] I> tegrabl_car_rst_set(APB, 5) ...
[0014.718] I> tegrabl_car_clk_enable(5) ...
[0014.722] I> tegrabl_car_rst_clear(APB, 5) ...
[0014.726] I> tegrabl_set_ctrl_state(5)
[0014.730] I> CLR PCIE_APB:6
[0014.732] I> tegrabl_pcie_soc_init: (5):
[0014.736] I> APPL initialization ...
[0014.739] I> poweron phys
[0014.742] I> tegrabl_locate_pcie_ctrl_in_dt: found match at 0x141a0000
[0014.748] I> tegrabl_power_on_phy: power on phy @0x3eb0000
[0014.754] I> tegrabl_power_on_phy: power on phy @0x3ec0000
[0014.759] I> tegrabl_power_on_phy: power on phy @0x3ed0000
[0014.764] I> tegrabl_power_on_phy: power on phy @0x3ee0000
[0014.770] I> tegrabl_power_on_phy: power on phy @0x3ef0000
[0014.775] I> tegrabl_power_on_phy: power on phy @0x3f00000
[0014.780] I> tegrabl_power_on_phy: power on phy @0x3f10000
[0014.786] I> tegrabl_power_on_phy: power on phy @0x3f20000
[0015.891] C> Failed to link up controller-5
[0015.891] I> PCIe (5) Link is not UP
[0015.892] I> tegrabl_pcie_disable_regulators: disable regulator 0x1b
[0015.895] I> disabling 'vdd-3v3-pcie' regulator
[0015.896] I> tegrabl_pcie_disable_regulators: disable regulator 0x1c
[0015.899] I> disabling 'vdd-12v-pcie' regulator
[0015.899] W> Failed tegrabl_pcie_soc_init(), error=0x12
[0015.900] I> Failed to initialize SoC Host PCIe controller
[0015.905] E> tegrabl_nvme_init: Failed tegrabl_pcie_init(5); error=0x12
[0015.912] W> tegrabl_nvme_bdev_open: Failed NVME INIT; error=0x80800612
[0015.918] W> Failed to open NVME-5, err = 80800612
[0015.923] W> Failed to initialize device 10-5
[0015.927] E> NVME (5) boot failed, err: 0x80800612
[0015.931] I> ########## Fixed storage boot ##########
[0015.936] I> Loading kernel-bootctrl from partition
[0015.941] I> Loading partition kernel-bootctrl at 0xa4b30000 from device(0x1)
[0015.954] W> tegrabl_get_kernel_bootctrl: magic number(0x00000000) is invalid
[0015.955] W> tegrabl_get_kernel_bootctrl: use default dummy boot control data
[0015.968] I> Already published: 00010003
[0015.968] I> Look for boot partition
[0015.969] I> Fallback: assuming 0th partition is boot partition
[0015.975] I> Set 0th partition is boot partition
[0015.980] I> Detect filesystem
[0016.007] I> Loading extlinux.conf ...
[0016.007] I> Loading extlinux.conf binary from rootfs ...
[0016.007] I> rootfs path: /sdmmc_user/boot/extlinux/extlinux.conf
[0016.050] I> ext4_read_data_from_extent:298: Total file read should not be larger than file stat size
[0016.051] E> file /sdmmc_user/boot/extlinux/extlinux.conf read failed!!
[0016.051] W> Failed to load extlinux.conf binary from rootfs (err=202113305)
[0016.052] E> Failed to find/load /boot/extlinux/extlinux.conf
[0016.053] I> Loading kernel ...
[0016.056] I> No kernel binary path
[0016.059] I> Continue to load from partition ...
[0016.064] I> A/B: bin_type (37) slot 0
[0016.067] I> Loading kernel from partition
[0016.071] I> Loading partition kernel at 0xa4b30000 from device(0x1)
[0016.567] I> Validate kernel ...
[0016.567] I> T19x: Authenticate kernel (bin_type: 37), max size 0x5000000
[0016.568] E> Stage2Signature validation failed with SHA2!!
[0016.568] C> OEM authentication of kernel header failed!
[0016.569] E> Failed to validate kernel binary from partition (err=1077936152)
[0016.569] I> Checking boot.img header magic ... [0016.573] E> Invalid header magic
[0016.577] E> Failed to load kernel, abort booting.
[0016.581] E> Failed extlinux boot.
[0016.585] I> SMD partition is updated.
[0016.636] I> Kernel EP: 0x0, DTB: 0x90000000
[0016.637] 
[0016.637] -----------------------------------------------
[0016.639] Synchronous Exception: UNKNOWN EXCEPTION
[0016.641] -----------------------------------------------
[0016.643] 
[0016.643] ESR 0x2000000: ec 0x0, il 0x1, iss 0x0
[0016.645] -----------------------------------------------
[0016.647]  [Stack Trace]
[0016.648] 
[0016.648] => pc:0x00000000, sp:0xA0EAE530
[0016.650] => pc:0xA060F7E4, sp:0xA0EAE760
[0016.654] => pc:0xA060F7F8, sp:0xA0EAE7B0
[0016.658] => pc:0xA060F5E0, sp:0xA0EAE7E0
[0016.662] => pc:0xA060EB54, sp:0xA0EAE7F0
[0016.665] => pc:0xA060EB28, sp:0xA0EAE800
[0016.669] -----------------------------------------------
[0016.674] iframe 0xa0eae440:
[0016.677] x0  0x        90000000 x1  0x               0 x2  0x               0 x3  0x               0
[0016.686] x4  0x               0 x5  0x              20 x6  0x         b200123 x7  0x        ffffffc0
[0016.695] x8  0x               0 x9  0xffffffffffffffff x10 0x               6 x11 0x               2
[0016.704] x12 0x               1 x13 0x              40 x14 0x               1 x15 0x             2c0
[0016.714] x16 0x            1500 x17 0x             438 x18 0x               0 x19 0x               0
[0016.723] x20 0x               0 x21 0x               0 x22 0x               0 x23 0x               0
[0016.732] x24 0x               0 x25 0x               0 x26 0x               0 x27 0x               0
[0016.741] x28 0x               0 x29 0x        a0eae760 lr  0x        a060f798 sp  0x        a0eae530
[0016.750] elr 0x               0
[0016.753] spsr 0x        400003c9
[0016.756] -----------------------------------------------
[0016.761] panic (caller 0xa0601238): die
[0016.765] HALT: spinning forever...

Hi,

Could you clarify how to reproduce your issue?

I am not sure how you get into this situation. For example, first few lines from your log show

[ 5318.214108] watchdog: watchdog0: watchdog did not stop!
[ 5318.221469] systemd-shutdow: 38 output lines suppressed due to ratelimiting
[ 5319.286795] reboot: Restarting system
ÿäÿâShutdown state requested 1
Rebooting system …

So it looks like previous one is able to boot into kernel. But this time, it even not able to load kernel. Could you clarify this part?

Thank you for checking the log.

The reason why the kernel cannot be booted is that the following command is executed to intentionally put the kernel in a state of boot failure, destroying the kernel and the recovery image.

dd if=/dev/zero of=/dev/disk/by-partlabel/kernel
dd if=/dev/zero of=/dev/disk/by-partlabel/kernel-bootctrl
dd if=/dev/zero of=/dev/disk/by-partlabel/kernel-dtb
dd if=/dev/zero of=/dev/disk/by-partlabel/recovery
dd if=/dev/zero of=/dev/disk/by-partlabel/recovery-dtb

Hi,

Is your log showing the first time boot failure or it is the log from the 7th failure?

This log is the first time boot failure.

Ok, so you problem is you just stuck here but not something related to recovery mode?

If you just put the board in this state for like 2 mins, will the WDT triggered and let it reboot?

[0016.761] panic (caller 0xa0601238): die
[0016.765] HALT: spinning forever…

Ok, so you problem is you just stuck here but not something related to recovery mode?

Yes.

If you just put the board in this state for like 2 mins, will the WDT triggered and let it reboot?

Yes. It was rebooting.

Ok… so is there still any issue?

Yes.
But there is a possibility that we can solve the problem on our end, so we will conduct a survey.
If we cannot resolve it, we will post it again in the forum.

Sorry for all the confusion.

Thanks for your support, it is much appreciated.

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.