Xavier power up Abnormally

Hi
My xavier can not power up after press the power-on button.but after pressed the power-on button,then press the rst-button,it can powered up.I changed another carrier board,it is the same.The rs232 log as below:
[0000.208] W> RATCHET: MB1 binary ratchet value 4 is too large than ratchet level 1 from HW fuses.
[0000.217] I> MB1 (prd-version: 1.5.1.0-t194-41334769-59d8a47d)
[0000.222] I> Boot-mode: Coldboot
[0000.225] I> Chip revision : A02
[0000.228] I> Bootrom patch version : 7 (correctly patched)
[0000.233] I> ATE fuse revision : 0x200
[0000.237] I> Ram repair fuse : 0x0
[0000.240] I> Ram Code : 0x0
[0000.242] I> rst_source : 0x0
[0000.245] I> rst_level : 0x0
[0000.249] I> Boot-device: eMMC
[0000.252] W> DEVICE_PROD: device prod is not initialized.
[0000.257] W> DEVICE_PROD: device prod is not initialized.
[0000.274] I> sdmmc DDR50 mode
[0000.277] W> DEVICE_PROD: device prod is not initialized.
[0000.283] I> Active Boot chain : 0
[0000.286] I> Boot-device: eMMC
[0000.290] E> MB1_PLATFORM_CONFIG: device prod data is empty in MB1 BCT.
[0000.296] E> MB1_PLATFORM_CONFIG: Failed to initialize device prod.
[0000.303] I> Temperature = 32000
[0000.306] W> Skipping boost for clk: BPMP_CPU_NIC
[0000.311] W> Skipping boost for clk: BPMP_APB
[0000.315] W> Skipping boost for clk: AXI_CBB
[0000.319] W> Skipping boost for clk: AON_CPU_NIC
[0000.323] W> Skipping boost for clk: CAN1
[0000.327] W> Skipping boost for clk: CAN2
[0000.331] I> Boot-device: eMMC
[0000.334] I> Boot-device: eMMC
[0000.343] I> Sdmmc: HS400 mode enabled
[0000.348] I> ECC region[0]: Start:0x0, End:0x0
[0000.352] I> ECC region[1]: Start:0x0, End:0x0
[0000.356] I> ECC region[2]: Start:0x0, End:0x0
[0000.360] I> ECC region[3]: Start:0x0, End:0x0
[0000.364] I> ECC region[4]: Start:0x0, End:0x0
[0000.368] I> Non-ECC region[0]: Start:0x80000000, End:0x100000000
[0000.374] I> Non-ECC region[1]: Start:0x0, End:0x0
[0000.378] I> Non-ECC region[2]: Start:0x0, End:0x0
[0000.383] I> Non-ECC region[3]: Start:0x0, End:0x0
[0000.387] I> Non-ECC region[4]: Start:0x0, End:0x0
[0000.393] W> MB1_PLATFORM_CONFIG: Rail ID 9 not found in pmic rail config table.
[0000.400] E> FAILED: Thermal config
[0000.403] W> DEVICE_PROD: device prod is not initialized.
[0000.412] W> MB1_PLATFORM_CONFIG: Rail ID 7 not found in pmic rail config table.
[0000.419] E> FAILED: MEMIO rail config
[0000.431] I> scrub mode: full dram
[0000.434] E> FUSE: Failed to verify ECID.
[0000.438] I> Boot-device: eMMC
[0000.447] I> sdmmc bdev is already initialized
[0000.485] W> No fuse-bypass data
[0000.492] W> MB1_PLATFORM_CONFIG: Rail ID 8 not found in pmic rail config table.
[0001.999] E> WP1.5 ACK pending
[0002.002] E> Error: 20
[0002.004] E> Task 75 failed (err: 0x32320006)
[0002.008] E> Top caller module: CPUINIT, error module: CPUINIT, reason: 0x06, aux_info: 0x00
[0002.017] I> MB1(1.5.1.0-t194-41334769-59d8a47d) BIT boot status dump :
0000000000011111111110111111111111111111111001111111011100111111110110111111000000000000000000000000000000000000000000000000000011111011000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000
[0002.047] I> Reset to recovery mode

Hi,

[0002.047] I> Reset to recovery mode

Is this carrier board nvidia devkit or custom carrier board?

Hi
carrier board is nvidia devkit

And the normal power up log generated by another xavier is as below:

[0000.312] W> RATCHET: MB1 binary ratchet value 4 is too large than ratchet level 2 from HW fuses.
[0000.320] I> MB1 (prd-version: 1.5.1.0-t194-41334769-59d8a47d)
[0000.326] I> Boot-mode: Coldboot
[0000.329] I> Chip revision : A02P
[0000.332] I> Bootrom patch version : 15 (correctly patched)
[0000.337] I> ATE fuse revision : 0x200
[0000.340] I> Ram repair fuse : 0x0
[0000.343] I> Ram Code : 0x2
[0000.346] I> rst_source : 0x0
[0000.349] I> rst_level : 0x0
[0000.352] I> Boot-device: eMMC
[0000.355] W> DEVICE_PROD: device prod is not initialized.
[0000.360] W> DEVICE_PROD: device prod is not initialized.
[0000.377] I> sdmmc DDR50 mode
[0000.381] W> DEVICE_PROD: device prod is not initialized.
[0000.386] W> No valid slot number is found in scratch register
[0000.392] W> Return default slot: _a
[0000.395] I> Active Boot chain : 0
[0000.398] I> Boot-device: eMMC
[0000.402] E> MB1_PLATFORM_CONFIG: device prod data is empty in MB1 BCT.
[0000.408] E> MB1_PLATFORM_CONFIG: Failed to initialize device prod.
[0000.415] I> Temperature = 32000
[0000.418] W> Skipping boost for clk: BPMP_CPU_NIC
[0000.423] W> Skipping boost for clk: BPMP_APB
[0000.427] W> Skipping boost for clk: AXI_CBB
[0000.431] W> Skipping boost for clk: AON_CPU_NIC
[0000.435] W> Skipping boost for clk: CAN1
[0000.439] W> Skipping boost for clk: CAN2
[0000.443] I> Boot-device: eMMC
[0000.446] I> Boot-device: eMMC
[0000.455] I> Sdmmc: HS400 mode enabled
[0000.460] I> ECC region[0]: Start:0x0, End:0x0
[0000.464] I> ECC region[1]: Start:0x0, End:0x0
[0000.468] I> ECC region[2]: Start:0x0, End:0x0
[0000.472] I> ECC region[3]: Start:0x0, End:0x0
[0000.476] I> ECC region[4]: Start:0x0, End:0x0
[0000.480] I> Non-ECC region[0]: Start:0x80000000, End:0x100000000
[0000.486] I> Non-ECC region[1]: Start:0x0, End:0x0
[0000.491] I> Non-ECC region[2]: Start:0x0, End:0x0
[0000.495] I> Non-ECC region[3]: Start:0x0, End:0x0
[0000.500] I> Non-ECC region[4]: Start:0x0, End:0x0
[0000.505] W> MB1_PLATFORM_CONFIG: Rail ID 9 not found in pmic rail config table.
[0000.512] E> FAILED: Thermal config
[0000.515] W> DEVICE_PROD: device prod is not initialized.
[0000.524] W> MB1_PLATFORM_CONFIG: Rail ID 7 not found in pmic rail config table.
[0000.531] E> FAILED: MEMIO rail config
[0000.547] I> scrub mode: full dram
[0000.550] E> FUSE: Failed to verify ECID.
[0000.554] I> Boot-device: eMMC
[0000.564] I> sdmmc bdev is already initialized
[0000.601] W> No fuse-bypass data
[0000.608] W> MB1_PLATFORM_CONFIG: Rail ID 8 not found in pmic rail config table.
[0000.641] I> MB1 done

1 Like

It looks like a hardaware issue.

Do you have other xavier modules to do the test? Plug out problematic module, plug in a new one and see if same issue happens on new module.

Yes,i have tried that way.It is Related to the core carrier module. When plug in a new one module,there’s no issue.

Hi WayneWWW

What does this error mean:

[0002.002] E> Error: 20
[0002.004] E> Task 75 failed (err: 0x32320006)
[0002.008] E> Top caller module: CPUINIT, error module: CPUINIT, reason: 0x06, aux_info: 0x00

Hi,

I am not quite sure about this error. Have you tried to re-flash this module? Looks like cpu is not able to init anymore.

Yes,i have tried re-flash the whole module .
Please help me to confirm this error. Thanks !!!

Hi,

If even reflash cannot help, then this module is broken. You could directly RMA this module.
We don’t have any internal bug that has task 75 error.