Watchdog timer resetting system on boot - SdMmcControllerDxe.efi error

We are seeing occasional boot failures on AGX Orin devkit which appear to be related to the SdMmcControllerDxe.efi driver.

In the error case the last log messages on the serial port are

[2025-01-20 06:33:23.506] Loading driver at 0x008257A6000 EntryPoint=0x008257AEBF8 SdMmcControllerDxe.efi
[2025-01-20 06:33:23.524] 
[2025-01-20 06:38:21.203] Watchdog Timer resetting system
[2025-01-20 06:38:21.205] Shutdown state requested 1
[2025-01-20 06:38:21.220] Rebooting system ...

After the SdMmcControllerDxe.efi driver is loaded, nothing happens for 5 minutes (timestamps added by minicom), then the watchdog resets the system to forced recovery mode and we have to manually power cycle it.
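
As a reading aid, the silent period can be measured directly from the minicom timestamps. The sketch below is only an illustration: the helper names `parse_stamp` and `largest_gap` are made up for this example, and it assumes every line of interest starts with a `[YYYY-MM-DD HH:MM:SS.mmm]` prefix like the capture above.

```python
from datetime import datetime

# Lines copied from the failing boot above (minicom timestamp format).
LOG = """\
[2025-01-20 06:33:23.506] Loading driver at 0x008257A6000 EntryPoint=0x008257AEBF8 SdMmcControllerDxe.efi
[2025-01-20 06:33:23.524]
[2025-01-20 06:38:21.203] Watchdog Timer resetting system
[2025-01-20 06:38:21.205] Shutdown state requested 1
[2025-01-20 06:38:21.220] Rebooting system ...
"""


def parse_stamp(line):
    """Return the datetime inside the leading [...] of a line, or None."""
    end = line.find("]")
    if not line.startswith("[") or end < 0:
        return None
    try:
        return datetime.strptime(line[1:end], "%Y-%m-%d %H:%M:%S.%f")
    except ValueError:
        return None


def largest_gap(text):
    """Return (seconds, line_before_gap) for the longest silence in the log."""
    stamped = [(parse_stamp(line), line) for line in text.splitlines()]
    stamped = [(t, line) for t, line in stamped if t is not None]
    best = (0.0, "")
    for (t0, line0), (t1, _) in zip(stamped, stamped[1:]):
        gap = (t1 - t0).total_seconds()
        if gap > best[0]:
            best = (gap, line0)
    return best


seconds, before = largest_gap(LOG)
print(f"longest silence: {seconds:.3f} s, after line: {before!r}")
```

Run over a long capture of repeated reboots, the same function points at the last line printed before the longest stall, which makes hung boots easy to spot.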

On a successful boot, error messages are printed after the SdMmcControllerDxe.efi driver is loaded, but they do not prevent the devkit from continuing to boot into Linux, nor do they seem to have any impact on SD card functionality.

[2025-01-19 06:01:01.246] Loading driver at 0x008257A6000 EntryPoint=0x008257AEBF8 SdMmcControllerDxe.efi
[2025-01-19 06:01:01.263] 
[2025-01-19 06:01:01.279] Error reported by SDHCI
[2025-01-19 06:01:01.279] Interrupt status = 8000
[2025-01-19 06:01:01.279] Error interrupt status = 1
[2025-01-19 06:01:01.285] TRB failed with Device Error
[2025-01-19 06:01:01.285] Printing SD_MMC_HC_TRB
[2025-01-19 06:01:01.290] Slot: 0
[2025-01-19 06:01:01.290] BlockSize: 512
[2025-01-19 06:01:01.290] Data: 0
[2025-01-19 06:01:01.291] DataLen: 0
[2025-01-19 06:01:01.291] Read: 0
[2025-01-19 06:01:01.291] DataPhy: 0
[2025-01-19 06:01:01.296] DataMap: 0
[2025-01-19 06:01:01.296] Mode: 0
[2025-01-19 06:01:01.296] AdmaLengthMode: 0
[2025-01-19 06:01:01.296] Event: 0
[2025-01-19 06:01:01.296] Started: 0
[2025-01-19 06:01:01.301] CommandComplete: 0
[2025-01-19 06:01:01.301] Timeout: 1000000
[2025-01-19 06:01:01.301] Retries: 5
[2025-01-19 06:01:01.302] PioModeTransferCompleted: 0
[2025-01-19 06:01:01.308] PioBlockIndex: 0
[2025-01-19 06:01:01.308] Adma32Desc: 0
[2025-01-19 06:01:01.308] Adma64V3Desc: 0
[2025-01-19 06:01:01.312] Adma64V4Desc: 0
[2025-01-19 06:01:01.313] AdmaMap: 0
[2025-01-19 06:01:01.313] AdmaPages: 0
[2025-01-19 06:01:01.313] Printing EFI_SD_MMC_PASS_THRU_COMMAND_PACKET
[2025-01-19 06:01:01.319] Command index: 1, argument: 0
[2025-01-19 06:01:01.319] Command type: 1, response type: 3
[2025-01-19 06:01:01.324] Response 0: 0, 1: 0, 2: 0, 3: 0
[2025-01-19 06:01:01.329] Timeout: 1000000
[2025-01-19 06:01:01.329] InDataBuffer: 0
[2025-01-19 06:01:01.330] OutDataBuffer: 0
[2025-01-19 06:01:01.330] InTransferLength: 0
[2025-01-19 06:01:01.336] OutTransferLength: 0
[2025-01-19 06:01:01.336] TransactionStatus: Success
[2025-01-19 06:01:01.336] Error reported by SDHCI
[2025-01-19 06:01:01.341] Interrupt status = 8000
[2025-01-19 06:01:01.341] Error interrupt status = 1
[2025-01-19 06:01:01.346] TRB failed with Device Error
[2025-01-19 06:01:01.346] Printing SD_MMC_HC_TRB
[2025-01-19 06:01:01.351] Slot: 0
[2025-01-19 06:01:01.351] BlockSize: 512
[2025-01-19 06:01:01.352] Data: 0
[2025-01-19 06:01:01.352] DataLen: 0
[2025-01-19 06:01:01.352] Read: 0
[2025-01-19 06:01:01.352] DataPhy: 0
[2025-01-19 06:01:01.357] DataMap: 0
[2025-01-19 06:01:01.358] Mode: 0
[2025-01-19 06:01:01.358] AdmaLengthMode: 0
[2025-01-19 06:01:01.358] Event: 0
[2025-01-19 06:01:01.358] Started: 0
[2025-01-19 06:01:01.364] CommandComplete: 0
[2025-01-19 06:01:01.364] Timeout: 1000000
[2025-01-19 06:01:01.364] Retries: 5
[2025-01-19 06:01:01.364] PioModeTransferCompleted: 0
[2025-01-19 06:01:01.368] PioBlockIndex: 0
[2025-01-19 06:01:01.368] Adma32Desc: 0
[2025-01-19 06:01:01.368] Adma64V3Desc: 0
[2025-01-19 06:01:01.375] Adma64V4Desc: 0
[2025-01-19 06:01:01.375] AdmaMap: 0
[2025-01-19 06:01:01.375] AdmaPages: 0
[2025-01-19 06:01:01.375] Printing EFI_SD_MMC_PASS_THRU_COMMAND_PACKET
[2025-01-19 06:01:01.379] Command index: 8, argument: 1FF
[2025-01-19 06:01:01.380] Command type: 1, response type: 8
[2025-01-19 06:01:01.386] Response 0: 0, 1: 0, 2: 0, 3: 0
[2025-01-19 06:01:01.390] Timeout: 1000000
[2025-01-19 06:01:01.391] InDataBuffer: 0
[2025-01-19 06:01:01.391] OutDataBuffer: 0
[2025-01-19 06:01:01.391] InTransferLength: 0
[2025-01-19 06:01:01.397] OutTransferLength: 0
[2025-01-19 06:01:01.397] TransactionStatus: Success
[2025-01-19 06:01:01.397] Error reported by SDHCI
[2025-01-19 06:01:01.403] Interrupt status = 8000
[2025-01-19 06:01:01.403] Error interrupt status = 1
[2025-01-19 06:01:01.407] TRB failed with Device Error
[2025-01-19 06:01:01.408] Printing SD_MMC_HC_TRB
[2025-01-19 06:01:01.413] Slot: 0
[2025-01-19 06:01:01.413] BlockSize: 512
[2025-01-19 06:01:01.413] Data: 0
[2025-01-19 06:01:01.413] DataLen: 0
[2025-01-19 06:01:01.413] Read: 0
[2025-01-19 06:01:01.414] DataPhy: 0
[2025-01-19 06:01:01.419] DataMap: 0
[2025-01-19 06:01:01.420] Mode: 0
[2025-01-19 06:01:01.420] AdmaLengthMode: 0
[2025-01-19 06:01:01.420] Event: 0
[2025-01-19 06:01:01.420] Started: 0
[2025-01-19 06:01:01.424] CommandComplete: 0
[2025-01-19 06:01:01.425] Timeout: 1000000
[2025-01-19 06:01:01.425] Retries: 5
[2025-01-19 06:01:01.425] PioModeTransferCompleted: 0
[2025-01-19 06:01:01.431] PioBlockIndex: 0
[2025-01-19 06:01:01.431] Adma32Desc: 0
[2025-01-19 06:01:01.431] Adma64V3Desc: 0
[2025-01-19 06:01:01.435] Adma64V4Desc: 0
[2025-01-19 06:01:01.436] AdmaMap: 0
[2025-01-19 06:01:01.436] AdmaPages: 0
[2025-01-19 06:01:01.436] Printing EFI_SD_MMC_PASS_THRU_COMMAND_PACKET
[2025-01-19 06:01:01.442] Command index: 5, argument: 0
[2025-01-19 06:01:01.442] Command type: 1, response type: 4
[2025-01-19 06:01:01.448] Response 0: 0, 1: 0, 2: 0, 3: 0
[2025-01-19 06:01:01.452] Timeout: 1000000
[2025-01-19 06:01:01.452] InDataBuffer: 0
[2025-01-19 06:01:01.453] OutDataBuffer: 0
[2025-01-19 06:01:01.453] InTransferLength: 0
[2025-01-19 06:01:01.458] OutTransferLength: 0
[2025-01-19 06:01:01.458] TransactionStatus: Success
[2025-01-19 06:01:01.459] Error reported by SDHCI
[2025-01-19 06:01:01.463] Interrupt status = 8000
[2025-01-19 06:01:01.464] Error interrupt status = 1
[2025-01-19 06:01:01.470] TRB failed with Device Error
[2025-01-19 06:01:01.470] Printing SD_MMC_HC_TRB
[2025-01-19 06:01:01.476] Slot: 0
[2025-01-19 06:01:01.476] BlockSize: 512
[2025-01-19 06:01:01.476] Data: 0
[2025-01-19 06:01:01.476] DataLen: 0
[2025-01-19 06:01:01.476] Read: 0
[2025-01-19 06:01:01.476] DataPhy: 0
[2025-01-19 06:01:01.480] DataMap: 0
[2025-01-19 06:01:01.480] Mode: 0
[2025-01-19 06:01:01.480] AdmaLengthMode: 0
[2025-01-19 06:01:01.480] Event: 0
[2025-01-19 06:01:01.480] Started: 0
[2025-01-19 06:01:01.487] CommandComplete: 0
[2025-01-19 06:01:01.487] Timeout: 1000000
[2025-01-19 06:01:01.487] Retries: 5
[2025-01-19 06:01:01.487] PioModeTransferCompleted: 0
[2025-01-19 06:01:01.492] PioBlockIndex: 0
[2025-01-19 06:01:01.492] Adma32Desc: 0
[2025-01-19 06:01:01.492] Adma64V3Desc: 0PCIe Controller-5 Link is DOWN
[2025-01-19 06:01:01.498] 
[2025-01-19 06:01:01.498] Adma64V4Desc: 0
[2025-01-19 06:01:01.498] AdmaMap: 0
[2025-01-19 06:01:01.498] AdmaPages: 0
[2025-01-19 06:01:01.503] Printing EFI_SD_MMC_PASS_THRU_COMMAND_PACKET
[2025-01-19 06:01:01.503] Command index: 55, argument: 0
[2025-01-19 06:01:01.509] Command type: 2, response type: 0
[2025-01-19 06:01:01.509] Response 0: 0, 1: 0, 2: 0, 3: 0
[2025-01-19 06:01:01.515] Timeout: 1000000
[2025-01-19 06:01:01.515] InDataBuffer: 0
[2025-01-19 06:01:01.519] OutDataBuffer: 0
[2025-01-19 06:01:01.520] InTransferLength: 0
[2025-01-19 06:01:01.520] OutTransferLength: 0
[2025-01-19 06:01:01.520] TransactionStatus: Success
[2025-01-19 06:01:01.549] Error reported by SDHCI
[2025-01-19 06:01:01.549] Interrupt status = 8001
[2025-01-19 06:01:01.549] Error interrupt status = 1
[2025-01-19 06:01:01.555] TRB failed with Device Error
[2025-01-19 06:01:01.555] Printing SD_MMC_HC_TRB
[2025-01-19 06:01:01.561] Slot: 0
[2025-01-19 06:01:01.561] BlockSize: 512
[2025-01-19 06:01:01.561] Data: 0
[2025-01-19 06:01:01.561] DataLen: 0
[2025-01-19 06:01:01.561] Read: 0
[2025-01-19 06:01:01.561] DataPhy: 0
[2025-01-19 06:01:01.565] DataMap: 0
[2025-01-19 06:01:01.566] Mode: 0
[2025-01-19 06:01:01.566] AdmaLengthMode: 0
[2025-01-19 06:01:01.566] Event: 0
[2025-01-19 06:01:01.566] Started: 0
[2025-01-19 06:01:01.571] CommandComplete: 0
[2025-01-19 06:01:01.572] Timeout: 2500000
[2025-01-19 06:01:01.572] Retries: 5
[2025-01-19 06:01:01.572] PioModeTransferCompleted: 0
[2025-01-19 06:01:01.577] PioBlockIndex: 0
[2025-01-19 06:01:01.577] Adma32Desc: 0
[2025-01-19 06:01:01.577] Adma64V3Desc: 0
[2025-01-19 06:01:01.583] Adma64V4Desc: 0
[2025-01-19 06:01:01.583] AdmaMap: 0
[2025-01-19 06:01:01.583] AdmaPages: 0
[2025-01-19 06:01:01.583] Printing EFI_SD_MMC_PASS_THRU_COMMAND_PACKET
[2025-01-19 06:01:01.589] Command index: 7, argument: 0
[2025-01-19 06:01:01.589] Command type: 2, response type: 0
[2025-01-19 06:01:01.594] Response 0: 0, 1: 0, 2: 0, 3: 0
[2025-01-19 06:01:01.600] Timeout: 2500000
[2025-01-19 06:01:01.600] InDataBuffer: 0
[2025-01-19 06:01:01.600] OutDataBuffer: 0
[2025-01-19 06:01:01.600] InTransferLength: 0
[2025-01-19 06:01:01.610] OutTransferLength: 0
[2025-01-19 06:01:01.610] TransactionStatus: Success
[2025-01-19 06:01:01.610] Installed Fat filesystem on 8221E3998

The error messages are printed whether or not an SD card is installed in the devkit. The only significant difference appears to be the address in the last line of the preceding log, which changes to Installed Fat filesystem on 8221D6B18 when an SD card is installed.
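
For anyone decoding the status values in the dump above, here is a minimal sketch. It assumes the two values are printed in hex and follow the Normal Interrupt Status and Error Interrupt Status bit layout of the SD Host Controller Simplified Specification. The tables are deliberately partial, so any bit not listed is reported by number rather than guessed. The `decode` helper is hypothetical and not part of the driver.

```python
# Bit names assumed from the SD Host Controller Simplified Specification.
# Only a subset is listed; unknown bits are reported as "bit N".
NORMAL_INT = {
    0: "Command Complete",
    1: "Transfer Complete",
    15: "Error Interrupt",
}
ERROR_INT = {
    0: "Command Timeout Error",
    1: "Command CRC Error",
    2: "Command End Bit Error",
    3: "Command Index Error",
    4: "Data Timeout Error",
}


def decode(value, names):
    """Return the names of all bits set in a 16-bit status value."""
    return [names.get(bit, f"bit {bit}") for bit in range(16) if value & (1 << bit)]


# Values as they appear in the logs in this thread (hex strings).
for status in ("8000", "8001", "A001"):
    print("Interrupt status", status, "->", decode(int(status, 16), NORMAL_INT))
print("Error interrupt status 1 ->", decode(0x1, ERROR_INT))
```

Under that assumption, every entry in the dump is a command timeout, meaning the controller saw no response to the command within its timeout.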

This Jetson was running Linux MUK 5.15.136-l4t-r36.3-1009.9+g46cdb595bebc #1 SMP PREEMPT Fri May 3 10:21:41 UTC 2024 aarch64 GNU/Linux, based on a meta-tegra Yocto Linux build.

Please let me know what other debugging steps or logs I may provide to aid in debugging this.


Hi tbottom,

What is the failure rate in your case?

Do you hit the issue if you use SDK Manager to flash the AGX Orin devkit with JetPack 6.0 (L4T R36.3.0)?
Or is the issue specific to your Yocto project?

If the issue occurs when the SD card is inserted, please remove it and use the internal eMMC to narrow down the issue.

The boot failure rate is around 1%.

I have not yet tested with JetPack 6.0 from SDK Manager. I will flash a Jetson today and attempt to reproduce the issue.

We have observed the failure to boot with and without an SD card installed. We do not normally have an SD card inserted, but wanted to see if this would avoid the issue.

I installed JetPack 6.0.0 and rebuilt the UEFI bootloader based on the instructions here:

I rebuilt with edk2_docker edkrepo clone nvidia-uefi NVIDIA-Platforms r36.3.0 and flashed uefi_Jetson_DEBUG.bin to the devkit.

I am seeing the SdMmcControllerDxe.efi error message print on successful boots, and was able to reproduce the watchdog timeout after ~20 reboots.

successful boot:

...
add-symbol-file /home/tbottom/src/scratch/edk2-docker-build/nvidia-uefi/Build/Jetson/DEBUG_GCC5/AARCH64/Silicon/NVIDIA/Drivers/PcieDWControllerDxe/PcieControllerDxe/DEBUG/PcieControllerDxe.dll 0x825711000
Loading driver at 0x00825710000 EntryPoint=0x0082571B910 PcieControllerDxe.efi

Failed to read controller number
PCIE Controller ID-1, Bpmp Phandle-3
ParseGicMsiBase: cannot retrieve property 'msi-parent': FDT_ERR_NOTFOUND
Failed to read controller number
PCIE Controller ID-4, Bpmp Phandle-3
ParseGicMsiBase: cannot retrieve property 'msi-parent': FDT_ERR_NOTFOUND
Failed to read controller number
PCIE Controller ID-5, Bpmp Phandle-3
ParseGicMsiBase: cannot retrieve property 'msi-parent': FDT_ERR_NOTFOUND
add-symbol-file /home/tbottom/src/scratch/edk2-docker-build/nvidia-uefi/Build/Jetson/DEBUG_GCC5/AARCH64/MdeModulePkg/Bus/Pci/NvmExpressDxe/NvmExpressDxe/DEBUG/NvmExpressDxe.dll 0x825700000
Loading driver at 0x008256FF000 EntryPoint=0x0082570A3F0 NvmExpressDxe.efi

add-symbol-file /home/tbottom/src/scratch/edk2-docker-build/nvidia-uefi/Build/Jetson/DEBUG_GCC5/AARCH64/Silicon/NVIDIA/Drivers/SdMmcControllerDxe/SdMmcControllerDxe/DEBUG/SdMmcControllerDxe.dll 0x8256F2000
Loading driver at 0x008256F1000 EntryPoint=0x008256F9A88 SdMmcControllerDxe.efi

Error reported by SDHCI
Interrupt status = 8000
Error interrupt status = 1
TRB failed with Device Error
Printing SD_MMC_HC_TRB
Slot: 0
BlockSize: 512
Data: 0
DataLen: 0
Read: 0
DataPhy: 0
DataMap: 0
Mode: 0
AdmaLengthMode: 0
Event: 0
Started: 0
CommandComplete: 0
Timeout: 1000000
Retries: 5
PioModeTransferCompleted: 0
PioBlockIndex: 0
Adma32Desc: 0
Adma64V3Desc: 0
Adma64V4Desc: 0
AdmaMap: 0
AdmaPages: 0
Printing EFI_SD_MMC_PASS_THRU_COMMAND_PACKET
Command index: 1, argument: 0
Command type: 1, response type: 3
Response 0: 0, 1: 0, 2: 0, 3: 0
Timeout: 1000000
InDataBuffer: 0
OutDataBuffer: 0
InTransferLength: 0
OutTransferLength: 0
TransactionStatus: Success
...

failed to boot:

add-symbol-file /home/tbottom/src/scratch/edk2-docker-build/nvidia-uefi/Build/Jetson/DEBUG_GCC5/AARCH64/MdeModulePkg/Bus/Pci/PciBusDxe/PciBusDxe/DEBUG/PciBusDxe.dll 0x825725000
Loading driver at 0x00825724000 EntryPoint=0x0082572D134 PciBusDxe.efi

add-symbol-file /home/tbottom/src/scratch/edk2-docker-build/nvidia-uefi/Build/Jetson/DEBUG_GCC5/AARCH64/Silicon/NVIDIA/Drivers/PcieDWControllerDxe/PcieControllerDxe/DEBUG/PcieControllerDxe.dll 0x825711000
Loading driver at 0x00825710000 EntryPoint=0x0082571B910 PcieControllerDxe.efi

Failed to read controller number
PCIE Controller ID-1, Bpmp Phandle-3
ParseGicMsiBase: cannot retrieve property 'msi-parent': FDT_ERR_NOTFOUND
Failed to read controller number
PCIE Controller ID-4, Bpmp Phandle-3
ParseGicMsiBase: cannot retrieve property 'msi-parent': FDT_ERR_NOTFOUND
Failed to read controller number
PCIE Controller ID-5, Bpmp Phandle-3
ParseGicMsiBase: cannot retrieve property 'msi-parent': FDT_ERR_NOTFOUND
add-symbol-file /home/tbottom/src/scratch/edk2-docker-build/nvidia-uefi/Build/Jetson/DEBUG_GCC5/AARCH64/MdeModulePkg/Bus/Pci/NvmExpressDxe/NvmExpressDxe/DEBUG/NvmExpressDxe.dll 0x825700000
Loading driver at 0x008256FF000 EntryPoint=0x0082570A3F0 NvmExpressDxe.efi

add-symbol-file /home/tbottom/src/scratch/edk2-docker-build/nvidia-uefi/Build/Jetson/DEBUG_GCC5/AARCH64/Silicon/NVIDIA/Drivers/SdMmcControllerDxe/SdMmcControllerDxe/DEBUG/SdMmcControllerDxe.dll 0x8256F2000
Loading driver at 0x008256F1000 EntryPoint=0x008256F9A88 SdMmcControllerDxe.efi

Watchdog Timer resetting system
Shutdown state requested 1
Rebooting system ...

The Jetson does not always appear to get stuck in forced recovery mode after the watchdog timeout. In this case, after the watchdog timeout it reset and booted into Linux normally. This appears to be the case with our Yocto-based image as well.

I also observed a crash dump I have not seen before on one of the reboots:

add-symbol-file /home/tbottom/src/scratch/edk2-docker-build/nvidia-uefi/Build/Jetson/DEBUG_GCC5/AARCH64/Silicon/NVIDIA/Drivers/SdMmcControllerDxe/SdMmcControllerDxe/DEBUG/SdMmcControllerDxe.dll 0x8256F2000
Loading driver at 0x008256F1000 EntryPoint=0x008256F9A88 SdMmcControllerDxe.efi



Synchronous Exception at 0x0000000825D457F4
PC 0x000825D457F4 (0x000825D42000+0x000037F4) [ 0] BpmpIpcDxe.dll
PC 0x000825D45880 (0x000825D42000+0x00003880) [ 0] BpmpIpcDxe.dll
PC 0x000825D4A434 (0x000825D42000+0x00008434) [ 0] BpmpIpcDxe.dll
PC 0x000825D4A840 (0x000825D42000+0x00008840) [ 0] BpmpIpcDxe.dll
PC 0x000825D4ABFC (0x000825D42000+0x00008BFC) [ 0] BpmpIpcDxe.dll
PC 0x000825B31410 (0x000825B2E000+0x00003410) [ 1] BpmpScmiDxe.dll
PC 0x0008256F2CDC (0x0008256F1000+0x00001CDC) [ 2] SdMmcControllerDxe.dll
PC 0x0008256F9810 (0x0008256F1000+0x00008810) [ 2] SdMmcControllerDxe.dll
PC 0x000827F49668 (0x000827F41000+0x00008668) [ 3] DxeCore.dll
PC 0x000827F56938 (0x000827F41000+0x00015938) [ 3] DxeCore.dll
PC 0x000827F4C900 (0x000827F41000+0x0000B900) [ 3] DxeCore.dll
PC 0x00082C807F14

[ 0] /home/tbottom/src/scratch/edk2-docker-build/nvidia-uefi/Build/Jetson/DEBUG_GCC5/AARCH64/Silicon/NVIDIA/Drivers/BpmpIpc/BpmpIpcDxe/DEBUG/BpmpIpcDxe.dll
[ 1] /home/tbottom/src/scratch/edk2-docker-build/nvidia-uefi/Build/Jetson/DEBUG_GCC5/AARCH64/Silicon/NVIDIA/Drivers/BpmpScmi/BpmpScmiDxe/DEBUG/BpmpScmiDxe.dll
[ 2] /home/tbottom/src/scratch/edk2-docker-build/nvidia-uefi/Build/Jetson/DEBUG_GCC5/AARCH64/Silicon/NVIDIA/Drivers/SdMmcControllerDxe/SdMmcControllerDxe/DEBUG/SdMmcControllerDxe.dll
[ 3] /home/tbottom/src/scratch/edk2-docker-build/nvidia-uefi/Build/Jetson/DEBUG_GCC5/AARCH64/MdeModulePkg/Core/Dxe/DxeMain/DEBUG/DxeCore.dll

  X0 0x0000000000000002   X1 0x0000000000000004   X2 0x00000000000F4240   X3 0x0000000000000001
  X4 0x000000082224E2D0   X5 0x0000000002D60DE0   X6 0x000000082CBFE810   X7 0x0000000000000071
  X8 0x0000000825D4A90C   X9 0x0000000000000004  X10 0x0000000000000004  X11 0x0000000000000035
 X12 0x0000000000000002  X13 0x000000000001545C  X14 0x000000082C50C000  X15 0x0000000000000000
 X16 0x0000000825FC9D0C  X17 0x000000082A17EFFC  X18 0x0000000000000000  X19 0x000000082CBFE738
 X20 0x000000082CBFE730  X21 0x000000082CBFE738  X22 0x0000000000000004  X23 0x0000000825D4E000
 X24 0x000000082361CFD8  X25 0x8000000000000006  X26 0x0000000000000001  X27 0x0000000000000001
 X28 0x0000000000000016   FP 0x000000082CBFE610   LR 0x0000000825D45880

  V0 0x0000000000000000 0000000000000000   V1 0x0073656D616E2D74 6573657200737465
  V2 0x756D6D6F6900736C 6C65632D616D6423   V3 0x4000000000000000 0000000040000000
  V4 0x0000000000100000 0000000000000000   V5 0x4010040140100401 4010040140100401
  V6 0x0010000040000040 0010000040000040   V7 0x0000000000000000 0000000000000000
  V8 0x0000000000000000 0000000000000000   V9 0x0000000000000000 0000000000000000
 V10 0x0000000000000000 0000000000000000  V11 0x0000000000000000 0000000000000000
 V12 0x0000000000000000 0000000000000000  V13 0x0000000000000000 0000000000000000
 V14 0x0000000000000000 0000000000000000  V15 0x0000000000000000 0000000000000000
 V16 0x0000000000000000 0000000000000000  V17 0x0000000000000000 0000000000000000
 V18 0x0000000000000000 0000000000000000  V19 0x0000000000000000 0000000000000000
 V20 0x0000000000000000 0000000000000000  V21 0x0000000000000000 0000000000000000
 V22 0x0000000000000000 0000000000000000  V23 0x0000000000000000 0000000000000000
 V24 0x0000000000000000 0000000000000000  V25 0x0000000000000000 0000000000000000
 V26 0x0000000000000000 0000000000000000  V27 0x0000000000000000 0000000000000000
 V28 0x0000000000000000 0000000000000000  V29 0x0000000000000000 0000000000000000
 V30 0x0000000000000000 0000000000000000  V31 0x0000000000000000 0000000000000000

  SP 0x000000082CBFE610  ELR 0x0000000825D457F4  SPSR 0x80000209  FPSR 0x00000000
 ESR 0x96000007          FAR 0x0000000000000004

 ESR : EC 0x25  IL 0x1  ISS 0x00000007

Data abort: Translation fault, third level

Stack dump:
  000082CBFE510: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
  000082CBFE530: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
  000082CBFE550: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
  000082CBFE570: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
  000082CBFE590: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
  000082CBFE5B0: 0000000000000000 0000000000000000 0000000825D4A038 0000000080000209
  000082CBFE5D0: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
  000082CBFE5F0: 000000082CBFE630 0000000825D4A314 0000000040071088 000000082CBFE810
> 000082CBFE610: 000000082CBFE630 0000000825D45880 000000082CBFE738 000000082361CFD8
  000082CBFE630: 000000082CBFE650 0000000825D4A434 000000082CBFE738 0000000825D4A834
  000082CBFE650: 000000082CBFE670 0000000825D4A840 000000082361CF98 0000000000000000
  000082CBFE670: 000000082CBFE6B0 0000000825D4ABFC 000000082CBFE720 000000082361CF98
  000082CBFE690: 0000000825D4E000 8000000000000006 0000000000000004 000000082361CFD8
  000082CBFE6B0: 000000082CBFE7A0 0000000825B31410 0000000000000000 000000082CBFE8D8
  000082CBFE6D0: 000000082CBFE8CF 0000000825B36000 000000082CBFE7F8 0000000825B36000
  000082CBFE6F0: 000000000000007B 000000082224EF98 00000008256FD208 000000082CBFEA28


Synchronous Exception at 0x0000000825D457F4
ASSERT [ArmCpuDxe] /home/tbottom/src/scratch/edk2-docker-build/nvidia-uefi/edk2/ArmPkg/Library/DefaultExceptionHandlerLib/AArch64/DefaultExceptionHandler.c(343): ((BOOLEAN)(0==1))

Resetting the system in 5 seconds.
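
As a cross-check of the exception dump above, the sketch below splits the ESR value into its fields and recomputes the offset of the faulting PC inside BpmpIpcDxe.dll. The field positions (EC in bits 31:26, IL in bit 25, ISS in bits 24:0, fault status code in the low six ISS bits for a data abort) are taken from the Arm architecture reference manual as I understand it. The function names are made up for this example.

```python
def decode_esr(esr):
    """Split an AArch64 ESR_ELx value into EC, IL, ISS and the fault status code."""
    iss = esr & 0x1FFFFFF          # bits 24:0
    return {
        "ec": (esr >> 26) & 0x3F,  # exception class, bits 31:26
        "il": (esr >> 25) & 0x1,   # instruction length bit
        "iss": iss,
        "dfsc": iss & 0x3F,        # data fault status code (data aborts only)
    }


def module_offset(pc, image_base):
    """Offset of an address inside a loaded image, as printed in the backtrace."""
    return pc - image_base


fields = decode_esr(0x96000007)
print({name: hex(value) for name, value in fields.items()})

# ELR and the BpmpIpcDxe.dll base, both taken from the dump above.
print(hex(module_offset(0x825D457F4, 0x825D42000)))
```

The decoded EC 0x25 and fault status 0x07 agree with the "Data abort: Translation fault, third level" line. A FAR of 0x4 would be consistent with a read through a NULL pointer at a small offset, though that is only a guess without the source line for offset 0x37F4.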

Could you verify with the latest JetPack 6.2 (L4T R36.4.3), since you are using the devkit?
Please let us know if you still hit a similar error with JetPack 6.2.

Installed r36.4.3 and rebuilt the debug UEFI as shown below:

$ export EDK2_DEV_IMAGE="ghcr.io/tianocore/containers/ubuntu-22-dev:latest"
$ export EDK2_USER_ARGS="-v \"${HOME}\":\"${HOME}\" -e EDK2_DOCKER_USER_HOME=\"${HOME}\""
$ export EDK2_BUILD_ROOT="$(pwd)"
$ export EDK2_BUILDROOT_ARGS="-v \"${EDK2_BUILD_ROOT}\":\"${EDK2_BUILD_ROOT}\""
$ alias edk2_docker="docker run -it --rm -w \"\$(pwd)\" ${EDK2_BUILDROOT_ARGS} ${EDK2_USER_ARGS} \"${EDK2_DEV_IMAGE}\""
$ edk2_docker init_edkrepo_conf
$ edk2_docker edkrepo manifest-repos add nvidia https://github.com/NVIDIA/edk2-edkrepo-manifest.git main nvidia
$ edk2_docker edkrepo clone nvidia-uefi-36.4.3 NVIDIA-Platforms r36.4.3
$ cd nvidia-uefi-36.4.3/
$ edk2_docker edk2-nvidia/Platform/NVIDIA/Jetson/build.sh

I have not been able to reproduce the watchdog timeout yet. I will keep the test running for a while and see if it recurs.
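
A rough way to decide how long such a test needs to run: if each boot fails independently with probability p (an assumption, the real failure may well depend on temperature, timing, or prior state), then n clean reboots in a row give confidence 1 - (1 - p)^n that the failure rate is below p. The sketch below uses the two rates mentioned earlier in this thread, about 1% on the Yocto image and roughly 1 in 20 on the R36.3.0 debug build. The function name is hypothetical.

```python
import math


def reboots_needed(failure_rate, confidence=0.95):
    """Smallest n such that 1 - (1 - failure_rate)**n >= confidence."""
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - failure_rate))


for rate in (0.01, 0.05):
    print(f"failure rate {rate:.0%}: {reboots_needed(rate)} clean reboots for 95% confidence")
```

So a few dozen clean reboots say little about a 1% failure; several hundred are needed before the absence of a watchdog reset is meaningful.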

I am seeing the same error messages printed on a successful boot with r36.4.3, so they may have been a red herring. I will report back if I am able to reproduce the boot failure with r36.4.3.

Error message from successful r36.4.3 boot for reference:

add-symbol-file /home/tbottom/src/scratch/edk2-docker-build/nvidia-uefi-36.4.3/Build/Jetson/DEBUG_GCC5/AARCH64/Silicon/NVIDIA/Drivers/SdMmcControllerDxe/SdMmcControllerDxe/DEBUG/SdMmcControllerDxe.dll 0x825674000
Loading driver at 0x00825673000 EntryPoint=0x0082567BCB0 SdMmcControllerDxe.efi

Error reported by SDHCI
Interrupt status = 8000
Error interrupt status = 1
TRB failed with Device Error
Printing SD_MMC_HC_TRB
Slot: 0
BlockSize: 512
Data: 0
DataLen: 0
Read: 0
DataPhy: 0
DataMap: 0
Mode: 0
AdmaLengthMode: 0
Event: 0
Started: 0
CommandComplete: 0
Timeout: 1000000
Retries: 5
PioModeTransferCompleted: 0
PioBlockIndex: 0
Adma32Desc: 0
Adma64V3Desc: 0
Adma64V4Desc: 0
AdmaMap: 0
AdmaPages: 0
Printing EFI_SD_MMC_PASS_THRU_COMMAND_PACKET
Command index: 1, argument: 0
Command type: 1, response type: 3
Response 0: 0, 1: 0, 2: 0, 3: 0
Timeout: 1000000
InDataBuffer: 0
OutDataBuffer: 0
InTransferLength: 0
OutTransferLength: 0
TransactionStatus: Success
Error reported by SDHCI
Interrupt status = 8000
Error interrupt status = 1
TRB failed with Device Error
Printing SD_MMC_HC_TRB
Slot: 0
BlockSize: 512
Data: 0
DataLen: 0
Read: 0
DataPhy: 0
DataMap: 0
Mode: 0
AdmaLengthMode: 0
Event: 0
Started: 0
CommandComplete: 0
Timeout: 1000000
Retries: 5
PioModeTransferCompleted: 0
PioBlockIndex: 0
Adma32Desc: 0
Adma64V3Desc: 0
Adma64V4Desc: 0
AdmaMap: 0
AdmaPages: 0
Printing EFI_SD_MMC_PASS_THRU_COMMAND_PACKET
Command index: 8, argument: 1FF
Command type: 1, response type: 8
Response 0: 0, 1: 0, 2: 0, 3: 0
Timeout: 1000000
InDataBuffer: 0
OutDataBuffer: 0
InTransferLength: 0
OutTransferLength: 0
TransactionStatus: Success
Error reported by SDHCI
Interrupt status = 8000
Error interrupt status = 1
TRB failed with Device Error
Printing SD_MMC_HC_TRB
Slot: 0
BlockSize: 512
Data: 0
DataLen: 0
Read: 0
DataPhy: 0
DataMap: 0
Mode: 0
AdmaLengthMode: 0
Event: 0
Started: 0
CommandComplete: 0
Timeout: 1000000
Retries: 5
PioModeTransferCompleted: 0
PioBlockIndex: 0
Adma32DesPCIe Controller-5 Link is DOWN
c: 0
Adma64V3Desc: 0
Adma64V4Desc: 0
AdmaMap: 0
AdmaPages: 0
Printing EFI_SD_MMC_PASS_THRU_COMMAND_PACKET
Command index: 5, argument: 0
Command type: 1, response type: 4
Response 0: 0, 1: 0, 2: 0, 3: 0
Timeout: 1000000
InDataBuffer: 0
OutDataBuffer: 0
InTransferLength: 0
OutTransferLength: 0
TransactionStatus: Success
Error reported by SDHCI
Interrupt status = 8000
Error interrupt status = 1
TRB failed with Device Error
Printing SD_MMC_HC_TRB
Slot: 0
BlockSize: 512
Data: 0
DataLen: 0
Read: 0
DataPhy: 0
DataMap: 0
Mode: 0
AdmaLengthMode: 0
Event: 0
Started: 0
CommandComplete: 0
Timeout: 1000000
Retries: 5
PioModeTransferCompleted: 0
PioBlockIndex: 0
Adma32Desc: 0
Adma64V3Desc: 0
Adma64V4Desc: 0
AdmaMap: 0
AdmaPages: 0
Printing EFI_SD_MMC_PASS_THRU_COMMAND_PACKET
Command index: 55, argument: 0
Command type: 2, response type: 0
Response 0: 0, 1: 0, 2: 0, 3: 0
Timeout: 1000000
InDataBuffer: 0
OutDataBuffer: 0
InTransferLength: 0
OutTransferLength: 0
TransactionStatus: Success
Error reported by SDHCI
Interrupt status = A001
Error interrupt status = 1
TRB failed with Device Error
Printing SD_MMC_HC_TRB
Slot: 0
BlockSize: 512
Data: 0
DataLen: 0
Read: 0
DataPhy: 0
DataMap: 0
Mode: 0
AdmaLengthMode: 0
Event: 0
Started: 0
CommandComplete: 0
Timeout: 2500000
Retries: 5
PioModeTransferCompleted: 0
PioBlockIndex: 0
Adma32Desc: 0
Adma64V3Desc: 0
Adma64V4Desc: 0
AdmaMap: 0
AdmaPages: 0
Printing EFI_SD_MMC_PASS_THRU_COMMAND_PACKET
Command index: 7, argument: 0
Command type: 2, response type: 0
Response 0: 0, 1: 0, 2: 0, 3: 0
Timeout: 2500000
InDataBuffer: 0
OutDataBuffer: 0
InTransferLength: 0
OutTransferLength: 0
TransactionStatus: Success
[ext4] Needs journal recovery, mounting read-only
Installed Fat filesystem on 822171C98

Thanks for the update.
If you can still boot successfully, the messages should be harmless.
Please continue to monitor for any boot issues with JP6.2 (R36.4.3).
