Hi,
We are experiencing a cold boot failure on the Jetson Orin NX at -20°C ambient temperature. The ambient temperature has been verified with an external sensor.
We have applied NVIDIA’s critical QSPI software update, but the issue persists.
Issue:
The MB1 bootloader fails during the thermal check with the following error:
[0000.120] I> Task: Thermal check (0x50021d55)
[0000.121] I> max_chip_limit = 105
[0000.122] I> min_chip_limit = -28
[0000.123] I> max temp read = -20
[0000.124] I> min temp read = -33
[0000.125] E> SOC_THERM: Failed to verify temp range.
[0000.126] C> Task 0x0 failed (err: 0x5f5f030b)
[0000.127] E> Top caller module: SOC_THERM, error module: SOC_THERM, reason: 0x0b, aux_info: 0x03
The module then enters a boot loop (“Busy Spin”).
Analysis:
The SOC_THERM sensors report a minimum temperature of -33°C, which is implausible given the verified ambient temperature of -20°C: a powered SoC should not read 13°C below ambient. Since the reported value falls below the firmware’s hard-coded limit of -28°C, the boot process is terminated.
This appears to be a sensor calibration error at low temperatures rather than a true die temperature. The Orin NX is rated for a minimum Tj of -25°C, so we are within the specified operating range, yet the erroneous sensor reading prevents boot.
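To make the failure condition explicit, here is a minimal sketch of the range check as we infer it from the log. This is not MB1 source code: the function name `thermal_range_ok` is hypothetical, and treating the limits as inclusive is an assumption on our part.

```python
def thermal_range_ok(min_read, max_read, min_limit, max_limit):
    """Return True if both sensor extremes lie within the chip limits.

    Assumption: the check is a simple inclusive bounds comparison,
    inferred from the log output, not taken from MB1 source.
    """
    return min_limit <= min_read and max_read <= max_limit


# Values copied from the MB1 log excerpt in this post.
failing = thermal_range_ok(min_read=-33, max_read=-20, min_limit=-28, max_limit=105)

# How far the minimum reading falls below the lower limit, in degrees C.
violation = -28 - (-33)

print(failing)    # False
print(violation)  # 5
```

Under this reading, the maximum sensor value (-20) matches ambient and passes, and only the minimum sensor value trips the check, by 5°C.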
Additional Observation:
We have another Orin NX module flashed with the identical image and tested under the same conditions. That unit reports temperatures approximately 5°C lower than the failing unit. This suggests significant unit-to-unit variation in thermal sensor calibration, which may cause some modules to fail cold boot while others pass.
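For comparing units, the thermal values can be pulled out of each boot log with a small script. This is only a sketch: it assumes every log uses the same line format as the excerpt in this post, and `parse_thermal` is a helper name made up for illustration.

```python
import re

# Matches lines such as "[0000.123] I> max temp read = -20".
# Assumption: the log format is identical to the excerpt in this post.
THERMAL_RE = re.compile(
    r"I>\s*(max_chip_limit|min_chip_limit|max temp read|min temp read)\s*=\s*(-?\d+)"
)


def parse_thermal(log_text):
    """Return a dict of the thermal-check values found in an MB1 log."""
    return {m.group(1): int(m.group(2)) for m in THERMAL_RE.finditer(log_text)}


sample = """\
[0000.121] I> max_chip_limit = 105
[0000.122] I> min_chip_limit = -28
[0000.123] I> max temp read = -20
[0000.124] I> min temp read = -33
"""

values = parse_thermal(sample)

# Spread between the hottest and coldest sensor on the same die.
spread = values["max temp read"] - values["min temp read"]

print(values)
print(spread)
```

Running the same script over the logs of both modules gives a direct side-by-side of the per-unit readings and of the sensor spread within each die.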
Questions:
- Is this a known issue with SOC_THERM calibration at low temperatures?
- Is there a way to adjust the min_chip_limit parameter via MB1 BCT configuration?
- Can NVIDIA provide a firmware update or workaround for this issue?
- Is the unit-to-unit thermal sensor variation within expected tolerances?
Environment:
- Module: Jetson Orin NX
- JetPack version: 5.1.1
- Ambient temperature: -20°C (verified)
- QSPI patch: Applied
- Carrier: Forecr - NVIDIA® Jetson Orin™ NX & Orin Nano™ Dual LAN Carrier Board - DSBOARD-ORNX-LAN
Full boot log attached.
Thank you.
crash_log_1152_040126.txt (26.8 KB)