Thermal Alert Triggered when powering on Drive AGX

Please provide the following info (check/uncheck the boxes after creating this topic):
Software Version
DRIVE OS Linux 5.2.6
DRIVE OS Linux 5.2.6 and DriveWorks 4.0
DRIVE OS Linux 5.2.0
DRIVE OS Linux 5.2.0 and DriveWorks 3.5
NVIDIA DRIVE™ Software 10.0 (Linux)
NVIDIA DRIVE™ Software 9.0 (Linux)
other DRIVE OS version
other

Target Operating System
Linux
QNX
other

Hardware Platform
NVIDIA DRIVE™ AGX Xavier DevKit (E3550)
NVIDIA DRIVE™ AGX Pegasus DevKit (E3550)
other

SDK Manager Version
1.7.1.8928
other

Host Machine Version
native Ubuntu 18.04
other

Hi, we suddenly ran into a Thermal Alert Triggered error after powering on the Drive AGX. Whenever we try power cycling the board using aurixreset, the fans suddenly run significantly faster and the AURIX console prints Thermal Alert Triggered.

AURIX Console constantly printing Thermal Alert Triggered

We’ve tried reflashing the OS and got results similar to those described in Drive AGX unable to flash Drive OS 5.2.6. The linked topic suggests that this is a hardware issue.

Additionally, the topic DRIVE AGX fail to power on frequently suggests that we reflash the AURIX MCU in order to run the showvoltages program.

Any suggestions on how we should continue?

Dear @aulia.widyaputra1,
What was the state of the board before you hit this issue? Is it used in office premises or in a lab? Also, could you run showvoltages on the AURIX console? There is a short time gap before the thermal alert shows up on the AURIX console, so we need to reset Tegra and put it in recovery mode before the thermal alert appears, then check flashing manually using bootburn. Could you check whether Reflash 99% , the sdkmange stucks - #18 by tonyb.zhao works for you?

What is the state of the board before you hit this issue?

We just finished installing DriveOS 5.2.6.0 using instructions from Install DRIVE Platform with SDK Manager.

After flashing, we connected to nvshell using PuTTY and were asked to set up the initial user. That is when we started getting the Thermal Alert Triggered error continuously.

We don’t have other logs of the board before the error because we didn’t expect this error would occur. Will look into routinely logging board conditions for the future.
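As a starting point for that kind of routine logging, here is a minimal sketch. It assumes the AURIX console output is already available as an iterable of text lines (for example, a session log file saved by the terminal program). The function name `scan_console_lines` and the `clock` parameter are hypothetical and are not part of any NVIDIA tool; the only string taken from this thread is the alert message itself.

```python
from datetime import datetime, timezone

# Message as it appears on the AURIX console in this thread.
ALERT_TEXT = "Thermal Alert Triggered"


def scan_console_lines(lines, clock=None):
    """Timestamp each non-empty console line and count thermal alerts.

    `lines` is any iterable of strings (a list, an open log file, ...).
    `clock` is a hypothetical hook returning the current datetime; it
    defaults to the current UTC time and exists mainly to make testing easy.
    Returns (stamped_lines, alert_count).
    """
    if clock is None:
        clock = lambda: datetime.now(timezone.utc)

    stamped = []
    alerts = 0
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line:
            continue  # skip blank lines from the console
        if ALERT_TEXT in line:
            alerts += 1
        stamped.append(f"{clock().isoformat()} {line}")
    return stamped, alerts
```

Passing an open log file (`with open(path) as f: scan_console_lines(f)`) would give a timestamped record and a quick count of how often the alert fired, which could be kept alongside the board's normal logs.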

Is it used in office premises or lab?

It’s used in a lab.

Also, could you run showvoltages on aurix console?

Our AURIX console does not have the showvoltages command. Should we reflash the AURIX using Flashing Basics?

Also, there is a short time gap before the thermal alert shows up on the AURIX console, so we need to reset Tegra and put it in recovery mode before the thermal alert appears, then check flashing manually using bootburn. Could you check whether Reflash 99% , the sdkmange stucks works for you?

We will try the suggestion in this link first. Thank you.

Dear @aulia.widyaputra1,
I am wondering whether the earlier flashing was successful. How did you confirm that it was successful?
Do you see the thermal alert errors on the Tegra console or the AURIX console?

Please flash the target as suggested in the previous post (Reflash 99% , the sdkmange stucks - #18 by tonyb.zhao) and provide your feedback.

Do you see thermal alert errors on Tegra console or aurix console?

We saw the thermal alert errors on the AURIX console. The Tegra consoles were inaccessible: when we tried to connect to them with PuTTY, the terminal gave no response.

Please flash the target as suggested in the previous post (Reflash 99% , the sdkmange stucks - #18 by tonyb.zhao) and provide your feedback.

We followed the suggestions to flash in recovery mode as explained in Reflash 99% , the sdkmange stucks - #18 by tonyb.zhao.

We were able to successfully flash Drive Software 10.0 using SDK Manager. Thank you.
