Jetson Orin Nano Random Reboots

Hi folks,

I have a Jetson Orin Nano Developer Kit that reboots itself at seemingly random times.
E.g., when there is no load (sitting idle), when building a container, or when using the UI.
It’s frustrating because I have no way to reproduce the behavior on demand.

Some info:

  • Developer Kit with included/provided Power Supply
  • Using Samsung 990 PRO SSD as the boot device.
  • Previously I tried an SD card (SanDisk Extreme Pro) with the same behavior
  • Only peripherals are a monitor and usb keyboard+mouse
  • Using JetPack 6.2.1
  • I can rule out thermal issues, since tegrastats shows about 52 °C for the board, CPU, and GPU

I’ve collected as many logs as I can, but I’m really stumped here.

Thanks for helping!
jetson_logs.zip (12.8 MB)

*** Please note that this reply is generated by LLM automatically ***

Random reboots that occur while idle, while building containers, and while using the UI are hard to diagnose because they cannot be triggered on demand. Based on the information you’ve provided, here are the potential causes:

  1. Power Supply: You’re using the power supply provided with the Developer Kit. This is a good starting point, as the official power supply is designed to meet the kit’s power requirements. However, power issues can sometimes be subtle and not immediately apparent. It might be worth checking the power supply’s specifications against the kit’s requirements to ensure it’s adequate, especially under peak loads.

  2. Boot Device: You’ve booted from both a Samsung 990 PRO SSD and an SD card (SanDisk Extreme Pro), with the same behavior on each. This suggests that the storage medium itself is not the cause.

  3. Peripherals: Your setup includes a monitor and a USB keyboard/mouse. These are basic peripherals that are unlikely to cause system instability leading to reboots. However, it’s worth noting that sometimes, faulty or incompatible peripherals can cause issues. Since you’ve mentioned the problem occurs even when the system is idle, it’s less likely to be related to these peripherals.

  4. Thermal Issues: You’ve checked the temperatures using tegrastats, and the board, CPU, and GPU are all around 52°C, which is within a safe operating range. Thermal issues are therefore unlikely to be the cause of the reboots.

  5. Logs and Diagnosis: You’ve collected logs, which is a crucial step in diagnosing the issue. However, without specific details from these logs, it’s challenging to pinpoint the exact cause. Common areas to look into would include kernel logs for any error messages related to hardware failures, power management issues, or software crashes.
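As a rough illustration of the kernel-log check described in item 5, the sketch below filters log lines for a few patterns that often accompany unexpected resets. The pattern list and the function name `find_suspect_lines` are assumptions chosen for illustration; this is not an NVIDIA-provided list, and a clean result does not rule out a hardware fault.

```python
import re

# Hypothetical pattern list: an illustrative starting point only,
# not an official or exhaustive set of Jetson error signatures.
SUSPECT_PATTERNS = [
    r"kernel panic",
    r"\boops\b",
    r"watchdog",
    r"under-?voltage",
    r"over-?current",
    r"thermal.*shutdown",
    r"nvme.*(timeout|reset)",
]
SUSPECT_RE = re.compile("|".join(SUSPECT_PATTERNS), re.IGNORECASE)


def find_suspect_lines(log_lines):
    """Return (line_number, text) pairs for lines matching any suspect pattern."""
    hits = []
    for number, line in enumerate(log_lines, start=1):
        if SUSPECT_RE.search(line):
            hits.append((number, line.rstrip("\n")))
    return hits
```

One possible input is the previous boot’s kernel messages, e.g. the output of `journalctl -k -b -1` saved to a file, assuming persistent journaling is enabled on the device.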

Given the randomness of the issue and the fact that it occurs under various conditions (idle, load, different boot devices), it might be helpful to consider a few additional troubleshooting steps:

  • Firmware and Software Updates: Ensure that your Jetson Orin Nano Developer Kit is running the latest firmware and software. NVIDIA frequently releases updates that can address stability and performance issues.

  • Power Management: Sometimes, power management settings can lead to unexpected reboots. Checking the power settings and ensuring that the system is configured to stay on or properly handle power events might be useful.

  • Hardware Inspection: Although less likely given your setup, inspecting the hardware for any signs of physical damage or issues (e.g., loose connections, overheating components) could be beneficial.

  • Comparison with Similar Issues: Looking into forums, NVIDIA’s official support channels, and developer communities for similar issues can provide insights. The fact that you’ve experienced this issue with different boot devices suggests a potentially deeper hardware or firmware issue rather than a software configuration problem.

In comparison to other potential issues with the Jetson Orin Nano or similar devices, such as thermal issues (Jetson Orin NX thermal issue), camera problems (Jetson Orin Nano camera issue), or fan issues (Jetson Orin Nano fan issue), your problem seems to be more related to system stability and potentially power management or hardware reliability, given the randomness and variety of conditions under which the reboots occur.

Without more specific log data or error messages, it’s challenging to provide a definitive diagnosis or solution. Continuing to monitor system logs and potentially running stress tests or diagnostic tools provided by NVIDIA or the Linux community might help isolate the cause of the issue.
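For the ongoing monitoring suggested here, one option is to keep a tegrastats log running and check the last readings recorded before a reboot. The sketch below parses temperature tokens of the form `name@52.1C` from one tegrastats line. The exact output format and sensor names differ between modules and JetPack releases, so the regular expression and both function names are assumptions to adapt, not a documented interface.

```python
import re

# Temperature tokens in tegrastats output look like "cpu@47.5C". Sensor names
# vary by module and release, so none are hard-coded here (assumption).
TEMP_RE = re.compile(r"(\w+)@(-?\d+(?:\.\d+)?)C\b")


def parse_temperatures(line):
    """Map each sensor name found in one tegrastats line to degrees Celsius."""
    return {name: float(value) for name, value in TEMP_RE.findall(line)}


def hottest(line):
    """Return (sensor, temperature) for the hottest sensor, or None if none found."""
    temps = parse_temperatures(line)
    if not temps:
        return None
    return max(temps.items(), key=lambda item: item[1])
```

Applying `hottest` to the final lines of a log written with something like `tegrastats --interval 1000 --logfile <path>` would show whether temperatures were still near the reported 52°C just before the reset.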


Hi richard170,

What is the failure rate?

Do you still hit the issue after using SDK Manager to flash a clean JetPack 6.2.1 (r36.4.4) onto the devkit, without running any applications?

[41539.333585] NVRM rpcRmApiControl_dce: NVRM_RPC_DCE: Failed RM ctrl call cmd:0x731341 result 0xffff:
[41540.461406] NVRM rpcRmApiControl_dce: NVRM_RPC_DCE: Failed RM ctrl call cmd:0x731341 result 0xffff:
[41599.019662] rfkill: input handler enabled
[41603.987622] rfkill: input handler disabled

Are the messages above what you saw before the reboot?

Please share the serial console log captured when the reboot occurs.
To capture it, you can refer to "Jetson Nano & NX Style - Serial Debug Console - JetsonHacks".
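When reading a captured log, it helps to remember that the bracketed number on each kernel line (as in the NVRM and rfkill messages quoted above) is seconds since boot, so a sudden drop back toward zero marks a reboot. The following is a minimal sketch of that idea; the function names are hypothetical and the code assumes the default `[seconds.microseconds]` prefix.

```python
import re
from datetime import timedelta

# Matches the default kernel prefix, e.g. "[41539.333585] message".
STAMP_RE = re.compile(r"^\[\s*(\d+\.\d+)\]\s*(.*)$")


def parse_kernel_line(line):
    """Split '[seconds.micros] message' into (seconds_since_boot, message)."""
    match = STAMP_RE.match(line.strip())
    if match is None:
        return None
    return float(match.group(1)), match.group(2)


def uptime_string(seconds):
    """Format seconds since boot as H:MM:SS (fractional part dropped)."""
    return str(timedelta(seconds=int(seconds)))


def boot_boundaries(lines):
    """Return indexes of lines whose timestamp is lower than the previous one."""
    boundaries = []
    previous = None
    for index, line in enumerate(lines):
        parsed = parse_kernel_line(line)
        if parsed is None:
            continue  # skip lines without a kernel timestamp
        if previous is not None and parsed[0] < previous:
            boundaries.append(index)
        previous = parsed[0]
    return boundaries
```

For the first quoted line, `uptime_string(41539.333585)` gives `11:32:19`, i.e. the system had been up for roughly eleven and a half hours when that message was logged.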
