Power Cut on Official Device of Jeston AGX Orin 64GB

Hi,

We are using official JAO 64GB Carrier Board for our robot development. Now, we encounter a problem of random power cut, which seriously impact our project. We raised several cases - Several Jetson Orin 64GB Devkit sudennly power-off unexpectedly used in our product body under 25℃ - #35 by liwei.1988.

Now , we want to confirm with you the fault device tree file for our double check the thermal zone related setting. They are tegra234.dtsi, and tegra234-p3701-0000.dtsi, tegra234-p3737-0000.dtsi,tegra234-p3737-0000+p3701-0000.dtsi, tegra234-sim-vdk.dtsi.

*** Please note that this reply is generated by LLM automatically ***

Based on the provided context, it appears that you are experiencing issues with random power cuts on your Jetson AGX Orin 64GB Carrier Board, which is impacting your robot development project. You have raised several cases and are seeking to confirm the device tree file settings related to thermal zones.

The device tree files mentioned are:

  1. tegra234dtsi
  2. tegra234p37010000dtsi
  3. tegra234p37370000dtsi
  4. tegra234p37370000p37010000dtsi
  5. tegra234simvdkdtsi

These files are part of the Linux kernel source code and contain settings for the Jetson AGX Orin module. The dtsi files are Device Tree Source Include files, which are used to define the hardware components and their properties.

To address your issue, I recommend checking the thermal zone settings in these device tree files. Specifically, you should look for the thermal-zones node and its child nodes, which define the thermal zones and their properties.

Here’s a brief overview of the thermal zone settings:

  • thermal-zones: This node defines the thermal zones on the Jetson AGX Orin module.
  • zone0: This node defines the first thermal zone, which typically corresponds to the CPU.
  • zone1: This node defines the second thermal zone, which typically corresponds to the GPU.
  • zone2: This node defines the third thermal zone, which typically corresponds to the memory.

In each zone node, you should check the following properties:

  • trip-points: This property defines the temperature trip points for each zone.
  • cooling-devices: This property defines the cooling devices associated with each zone.

To troubleshoot your issue, you can try the following:

  1. Check the thermal zone settings in the device tree files to ensure that they are correctly configured.
  2. Verify that the thermal zones are properly defined and that the trip points are set correctly.
  3. Check the cooling devices associated with each zone to ensure that they are functioning correctly.
  4. Monitor the system temperature and thermal zone status using tools like sensors or thermal_zone to see if there are any anomalies.

If you are still experiencing issues after checking the device tree files and thermal zone settings, you may want to consider other factors that could be contributing to the random power cuts, such as:

  • Power supply issues
  • Overheating due to environmental factors or inadequate cooling
  • Hardware faults or defects

It’s also worth noting that the Jetson AGX Orin module has a minimum operating temperature of -20°C, as mentioned in the topic “Jetson Orin 200T module’s minimum operating temperature is only -20℃”. If your system is operating below this temperature, it may be causing issues.

I hope this helps you troubleshoot and resolve the issue with your Jetson AGX Orin 64GB Carrier Board.

*** Please give the thumbs up if you get answers you like. Or provide feedback to help us improve the answer. ***

Hi,

Please check the /hardware/nvidia directory from the source code tarball.

我下载了public_sources、Jetson_Linux_R36.4.3_aarch64、Tegra_Linux_Sample-Root-Filesystem_R36.4.3_aarch64、ubuntu_jammy-l4t_aarch64_src,请问您说的文件具体是哪个路径?

public_sources

這包裡面的kernel src裡面就有我提到的路徑.