I was finally able to run the tests again and got similar results.
Test Description:
P2597 EVM board
Two new TX2 modules without heatsink or fan, firmware: JetPack 4.3
Running stress test from first post
Thermometer: Graphtec GL840, T-type thermocouple, ±0.6 °C measurement accuracy
Thermocouple location - TTP temperature: Jetson_TX2_Thermal_Design_Guide_v1.0 - Figure 3-2
DUT inside a thermal chamber, with the ambient temperature set to four points: -20C, 10C, 15C, 18C
TX2 logs retrieved with tegrastats command
Test Results for one TX2 module: (Data captured 4 hours after Chamber temperature is stable)
Ambient temperature = -20C
TX2 TTP temperature = 29.5C
RAM 2524/7860MB (lfb 979x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@2035,100%@2035,100%@2035,100%@2035,100%@2035,100%@2035] EMC_FREQ 0% GR3D_FREQ 99% PLL@47C MCPU@47C PMIC@100C Tboard@27C GPU@51.5C BCPU@47C thermal@48.8C Tdiode@51.5C VDD_SYS_GPU 10571/10358 VDD_SYS_SOC 1072/1063 VDD_4V0_WIFI 0/9 VDD_IN 19060/19033 VDD_SYS_CPU 4443/4577 VDD_SYS_DDR 1798/1811
Result: Pass - No CPU throttling
Ambient temperature = 10C
TX2 TTP temperature = 67.0C
RAM 2405/7860MB (lfb 1169x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@2035,100%@2035,100%@2035,100%@2035,100%@2035,100%@2035] EMC_FREQ 0% GR3D_FREQ 99% PLL@87C MCPU@87C PMIC@100C Tboard@63C GPU@92.5C BCPU@87C thermal@89.5C Tdiode@92C VDD_SYS_GPU 11915/11912 VDD_SYS_SOC 1451/1377 VDD_4V0_WIFI 0/15 VDD_IN 22111/21877 VDD_SYS_CPU 5576/5495 VDD_SYS_DDR 1990/1872
Result: Pass - No CPU throttling
Ambient temperature = 15C
TX2 TTP temperature = 68.5C
RAM 3971/7860MB (lfb 780x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@2035,100%@2035,100%@2035,100%@2035,100%@2035,100%@2035] EMC_FREQ 0% GR3D_FREQ 99% PLL@90.5C MCPU@90.5C PMIC@100C Tboard@65C GPU@95C BCPU@90.5C thermal@92.3C Tdiode@94.5C VDD_SYS_GPU 11457/11041 VDD_SYS_SOC 1451/1448 VDD_4V0_WIFI 0/10 VDD_IN 21461/21183 VDD_SYS_CPU 5499/5613 VDD_SYS_DDR 1856/1874
Result: Pass - No CPU throttling
Ambient temperature = 18C
TX2 TTP temperature = 69.1C
02/02/20 06:13:01: RAM 3771/7860MB (lfb 822x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@2035,100%@2035,100%@2035,100%@2035,100%@2035,100%@2035] EMC_FREQ 0% GR3D_FREQ 99% PLL@91C MCPU@91C PMIC@100C Tboard@67C GPU@95.5C BCPU@91C thermal@93.1C Tdiode@95C VDD_SYS_GPU 10392/10346 VDD_SYS_SOC 1451/1449 VDD_4V0_WIFI 0/10 VDD_IN 20734/20252 VDD_SYS_CPU 5805/5368 VDD_SYS_DDR 1856/1870
02/02/20 06:13:02: RAM 3236/7860MB (lfb 955x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@2035,100%@2035,100%@2035,100%@2035,100%@2035,100%@2035] EMC_FREQ 0% GR3D_FREQ 99% PLL@91C MCPU@91C PMIC@100C Tboard@67C GPU@95.5C BCPU@91C thermal@92.8C Tdiode@95C VDD_SYS_GPU 10392/10346 VDD_SYS_SOC 1451/1449 VDD_4V0_WIFI 0/10 VDD_IN 20696/20252 VDD_SYS_CPU 5728/5368 VDD_SYS_DDR 1875/1870
02/02/20 06:13:03: RAM 4705/7860MB (lfb 588x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@2035,100%@2035,100%@2035,100%@2035,100%@2035,100%@2035] EMC_FREQ 0% GR3D_FREQ 99% PLL@91C MCPU@91C PMIC@100C Tboard@67C GPU@95.5C BCPU@91C thermal@93.1C Tdiode@94.75C VDD_SYS_GPU 10392/10346 VDD_SYS_SOC 1452/1449 VDD_4V0_WIFI 0/10 VDD_IN 20581/20252 VDD_SYS_CPU 5728/5369 VDD_SYS_DDR 1837/1870
02/02/20 06:13:04: RAM 2128/7860MB (lfb 1232x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@2035,100%@2035,100%@2035,100%@2035,100%@2035,100%@2035] EMC_FREQ 0% GR3D_FREQ 99% PLL@91C MCPU@91C PMIC@100C Tboard@67C GPU@95.5C BCPU@91C thermal@93.1C Tdiode@95C VDD_SYS_GPU 10392/10346 VDD_SYS_SOC 1451/1449 VDD_4V0_WIFI 0/10 VDD_IN 20734/20252 VDD_SYS_CPU 5805/5369 VDD_SYS_DDR 1894/1870
02/02/20 06:13:05: RAM 3414/7860MB (lfb 911x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@1881,100%@1881,100%@1881,100%@1881,100%@1881,100%@1881] EMC_FREQ 0% GR3D_FREQ 99% PLL@91C MCPU@91C PMIC@100C Tboard@67C GPU@95.5C BCPU@91C thermal@92.8C Tdiode@94.75C VDD_SYS_GPU 10396/10346 VDD_SYS_SOC 1453/1449 VDD_4V0_WIFI 0/10 VDD_IN 19786/20252 VDD_SYS_CPU 4892/5369 VDD_SYS_DDR 1837/1870
02/02/20 06:13:06: RAM 3665/7860MB (lfb 848x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@1881,100%@1881,100%@1881,100%@1881,100%@1881,100%@1881] EMC_FREQ 0% GR3D_FREQ 99% PLL@90.5C MCPU@90.5C PMIC@100C Tboard@67C GPU@95.5C BCPU@90.5C thermal@92.8C Tdiode@94.75C VDD_SYS_GPU 10396/10346 VDD_SYS_SOC 1453/1449 VDD_4V0_WIFI 0/10 VDD_IN 19556/20252 VDD_SYS_CPU 4816/5368 VDD_SYS_DDR 1779/1870
02/02/20 06:13:07: RAM 1869/7860MB (lfb 1272x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@1881,100%@1881,100%@1881,100%@1881,100%@1881,100%@1881] EMC_FREQ 0% GR3D_FREQ 99% PLL@90.5C MCPU@90.5C PMIC@100C Tboard@67C GPU@95C BCPU@90.5C thermal@92.5C Tdiode@94.75C VDD_SYS_GPU 10396/10346 VDD_SYS_SOC 1376/1449 VDD_4V0_WIFI 0/10 VDD_IN 19488/20251 VDD_SYS_CPU 4739/5368 VDD_SYS_DDR 1760/1870
02/02/20 06:13:08: RAM 2203/7860MB (lfb 1213x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@1881,100%@1881,100%@1881,100%@1881,100%@1881,100%@1881] EMC_FREQ 0% GR3D_FREQ 99% PLL@90.5C MCPU@90.5C PMIC@100C Tboard@67C GPU@95.5C BCPU@90.5C thermal@92.3C Tdiode@94.5C VDD_SYS_GPU 10396/10346 VDD_SYS_SOC 1453/1449 VDD_4V0_WIFI 0/10 VDD_IN 19633/20251 VDD_SYS_CPU 4892/5368 VDD_SYS_DDR 1779/1870
02/02/20 06:13:09: RAM 2899/7860MB (lfb 1039x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@1881,100%@1881,100%@1881,100%@1881,100%@1881,100%@1881] EMC_FREQ 0% GR3D_FREQ 99% PLL@90.5C MCPU@90.5C PMIC@100C Tboard@67C GPU@95.5C BCPU@90.5C thermal@92.5C Tdiode@94.5C VDD_SYS_GPU 10396/10346 VDD_SYS_SOC 1453/1449 VDD_4V0_WIFI 0/10 VDD_IN 19595/20251 VDD_SYS_CPU 4739/5368 VDD_SYS_DDR 1817/1870
02/02/20 06:13:10: RAM 3764/7860MB (lfb 823x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@1881,100%@1881,100%@1881,100%@1881,100%@1881,100%@1881] EMC_FREQ 0% GR3D_FREQ 99% PLL@90C MCPU@90C PMIC@100C Tboard@67C GPU@95.5C BCPU@90C thermal@92.5C Tdiode@94.5C VDD_SYS_GPU 10396/10346 VDD_SYS_SOC 1453/1449 VDD_4V0_WIFI 0/10 VDD_IN 19518/20251 VDD_SYS_CPU 4663/5368 VDD_SYS_DDR 1856/1870
02/02/20 06:13:11: RAM 2474/7860MB (lfb 975x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@2035,100%@2035,100%@2035,100%@2035,100%@2035,100%@2035] EMC_FREQ 0% GR3D_FREQ 99% PLL@90.5C MCPU@90.5C PMIC@100C Tboard@67C GPU@95C BCPU@90.5C thermal@92.5C Tdiode@94.25C VDD_SYS_GPU 10396/10346 VDD_SYS_SOC 1453/1449 VDD_4V0_WIFI 0/10 VDD_IN 19710/20251 VDD_SYS_CPU 4892/5368 VDD_SYS_DDR 1837/1870
02/02/20 06:13:12: RAM 1342/7860MB (lfb 1360x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@1881,100%@1881,100%@1881,100%@1881,100%@1881,100%@1881] EMC_FREQ 0% GR3D_FREQ 99% PLL@90.5C MCPU@90.5C PMIC@100C Tboard@67C GPU@95C BCPU@90.5C thermal@92.5C Tdiode@94.25C VDD_SYS_GPU 10396/10346 VDD_SYS_SOC 1453/1449 VDD_4V0_WIFI 0/10 VDD_IN 19556/20251 VDD_SYS_CPU 4739/5368 VDD_SYS_DDR 1837/1870
02/02/20 06:13:13: RAM 2830/7860MB (lfb 1056x4MB) SWAP 0/3930MB (cached 0MB) CPU [100%@2035,100%@2035,100%@2035,100%@2035,100%@2035,100%@2035] EMC_FREQ 0% GR3D_FREQ 99% PLL@91C MCPU@91C PMIC@100C Tboard@67C GPU@95.5C BCPU@91C thermal@92.6C Tdiode@94.5C VDD_SYS_GPU 10392/10346 VDD_SYS_SOC 1452/1449 VDD_4V0_WIFI 0/10 VDD_IN 20322/20251 VDD_SYS_CPU 5425/5368 VDD_SYS_DDR 1856/1870
Result: Fail - CPU throttling; the CPU frequency swings between 100%@2035 and 100%@1881
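For anyone who wants to find these frequency drops in a longer log without reading it by eye, here is a minimal Python sketch. It assumes the tegrastats line format shown above, and it assumes that 2035 (the value seen in the passing runs) is the nominal clock to compare against. The function names and the `nominal_mhz` parameter are made up for illustration; the two sample lines are taken from the 18C log with most fields trimmed for brevity.

```python
import re

# "CPU [100%@2035,100%@2035,...]" -> capture the part between the brackets.
CPU_RE = re.compile(r"CPU \[([^\]]*)\]")
# Temperature fields look like "Tdiode@94.75C"; the name must be a word
# directly before "@", so "100%@2035" entries are not matched.
TEMP_RE = re.compile(r"(\w+)@(-?\d+(?:\.\d+)?)C\b")


def parse_line(line):
    """Return (cpu_freqs, temps) parsed from one tegrastats line."""
    freqs = []
    match = CPU_RE.search(line)
    if match:
        for core in match.group(1).split(","):
            # Entries without "@" (e.g. an offline core) are skipped.
            if "@" in core:
                freqs.append(int(core.split("@")[1]))
    temps = {name: float(value) for name, value in TEMP_RE.findall(line)}
    return freqs, temps


def find_throttle_events(lines, nominal_mhz=2035):
    """List (line_index, lowest_freq, temps) for lines below nominal_mhz.

    nominal_mhz is an assumption taken from the passing runs in this post.
    """
    events = []
    for index, line in enumerate(lines):
        freqs, temps = parse_line(line)
        if freqs and min(freqs) < nominal_mhz:
            events.append((index, min(freqs), temps))
    return events


if __name__ == "__main__":
    sample = [
        "02/02/20 06:13:04: CPU [100%@2035,100%@2035,100%@2035,100%@2035,"
        "100%@2035,100%@2035] GPU@95.5C BCPU@91C thermal@93.1C Tdiode@95C",
        "02/02/20 06:13:05: CPU [100%@1881,100%@1881,100%@1881,100%@1881,"
        "100%@1881,100%@1881] GPU@95.5C BCPU@91C thermal@92.8C Tdiode@94.75C",
    ]
    for index, freq, temps in find_throttle_events(sample):
        print(index, freq, temps)
```

Running it over a full log gives the on-die temperatures at each moment the clock dropped, which makes it easier to see which sensor reading the drops line up with.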
Comments and Questions:
With the TTP temperature at only around 70C, the CPU is already throttling.
Is the CPU throttling event logged somewhere? I don't see anything in dmesg.
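One way to look at this from the software side, independent of dmesg, is to read the kernel's thermal zones and their trip points from sysfs. The sketch below assumes the standard Linux thermal sysfs layout (`/sys/class/thermal/thermal_zone*/` with `type`, `temp`, and `trip_point_N_temp` / `trip_point_N_type` files, values in millidegrees Celsius). I have not confirmed which zones and trip points the JetPack 4.3 kernel exposes on the TX2, so treat the output as something to check rather than a known answer. The function name `read_thermal_zones` is just for illustration.

```python
from pathlib import Path


def read_thermal_zones(root="/sys/class/thermal"):
    """Return a list of dicts with zone name, temperature and trip points.

    Assumes the generic Linux thermal sysfs layout; values in the files
    are millidegrees Celsius and are converted to degrees here.
    """
    base = Path(root)
    if not base.is_dir():
        return []
    zones = []
    for zone in sorted(base.glob("thermal_zone*")):
        try:
            name = (zone / "type").read_text().strip()
            temp_c = int((zone / "temp").read_text().strip()) / 1000.0
        except (OSError, ValueError):
            # Some zones may be unreadable or report no valid value.
            continue
        trips = []
        for trip_temp in zone.glob("trip_point_*_temp"):
            trip_type = trip_temp.with_name(
                trip_temp.name.replace("_temp", "_type"))
            try:
                trips.append((trip_type.read_text().strip(),
                              int(trip_temp.read_text().strip()) / 1000.0))
            except (OSError, ValueError):
                continue
        # Order trip points by temperature, lowest first.
        trips.sort(key=lambda trip: trip[1])
        zones.append({"zone": name, "temp_c": temp_c, "trips": trips})
    return zones


if __name__ == "__main__":
    for info in read_thermal_zones():
        print(info["zone"], info["temp_c"], info["trips"])
```

Comparing the trip temperatures printed here against the tegrastats readings at the moment the clock drops should show whether the frequency change lines up with a configured trip point.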