phdm
June 9, 2021, 1:41pm
1
Hello,
When I run
sudo nvpmodel -m 0
on my TX2 devkit running jetpack-4.5.1, my kernel silently freezes (no answer anymore on the serial line, no answer to ‘ping’) and finally (after about 2 minutes) reboots.
No message appears on the serial console.
Any hint at the cause or to a way to debug that ?
That sounds similar to the problem I see with mit Xavier AGX DevKit:
Running a CPU&GPU intensive application (training a model with Coqui-TTS) I see reproducible freeze of my AGX Xavier a few minutes after setting it to nvpmodel 0 / MAXN mode. Fan profile is set to “cool”. The same application runs for hours and days in “30W all” mode without problems.
The last lines of dmesg --follow before the crash are:
[71675.256574] FAN rising trip_level:2 cur_temp:61350 trip_temps[3]:62000
[71679.736282] FAN rising trip_level:3 cur_temp:62100 trip_temps[4]:73000
[71684.2…
phdm
June 11, 2021, 9:29am
3
This was caused by the DT not being loaded correctly. See https://forums.developer.nvidia.com/t/jetpack-4-5-1-tx2-bug-fdt-selected-file-loaded-incorrectly-by-uboot/180197 I did however not succeed to find why that made the kernel silently freeze, although there is surely a kernel bug lying there.
@dkreutz , your case only happens to Xavier AGX. I didn’t hear anyone report the bluetooth hostwake issue on other platforms in these years.