Hello ,
I am using INVIDIA Jetson Xavier with a limited power of 30w and powered with15V external power source (it can deliver 45w ) , I am having a lot of crashes on startup and it occurs randomly,
I am using I2C_1 and I2C_8 to comunicate with other devices through the 40 pins outputs ,
to note also that two displays are connected (hdmi + USB_C)
I looked to the log file one one week and I found a lot of thoes messages before [Booting Linux on physical CPU 0x0]:
iwlwifi 0003:01:00.0: Getting the temperature timed out 1618
tegradc 15210000.nvdisplay: blank - powerdown 3269
iwlwifi 0003:01:00.0: Direct firmware load for iwlwifi 932
I want to know what can causes thoes error messages and crashes
This is the TX2 forum, you might want to repost this in the Xavier forum.
After reposting you will probably need a serial console boot log, and also mention if this is the dev kit or a third party carrier board, along with which release of L4T is used (if you can get it booted for a moment, then check the result of command “head -n 1 /etc/nv_tegra_release
” to verify L4T version). Additionally, you’ll want to mention which peripherals are connected (if other than keyboard/mouse…do mention any kind of monitor and type of cable it uses).
Thank you for your reply
I am using the standard dev kit
~$ head -n 1 /etc/nv_tegra_release
R32 (release), REVISION: 4.3, GCID: 21589087, BOARD: t186ref, EABI: aarch64, DATE: Fri Jun 26 04:34:27 UTC 2020
the connected peripherals are :
2 cameras connected to j512 and j507 USB
Epson projector card with HDMI cable
4.3 iunches waveshare screen to USB_C j512 with USB to HDMI adapter
2 microconrollers connected to I2c_GP5 and I2C_GP2 of the 40 pin header to exchange I2C messages
wifi intel card 8265NGW with antenas
log kernel file : kern.log_copy.txt (9.8 MB)
any Ideas ?
I don’t know enough about temperature monitoring code, but it seems like most of the last messages are based on temperature. This does not necessarily mean temperature is too high, but it seems like effort is being made to find temperature, and it times out. This is the last thing at the end of the log.
Can anyone else suggest if failure to read temperature at the end of the log would cause the system to crash? Here is the excerpt (some extra lines shown to illustrate video and wired ethernet was working):
Dec 7 16:34:35 tpsh-agx-1 kernel: [ 17.052471] eqos 2490000.ether_qos eth0: Link is Up - 1Gbps/Full - flow control rx/tx
Dec 7 16:34:35 tpsh-agx-1 kernel: [ 17.053147] IPv6: ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
Dec 7 16:34:41 tpsh-agx-1 kernel: [ 23.074258] Bluetooth: BNEP (Ethernet Emulation) ver 1.3
Dec 7 16:34:41 tpsh-agx-1 kernel: [ 23.074301] Bluetooth: BNEP socket layer initialized
Dec 7 16:34:42 tpsh-agx-1 kernel: [ 23.435964] fuse init (API version 7.26)
Dec 7 16:34:42 tpsh-agx-1 kernel: [ 23.871061] tegradc 15200000.nvdisplay: unblank
Dec 7 16:34:42 tpsh-agx-1 kernel: [ 23.871090] tegradc 15220000.nvdisplay: unblank
Dec 7 16:34:42 tpsh-agx-1 kernel: [ 23.871101] tegradc 15210000.nvdisplay: blank - powerdown
Dec 7 16:34:42 tpsh-agx-1 kernel: [ 23.907457] tegradc 15200000.nvdisplay: unblank
Dec 7 16:34:42 tpsh-agx-1 kernel: [ 23.907485] tegradc 15220000.nvdisplay: unblank
Dec 7 16:34:42 tpsh-agx-1 kernel: [ 23.907496] tegradc 15210000.nvdisplay: blank - powerdown
Dec 7 16:34:43 tpsh-agx-1 kernel: [ 24.598746] tegradc 15200000.nvdisplay: unblank
Dec 7 16:34:43 tpsh-agx-1 kernel: [ 24.598775] tegradc 15220000.nvdisplay: unblank
Dec 7 16:34:43 tpsh-agx-1 kernel: [ 24.598789] tegradc 15210000.nvdisplay: blank - powerdown
Dec 7 16:34:46 tpsh-agx-1 kernel: [ 27.472005] iwlwifi 0003:01:00.0: Getting the temperature timed out
Dec 7 16:34:49 tpsh-agx-1 kernel: [ 30.767994] iwlwifi 0003:01:00.0: Getting the temperature timed out
Dec 7 16:35:19 tpsh-agx-1 kernel: [ 60.480571] nvgpu: 17000000.gv11b railgate_enable_store:297 [INFO] railgate is disabled.
Dec 7 16:35:40 tpsh-agx-1 kernel: [ 81.711860] iwlwifi 0003:01:00.0: Getting the temperature timed out
Dec 7 16:36:08 tpsh-agx-1 kernel: [ 109.423941] iwlwifi 0003:01:00.0: Getting the temperature timed out
Dec 7 16:36:15 tpsh-agx-1 kernel: [ 117.071894] iwlwifi 0003:01:00.0: Getting the temperature timed out
Dec 7 16:36:22 tpsh-agx-1 kernel: [ 123.439918] iwlwifi 0003:01:00.0: Getting the temperature timed out
Dec 7 16:36:25 tpsh-agx-1 kernel: [ 126.479883] iwlwifi 0003:01:00.0: Getting the temperature timed out
Dec 7 16:36:37 tpsh-agx-1 kernel: [ 138.512241] FAN rising trip_level:1 cur_temp:50000 trip_temps[2]:63000
Dec 7 16:36:39 tpsh-agx-1 kernel: [ 140.463929] iwlwifi 0003:01:00.0: Getting the temperature timed out