jetson nano freezes

Hi!
I am trying to use my new jetson nano. Unfortunately, it freezes at random time points anywhere between approx. 2-30 min after powering up. In that state, both LEDs of the ethernet jack are permanently on in addition to the power LED. The monitor on the DisplayPort doesn’t receive any data, and the jetson is not pingable. sysloggin via ssh doesn’t yield any insights.
From the posts I found, I assumed it might be an issue with the power supply, or overheating. As it is pretty cool to the touch, I am not expecting overheating. I have tried multiple power supplies (2.5A and 3.1A on the MicroUSB, 4A on the power jack with the jumper on J48 set). The outcome is the same every time. I also suspected the SD card to be a culprit, but as changing did not have an impact (SanDisk Extreme PRO microDXC USH-I 128HG, 170MB/s read, 90MB/s write and alternatively a SanDisk 32GB card that has been working in a raspberry Pi 3B+). Next, I tried what happens if I don’t have a network cable connected: again, same thing.
If I don’t wait a few minutes between unplugging the power and replugging, the network LEDs are back on, so it is still caught in the same stage - after a few minutes, however, I can do a proper restart.
Only having DisplayPort, USB mouse and USB keyboard plugged in, I feel there is not much I can get rid of any more.
Does anyone have a similar experience? Or an idea of what else to try?
Thanks!

1 Like

I would have tried the same things as you already did.

Two more suggestions:

  • Did you verify that the CPU/GPU modul is correctly inserted into its socket (the DIMM like socket)?
  • Did you flash a fresh image to the SD card without any software modifications?

Just to make sure: There is no Wifi card plugged into the PCIe connector nor a camera in the CSI port?

Thanks for your suggestions:

  • yes, the CPU/GPU module sits in the socket correctly
  • yes, the SD card is freshly flashed, according to the instructions and with the image from nvidia.com/jetsonnano-start. I did not alter anything apart from specifying language and timezone, as well as accepting the license.
  • no wifi nor camera nor any peripherals other than keyboard, mouse, and monitor are plugged in.

Hi,

Are you about to enable the serial console and see what is the last log before it dies?

Does the monitor just go blank and cannot wake up anymore? If you set the display to never sleep, would it die too?

If I set the display to never sleep, the issue persists.

When hooking up to the serial console and using minicom (on ubuntu LTS16.04), as described in the link, I don’t get an intelligible log, but something like the following:

                                                                                                                                                                                             �
                                                                                                                                                                                             �                                                                                                                                                                                                 �                                                                                                                                                                                                 �                                                                                                                                                                                                 �                                                                                                                                                                                                 �                                                                                                                                                                                                 �                                                                                                                                                                                                 �                                                                                                                                                                                                 �
                                                                                                                                                                                             �                                                                                                                                                                                                 �                                                                                                                                                                                                 �                                                                                                                                                                                                 �                                                                                                                    �����                                                                        �y=�3#9�=������'-#5���@����6-�V��u����婺��j����t�j:�������������Vo ����6���V������������j���޺�{���j:������������������k�                                                                          3�������R���6��V�j:����;       ��                                                                                      ��u**�j�j��4����{�;j��+�@����6-�֬��u**��k�j��j:���Ժ�����ڴiTo�}�����         =
                       ��
                            �t�

                               ��}��*�{���֭oH}���6-��VR���6R���6

I switched the USB converter, and now got a readable log (see below). The issue seems to get worse; now the Ethernet LEDs immediately turn on, and I can’t even log on etc. This cannot be reverted by changing SD card or re-flashing. So I guess it is hardware-related…

.[0000.302] [TegraBoot] (version 00.00.2018.01-l4t-89b97a49)
[0000.307] Processing in cold boot mode Bootloader 2
[0000.312] A02 Bootrom Patch rev = 1023
[0000.315] Power-up reason: pmc por
[0000.319] No Battery Present
[0000.321] pmic max77620 reset reason
[0000.325] pmic max77620 NVERC : 0x40
[0000.328] RamCode = 0
[0000.330] Platform has DDR4 type RAM
[0000.334] max77620 disabling SD1 Remote Sense
[0000.338] Setting DDR voltage to 1125mv
[0000.342] Serial Number of Pmic Max77663: 0x403a3
[0000.349] Entering ramdump check
[0000.352] Get RamDumpCarveOut = 0x0
[0000.356] RamDumpCarveOut=0x0, RamDumperFlag=0xe59ff3f8
[0000.361] Last reboot was clean, booting normally!
[0000.365] Sdram initialization is successful
[0000.369] SecureOs Carveout Base=0x00000000ff800000 Size=0x00800000
[0000.376] Lp0 Carveout Base=0x00000000ff780000 Size=0x00001000
[0000.381] BpmpFw Carveout Base=0x00000000ff700000 Size=0x00080000
[0000.387] GSC1 Carveout Base=0x00000000ff600000 Size=0x00100000
[0000.393] GSC2 Carveout Base=0x00000000ff500000 Size=0x00100000
[0000.399] GSC4 Carveout Base=0x00000000ff400000 Size=0x00100000
[0000.405] GSC5 Carveout Base=0x00000000ff300000 Size=0x00100000
[0000.411] GSC3 Carveout Base=0x000000017f300000 Size=0x00d00000
[0000.427] RamDump Carveout Base=0x00000000ff280000 Size=0x00080000
[0000.433] Platform-DebugCarveout: 0
[0000.436] Nck Carveout Base=0x00000000ff080000 Size=0x00200000
[0000.442] Non secure mode, and RB not enabled.
[0000.447] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.457] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.464] Number of retries left 4
[0000.467] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.477] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.484] Number of retries left 3
[0000.487] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.497] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.504] Number of retries left 2
[0000.507] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.517] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.524] Number of retries left 1
[0000.527] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.537] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.544] Number of retries left 0
[0000.547] Send command failed with 0x3
[0000.550] CMD8 send failed. Retrying CMD0/CMD8 (2)…
[0000.556] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.566] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.572] Number of retries left 4
[0000.576] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.586] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.592] Number of retries left 3
[0000.596] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.606] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.612] Number of retries left 2
[0000.616] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.626] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.632] Number of retries left 1
[0000.636] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.646] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.652] Number of retries left 0
[0000.656] Send command failed with 0x3
[0000.659] CMD8 send failed. Retrying CMD0/CMD8 (1)…
[0000.664] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.674] Command complete wait failed with error 0x3 Interrupt 0x18001
[0000.681] Number of retries left 4
[0000.684] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.695] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.701] Number of retries left 3
[0000.704] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.715] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.721] Number of retries left 2
[0000.725] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.735] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.741] Number of retries left 1
[0000.745] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.755] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.761] Number of retries left 0
[0000.764] Send command failed with 0x3
[0000.768] CMD8 send failed. Retrying CMD0/CMD8 (0)…
[0000.773] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.783] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.789] Number of retries left 4
[0000.793] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.803] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.809] Number of retries left 3
[0000.813] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.823] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.830] Number of retries left 2
[0000.833] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.843] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.850] Number of retries left 1
[0000.853] Error mask set in wait for cmd complete with error 0x3 in HwSdmmcWaitForCommandComplete func at 278 line
[0000.863] Command complete wait failed with error 0x3 Interrupt 0x18000
[0000.870] Number of retries left 0
[0000.873] Send command failed with 0x3
[0000.876] CMD55 send failed with error 0x3 in SdIdentifyCard func at 1891 line
[0000.883] SD Identify card failed with 0x3
[0000.887] SdIdentifyCard has failed with error 0x3 in NvTbootSdmmcInit func at 87 line
[0000.895] Sdmmc Init failed with 0x3 error
[0000.899] Error in NvTbootLoadBinary: 0x3 !
[0000.903] failed to load NvTbootTbootCpu from (4:0)
[0000.907] re-load NvTbootTbootCpu from (2:0)
[0000.913] Invalid GPT Partition
[0000.918] Invalid GPT Partition
[0000.921] Error is 1

Looks like a hardware issue and SD card fails to start… Please file a RMA request.

The process of filing a RMA request is described here:
https://www.developer.nvidia.com/embedded/faq
Additionally, do have your invoice and a picture of the Jetson Nano showing its serial number at hand.