Jetson Nano crashes after boot

anil.beereddy · February 24, 2021, 7:15am

I have a Jetson Nano development kit B01 that I run some CV workloads on. All of my CV workloads are run as docker containers and apart from these, an openvpn client service is the only one that I run on the device.

The device is now crashing continuously even when I try to run simple commands like ls, tail or cat. The Nano becomes unresponsive on crash and restarts. Tried a different SD Card with this device and it works fine.

How do I debug this? Unable to print syslog or dmesg either. Anyhelp is greatly appreciated.

Thanks in advance

WayneWWW · February 24, 2021, 8:35am

Hi,

What kind of “crash” do you see if you run “ls”? It sounds the root file system is corrupted.

anil.beereddy · February 24, 2021, 8:47am

Nah, I don’t think root file system is corrupted. At least I’m able to check the remaining space on disk and free memory and swap details

WayneWWW · February 24, 2021, 8:52am

What kind of error do you see when you use command “ls” “cat” “tail” ?

anil.beereddy · February 24, 2021, 10:59am

I should’ve clarified better. The system crashes; nothing is responsive and the Nano restarts

WayneWWW · February 24, 2021, 1:03pm

Are you able to give the serial console log when error happens?

https://elinux.org/Jetson/General_debug

anil.beereddy · February 25, 2021, 11:03am

Is there any other way to do it? I don’t have access to a PL2303HX TTL to USB cable

WayneWWW · February 26, 2021, 2:43am

Sorry that I think this is the only way to gather the detail log.

anil.beereddy · March 15, 2021, 10:31am

I finally got UART access. This is what is printed on the serial console

[   39.304927] mmc0: Data timeout error
[   39.308607] sdhci: =========== REGISTER DUMP (mmc0)===========
[   39.314525] sdhci: Sys addr: 0x00000400 | Version:  0x00000303
[   39.320426] sdhci: Blk size: 0x00007200 | Blk cnt:  0x00000338
[   39.326320] sdhci: Argument: 0x03c51b48 | Trn mode: 0x0000003b
[   39.332215] sdhci: Present:  0x01fb0000 | Host ctl: 0x00000017
[   39.338107] sdhci: Power:    0x00000001 | Blk gap:  0x00000000
[   39.344000] sdhci: Wake-up:  0x00000000 | Clock:    0x00000007
[   39.349891] sdhci: Timeout:  0x0000000e | Int stat: 0x00000000
[   39.355785] sdhci: Int enab: 0x02ff100b | Sig enab: 0x02fc100b
[   39.361677] sdhci: AC12 err: 0x00000000 | Slot int: 0x00000000
[   39.367570] sdhci: Caps:     0x376cd08c | Caps_1:   0x10006f73
[   39.373462] sdhci: Cmd:      0x0000123a | Max curr: 0x00000000
[   39.379349] sdhci: Host ctl2: 0x0000308b
[   39.383337] sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0x00000000ffefe420
[   39.389960] sdhci: ===========================================
[   39.397021] mmcblk0: error -110 transferring data, sector 63249224, nr 1024, cmd response 0x900, 0
[   50.854416] mmc0: Data timeout error
[   50.858116] sdhci: =========== REGISTER DUMP (mmc0)===========
[   50.864033] sdhci: Sys addr: 0x00000400 | Version:  0x00000303
[   50.869935] sdhci: Blk size: 0x00007200 | Blk cnt:  0x00000324
[   50.875832] sdhci: Argument: 0x03c51b48 | Trn mode: 0x0000003b
[   50.881724] sdhci: Present:  0x01fb0000 | Host ctl: 0x00000017
[   50.887618] sdhci: Power:    0x00000001 | Blk gap:  0x00000000
[   50.893492] sdhci: Wake-up:  0x00000000 | Clock:    0x00000007
[   50.899332] sdhci: Timeout:  0x0000000e | Int stat: 0x00000000
[   50.905156] sdhci: Int enab: 0x02ff100b | Sig enab: 0x02fc100b
[   50.910993] sdhci: AC12 err: 0x00000000 | Slot int: 0x00000000
[   50.916863] sdhci: Caps:     0x376cd08c | Caps_1:   0x10006f73
[   50.922744] sdhci: Cmd:      0x0000123a | Max curr: 0x00000000
[   50.928602] sdhci: Host ctl2: 0x0000300b
[   50.932548] sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0x00000000ffefe420
[   50.939148] sdhci: ===========================================

The above message is repeated multiple times and the device restarts. I suspect there might be something wrong with the memory sectors, but I’m not sure how to understand what’s causing it or how to fix it.

Attaching the file with complete log
start_to_reboot.txt (41.6 KB)

WayneWWW · March 15, 2021, 10:51am

That error is from sdcard driver.

Tried a different SD Card with this device and it works fine.

If this issue could be resolved with different sdcard, I don’t think it is a hardware defect on nano.

Is it possible for you to format this card and reinstall sdcard image or sdkmanager ?

anil.beereddy · March 15, 2021, 10:54am

I can do this, but that wouldn’t help me understand why this happens or help me avoid such situations in the future. Is there anyway for me to find out what is happening?

WayneWWW · March 15, 2021, 11:06am

I cannot tell either. If this issue could be easily reproduced with specific steps, then we can try it with our device and investigate.

However, so far I guess even you don’t know how it was crashed.

Topic		Replies	Views
Jetson nano SD card can use,but kernel prompt error Jetson Nano kernel , nvbugs	3	3179	October 18, 2021
Occasional Crash - strange output pattern Jetson Nano ubuntu	16	509	December 15, 2021
Jetson Nano does not boot (stuck on boot logs) Jetson Nano boot	15	3145	October 15, 2021
Jetson Nano Dev Kit w/B01 carrier - no console output and will not boot Jetson Nano boot	22	2138	March 7, 2023
Jetson Nano crash after 20-30 hours of running DS pipeline DeepStream SDK	4	388	October 12, 2021
CRC Error During Software Reboot on Jetson Nano with Custom Carrier Board (L4T 32.6.1) Jetson Nano boot , board-design	10	34	December 5, 2024
Jetson Nano No Response Jetson Nano boot , nano2gb	29	1997	February 11, 2022
Jetson Nano Crashes at high clock speeds Jetson Nano	3	1248	November 17, 2019
Jetson nano 4gb is not booting suddenly Jetson Nano boot	3	1476	January 13, 2022
Jetson Nano 4GB dev kit not flashing Jetson Nano reflash	13	197	June 10, 2024

Jetson Nano crashes after boot

Related topics