Hi, I did the “Getting Started with AI on Jetson Nano” course and ran into some strange issues. Has anyone else seen them? To make sure I had the right accessories for the course, I bought the Nano-DLI-kit. During the course both the Raspberry Pi camera and a Logitech USB camera were physically connected, but most of the time I used the USB camera in the Jupyter notebook.
1. During the course the Jetson Nano regularly powered off spontaneously. Is there an auto power-off function on this board, or is this a known issue? The power adapter is a Mean Well GST25B05-P1J, output 5V, 4A, 20W max.
2. While capturing images, some JPGs were zero-sized. This caused the training to hang. To solve it I removed these files and added some new ones. Is this known?
3. I also regularly had a broken connection. The only way out was to turn the power off and on, reboot, connect again, rerun the cells and redo the training.
In 5W mode issue 1 also happened, but it took longer. Issues 2 and 3 did not occur anymore, but the Jetson also became slower because of the lower processing speed.
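For the zero-sized JPGs described above, a quick check before training can save a hung run. The following is only a sketch: the function names and the idea of scanning the whole dataset folder recursively are my own assumptions, not something from the course notebooks.

```python
from pathlib import Path


def find_empty_images(dataset_dir, suffixes=(".jpg", ".jpeg")):
    """Return the paths of image files under dataset_dir that are 0 bytes long."""
    root = Path(dataset_dir)
    return sorted(
        p for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() in suffixes and p.stat().st_size == 0
    )


def remove_empty_images(dataset_dir):
    """Delete the zero-byte images and return the list of removed paths."""
    empty = find_empty_images(dataset_dir)
    for p in empty:
        p.unlink()
    return empty
```

Calling `find_empty_images` on the dataset folder first shows what would be deleted. `remove_empty_images` then does what I did by hand, after which new images can be captured to replace them.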
It looked like a thermal problem. I ran tegrastats without a fan; below is the last line it printed in 5W and in 10W mode before the board powered off:
For the 5W:
RAM 3160/3965MB (lfb 107x4MB) SWAP 42/4096MB (cached 3MB) CPU [98%@921,98%@921,off,off] EMC_FREQ 0% GR3D_FREQ 9% PLL@49.5C CPU@53C PMIC@100C GPU@51C AO@62.5C thermal@53C POM_5V_IN 2728/4064 POM_5V_GPU 120/1157 POM_5V_CPU 882/853
For the 10W:
RAM 3093/3965MB (lfb 107x4MB) SWAP 57/4096MB (cached 9MB) CPU [37%@825,21%@825,21%@825,21%@825] EMC_FREQ 0% GR3D_FREQ 98% PLL@34C CPU@37C PMIC@100C GPU@37C AO@48.5C thermal@37.5C POM_5V_IN 6583/6170 POM_5V_GPU 3429/3015 POM_5V_CPU 709/814
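To compare lines like the two above without reading them by eye, the temperature and power fields can be pulled out with a couple of regular expressions. This is a minimal sketch that assumes the line format shown in this post; other L4T versions may print different fields. The function name is my own.

```python
import re

# Matches fields like "AO@62.5C"; CPU load fields such as "98%@921" do not
# match because "%" is not a word character and they do not end in "C".
TEMP_RE = re.compile(r"(\w+)@(\d+(?:\.\d+)?)C")
# Matches fields like "POM_5V_IN 2728/4064" (current/average, in mW).
POWER_RE = re.compile(r"(POM_\w+) (\d+)/(\d+)")


def parse_tegrastats(line):
    """Return (temps, power) dicts parsed from one tegrastats output line."""
    temps = {name: float(val) for name, val in TEMP_RE.findall(line)}
    power = {name: (int(cur), int(avg)) for name, cur, avg in POWER_RE.findall(line)}
    return temps, power
```

For the 5W line above this gives `temps["AO"] == 62.5` and `power["POM_5V_IN"] == (2728, 4064)`.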
So I bought the recommended fan, a Noctua NF-A4x20 5V PWM. After that, running in 10W mode and even with jetson_clocks gave no more problems.
With the fan the Jetson Nano kept running and AO didn’t reach high temperatures. Maybe there is a dT/dt protection in the Jetson Nano, because 5W mode raises the temperatures slowly and reaches higher temperatures than 10W mode? I didn’t find this in the thermal design guide.
With a fan, the maximum AO temperature I measured was 42C in 5W mode and 50C in 10W mode with jetson_clocks.
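Temperatures like these can also be read directly from the Linux thermal sysfs interface instead of from tegrastats. The sketch below assumes the standard layout with a `type` and a `temp` file (millidegrees Celsius) per `thermal_zone*` directory. The zone names on a given L4T release are not something I have verified, so the code simply reports whatever names it finds.

```python
from pathlib import Path


def read_thermal_zones(base="/sys/class/thermal"):
    """Return {zone type: temperature in C} for every readable thermal zone."""
    readings = {}
    for zone in sorted(Path(base).glob("thermal_zone*")):
        try:
            name = (zone / "type").read_text().strip()
            millideg = int((zone / "temp").read_text().strip())
        except (OSError, ValueError):
            # Some zones can be unreadable or report no value; skip them.
            continue
        readings[name] = millideg / 1000.0
    return readings
```

Calling `read_thermal_zones()` in a loop and keeping the maximum per zone gives the kind of peak value quoted above.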
It looks like a fan is obligatory for deep learning applications, and I think it should have been included in the NANO-DLI-KIT.
Another issue, which I didn’t mention because it does not occur every time and is hard to pin down, is that when powering the Jetson Nano on, it sometimes turns off immediately. When I power it on again, it stays on. I used the Mean Well 5V 4A adapter included in the NANO-DLI-KIT. My guess is that when the Jetson is fully discharged there is a surge current, caused by the capacitance of the capacitors, USB leads and wires, that is higher than 5A and pulls the voltage down. Because the capacitors are already charged the second time, the surge current is lower. It is a wild guess and I don’t have the equipment to measure it. As said, it does not always happen, but I bought a Mean Well 5V 6A adapter as a backup to test this. It hasn’t happened yet with this adapter.
Since it works normally with the 5V 6A adapter, it looks like the load during power-on is too much for the 5V 4A adapter to bring the system up. That can happen because fully discharged capacitors draw extra current at power-on, or because the attached devices do the same. Basically, with no external devices attached and a good power supply, 5V 4A should be able to power on the system without issue.
Hello, I have the same issue and it is driving me mad. I have verified my 5V 4A DC supply (https://www.meanwell-web.com/en-gb/ac-dc-dual-output-enclosed-power-supply-output-rd--65b). 5W mode is not suitable; for my payload I need MAXN:
$ sudo nvpmodel -q
NVPM WARN: fan mode is not set!
NV Power Mode: MAXN
0
The test run ends abnormally: the Jetson switches off. I am sure about the DC power, I checked it.
What should I do? Please help ASAP.
photo: IMG_20190904_165833.jpg - Google Drive
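When logging a test run it can help to record the active power mode alongside the other data. The sketch below parses the `nvpmodel -q` output quoted in the post above. It assumes the output always contains `NV Power Mode:` followed by the mode name and its numeric ID, which is all this thread shows. The function names are made up for illustration.

```python
import re
import subprocess


def parse_nvpmodel_query(output):
    """Return (mode_name, mode_id) from the text printed by `nvpmodel -q`."""
    match = re.search(r"NV Power Mode:\s*(\S+)\s+(\d+)", output)
    if match is None:
        raise ValueError("no power mode found in nvpmodel output")
    return match.group(1), int(match.group(2))


def query_power_mode():
    """Run `sudo nvpmodel -q` and parse the result (needs a Jetson and sudo)."""
    result = subprocess.run(
        ["sudo", "nvpmodel", "-q"],
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,   # the warning line may arrive on stderr
        universal_newlines=True,    # works on the older Python 3 shipped with L4T
        check=True,
    )
    return parse_nvpmodel_query(result.stdout)
```

For the output quoted above, `parse_nvpmodel_query` returns `("MAXN", 0)`. The fan warning line is ignored.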
The message “NVPM WARN: fan mode is not set!” is new to me. Maybe you have a newer version of L4T.
But I don’t see a fan on your heat sink. The main cause of my issues was thermal. Installing a fan like the Noctua NF-A4x20 5V PWM already solved the first three issues. I only see issue #4 when I use the 5V 4A adapter, and only on the first start-up. The second time it keeps running until I turn it off.
If it is not a thermal problem, you might need to probe whether any voltage drop happens before the abnormal shutdown. Do you have another power supply rated above 4A to try?
Yes, I have an ATX supply with 5V/40A. But I’m sure about the power stability. I watched my voltmeter during the whole test and it stayed stable at 5.06V. Proof: my photo link above.
You are measuring 5.06V at the power adapter output. And then there is a two-wire cable. What is the voltage at the barrel with the same current running?
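To put a number on that question, the steady-state drop over the cable is just Ohm's law: V_barrel = V_supply - I * R_cable. The sketch below uses the 5.06V from this thread. The cable resistance and the minimum acceptable voltage are hypothetical values picked for illustration, not measurements or a spec quote.

```python
def voltage_at_load(supply_v, current_a, cable_resistance_ohm):
    """Voltage left at the barrel jack after the drop over the cable."""
    return supply_v - current_a * cable_resistance_ohm


def max_cable_resistance(supply_v, min_load_v, current_a):
    """Largest round-trip cable resistance that keeps the load above min_load_v."""
    return (supply_v - min_load_v) / current_a


# Hypothetical example: 0.05 ohm round-trip cable resistance at 4 A.
barrel_v = voltage_at_load(5.06, 4.0, 0.05)        # about 4.86 V
# Hypothetical 4.75 V lower limit: about 0.0775 ohm of cable is allowed.
r_limit = max_cable_resistance(5.06, 4.75, 4.0)
```

Even a few hundredths of an ohm matter at 4A, which is why a reading taken at the adapter output says little about the voltage at the barrel.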
Speaking only from the hardware side: the drop is generally a very short pulse that only an oscilloscope with a trigger can capture; a multimeter cannot. A power supply can have poor transient response even though its rated output meets the requirement.
Do you have the full log up to the point of shutdown? It might help to find the root cause.
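One way to get a log that survives a sudden power-off is to write every tegrastats line to disk and force it out of the cache immediately. This is only a sketch: it assumes `tegrastats` is on the PATH and accepts `--interval` in milliseconds. The function names and the log path in the usage comment are placeholders.

```python
import os
import subprocess
import time


def log_until_power_loss(lines, log_path):
    """Append each line with a timestamp and sync it to disk right away."""
    count = 0
    with open(log_path, "a") as log:
        for line in lines:
            stamp = time.strftime("%Y-%m-%d %H:%M:%S")
            log.write("{} {}\n".format(stamp, line.rstrip()))
            log.flush()
            os.fsync(log.fileno())  # make sure the last line is on disk
            count += 1
    return count


def tegrastats_lines(interval_ms=1000):
    """Yield output lines from a running tegrastats process."""
    proc = subprocess.Popen(
        ["tegrastats", "--interval", str(interval_ms)],
        stdout=subprocess.PIPE,
        universal_newlines=True,
    )
    try:
        for line in proc.stdout:
            yield line
    finally:
        proc.terminate()


# Usage on the Jetson (placeholder path):
#   log_until_power_loss(tegrastats_lines(), "tegrastats_shutdown.log")
```

After the board switches off and is rebooted, the last line in the file shows the temperatures and power draw just before the shutdown.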