I am following Nvidia's “Hello AI World” tutorial on my new Jetson Nano dev kit (4GB). In the 3rd video (here), a cat/dog classifier is trained on top of the existing network. The command is: python3 train.py --model-dir=models/cat_dog data/cat_dog
When I run it, it aborts with a “killed” message.
When I add the flags that are supposed to reduce memory use, "--batch-size=4 --workers=1 --epochs=1", it starts running but then aborts with “OSError: [Errno 12] Cannot allocate memory”.
Yet in the video it runs fine on the 2GB model, while I am using the 4GB model.
In my case, I also terminated all other applications.
Any idea why it can’t finish the task?
Can I execute the training outside the Jetson Nano?
I restarted a few times. At first that didn’t solve it. Then it ran fine once I made sure nothing else was running. That makes me think the memory needed is right at the edge of what is available.
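To see how close to the limit I am, something like the sketch below could be run just before starting the training. It assumes a Linux system that exposes /proc/meminfo (which the Jetson Nano should); the helper names parse_meminfo and memory_summary are my own and not part of the tutorial.

```python
import os


def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of integer values (mostly kB)."""
    info = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        if not sep:
            continue  # skip lines that are not "Key: value"
        fields = rest.split()
        if fields and fields[0].isdigit():
            info[key.strip()] = int(fields[0])
    return info


def memory_summary(info):
    """Return the figures most relevant to an out-of-memory abort, in MiB."""
    return {
        "total_mib": info.get("MemTotal", 0) / 1024,
        "available_mib": info.get("MemAvailable", 0) / 1024,
        "swap_total_mib": info.get("SwapTotal", 0) / 1024,
        "swap_free_mib": info.get("SwapFree", 0) / 1024,
    }


if __name__ == "__main__":
    path = "/proc/meminfo"  # assumption: standard Linux procfs location
    if os.path.exists(path):
        with open(path) as f:
            summary = memory_summary(parse_meminfo(f.read()))
        for name, value in summary.items():
            print(f"{name}: {value:.0f}")
    else:
        print("No /proc/meminfo found; this sketch only works on Linux.")
```

If available_mib is low and swap_total_mib is 0 right before training, that would fit with the process being killed for lack of memory.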