UserWarning: This overload of nonzero is deprecated: (Extremely slow model training)

siftikh1 · April 11, 2021, 10:29pm

Hi, I’ve just started having an issue with the classification_interactive notebook in the DLI Nano course.
I tried to train a ResNet34 and an Inception_v3 there on 6 categories, with 210 images in each. I know we’re not really supposed to use the Nano 2GB for training, but I’ve trained a ResNet34 on it before with 300 images per category; I got results from that run and the training went a lot faster.
This time, however, I get this warning regardless of how I run the notebook, and training is incredibly slow. With Inception_v3 it takes an hour just to make 10% progress in a single epoch.
I’ve read that it’s a bug in PyTorch that has come up frequently, but I don’t quite understand some of the solutions being provided.
Some forums recommend doing this:

But I have no idea where I would implement this bug fix inside the DLI course notebook.
Please help me figure out what to do here.

AastaLLL · April 12, 2021, 3:17am

Hi,

Since the Nano 2GB has limited resources, the slowness may come from a memory or storage shortage.
Would you mind monitoring the device with tegrastats to check that first?

$ sudo tegrastats
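
If it helps to log the numbers while the notebook trains, here is a rough Python sketch that pulls the RAM and swap figures out of one tegrastats line. The sample line and the `parse_tegrastats_line` helper are illustrative assumptions, and the exact output format may differ between JetPack releases.

```python
import re

# Hypothetical sample line; real tegrastats output may differ by JetPack release.
SAMPLE = (
    "RAM 1850/1980MB (lfb 2x1MB) SWAP 900/989MB (cached 30MB) "
    "CPU [45%@1479,40%@1479,38%@1479,50%@1479]"
)

# Matches fields of the form "RAM used/totalMB" and "SWAP used/totalMB".
_USAGE = re.compile(r"\b(RAM|SWAP) (\d+)/(\d+)MB")


def parse_tegrastats_line(line):
    """Return {'RAM': (used_mb, total_mb), 'SWAP': (...)} for the fields found."""
    return {
        name: (int(used), int(total))
        for name, used, total in _USAGE.findall(line)
    }


usage = parse_tegrastats_line(SAMPLE)
for name, (used, total) in usage.items():
    print(f"{name}: {used}/{total} MB ({100 * used / total:.0f}% used)")
```

RAM close to its total together with heavily used swap would point to a memory shortage rather than the warning itself.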

Thanks.

siftikh1 · April 14, 2021, 8:28am

Hello, I am facing the same issue with my Nano 4GB. Could it be an issue with the notebook?

AastaLLL · April 27, 2021, 8:33am

Hi,

The notebook is trying to run a PyTorch model.
Have you checked the memory usage with tegrastats?

Sometimes the slowness comes from a memory shortage, since the device needs to read and write data frequently.
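
The same check can also be made from inside the notebook. Below is a minimal sketch that reads the standard Linux `/proc/meminfo` fields; `parse_meminfo` and `memory_summary` are hypothetical helper names, and the sample values are made up for illustration.

```python
def parse_meminfo(text):
    """Map /proc/meminfo field names to their integer values (kB)."""
    info = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        parts = rest.split()
        if sep and parts and parts[0].isdigit():
            info[name.strip()] = int(parts[0])
    return info


def memory_summary(info):
    """Return available RAM and used swap in MB (hypothetical helper)."""
    swap_used_kb = info.get("SwapTotal", 0) - info.get("SwapFree", 0)
    return {
        "available_mb": info.get("MemAvailable", 0) // 1024,
        "swap_used_mb": swap_used_kb // 1024,
    }


# Made-up sample in the /proc/meminfo layout, so the sketch runs anywhere.
SAMPLE = """MemTotal:        2027456 kB
MemAvailable:     102400 kB
SwapTotal:       1048576 kB
SwapFree:          24576 kB
"""

print(memory_summary(parse_meminfo(SAMPLE)))
# On the device, pass the real file contents instead:
#   with open("/proc/meminfo") as f:
#       print(memory_summary(parse_meminfo(f.read())))
```

Very little available memory together with a lot of used swap during training would suggest the device is spending its time swapping.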

Thanks.
