Jetson AI Fundamentals - S3E2 - Image Classification Inference - low accuracy after 70 epochs by using imagenet

ryan_sg · October 26, 2020, 7:10am

I followed the video below to train the cat_dog model.

Jetson AI Fundamentals - S3E2 - Image Classification Inference

Although I successfully trained the model (model_best.pth.tar was saved with an updated timestamp), the accuracy on dog pictures is lower than 20% (using the photos in the “test” folder), while it is almost 90% for cat photos. I trained the model for 70 epochs and got an Acc@1 accuracy of 79%.

I get a “Segmentation fault (core dumped)” after the model is trained, but according to this thread (Segmentation fault (core dumped) on jetson nano when training resnet-18 on my small dataset of just 60 images using transfer learning!), that is fine because the segfault happens after training is done.

Would you please tell me what to do next?
P.S. I also converted the model to ONNX successfully, with an updated timestamp (a segmentation fault also occurred while converting).

AastaLLL · October 26, 2020, 8:33am

Hi,

May I know whether your next step is to improve the accuracy or to run inference on the Nano?

Thanks.

ryan_sg · October 26, 2020, 8:38am

Hi
I’d like to reproduce the result from the S3E2 video, which is an accuracy of more than 70% for both the dog and cat classes, using the photos in the test folder.

The accuracy I’m referring to here means correctly identifying 70 dog photos out of 100; at the moment I only get 20.
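The per-class figure described here can differ a lot from a single overall number. A minimal Python sketch of the arithmetic follows; the function names are hypothetical, and the counts are only illustrative values taken from the rough figures quoted in this thread (about 90 of 100 cats and 20 of 100 dogs), not measured results.

```python
def per_class_accuracy(correct, total):
    """Return {class_name: fraction classified correctly} from per-class counts."""
    return {name: correct[name] / total[name] for name in total}


def overall_accuracy(correct, total):
    """Return the fraction of all test images classified correctly."""
    return sum(correct.values()) / sum(total.values())


# Illustrative counts only (assumed from the approximate figures in the thread).
correct = {"cat": 90, "dog": 20}
total = {"cat": 100, "dog": 100}

for name, acc in per_class_accuracy(correct, total).items():
    print(f"{name}: {acc:.0%}")
print(f"overall: {overall_accuracy(correct, total):.0%}")
```

With these counts the overall figure is 55%, which hides the fact that one class is recognized well and the other poorly, so it is worth checking each class separately.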

dusty_nv · October 26, 2020, 6:20pm

Hi @ryan_sg, can you try deleting the .engine file in your model’s folder? TensorRT will then re-generate the engine from your ONNX the next time you run the imagenet program.

It’s possible that you are using an old engine if you had exported the ONNX previously, and it needs to be re-generated against your latest ONNX model.
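For anyone who prefers to script this step, here is a minimal Python sketch that removes cached engine files from a model folder. The function name is hypothetical, and it assumes only that the cached engines are regular files whose names end in .engine and sit directly in the model folder; the folder path in the usage comment is an assumption as well.

```python
from pathlib import Path


def remove_stale_engines(model_dir):
    """Delete every file directly inside model_dir whose name ends in .engine.

    Returns the names of the removed files so the caller can log them.
    """
    removed = []
    for path in sorted(Path(model_dir).iterdir()):
        if path.is_file() and path.name.endswith(".engine"):
            path.unlink()
            removed.append(path.name)
    return removed


# Example usage (the folder name is an assumption, adjust it to your setup):
#   print(remove_stale_engines("models/cat_dog"))
```

The ONNX file and labels are left untouched, so the next imagenet run only has to rebuild the engine.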

ryan_sg · October 27, 2020, 4:09am

Hi @dusty_nv
I removed the .engine file, trained the model again (1 epoch), and converted it to ONNX again.
However, this time 90 out of 100 dog photos were identified correctly, but only 10 out of 100 cat photos.

I can see the new .engine file being regenerated when I run the “imagenet” command. Although I cannot get the same result as you show in the video, the overall accuracy is around 50%.

I downloaded the model and the ONNX from the link that you provided for the 100-epoch training and ran the test again; this time the overall accuracy is around 80%, for both cat and dog. I think I need to train the model for more epochs to get a more accurate result.

Just out of curiosity: if I train the model for 100 epochs and the final model_best.pth.tar is saved at around 80% accuracy, and then I train the model again for 1 epoch, does the result of the 1-epoch training overwrite the result of the 100 epochs?

dusty_nv · October 27, 2020, 4:31pm

Yes, unless you ran train.py with the --resume=<CHECKPOINT> flag, the result of 1 epoch would overwrite the model_best.pth.tar of your previous 100 epoch run. That is because by default, the training starts fresh from the first epoch (unless you are resuming the training with the --resume and --start-epoch flags).

It’s probably a good idea to make a backup copy of your model_best.pth.tar after a long training run to prevent inadvertent overwriting.
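One way to make such a backup is sketched below in Python. This is only an illustration: the function name, the timestamp prefix, and the backup folder are all assumptions, and a plain file copy done by hand works just as well.

```python
import shutil
import time
from pathlib import Path


def backup_checkpoint(checkpoint, backup_dir=None):
    """Copy a checkpoint to a timestamped file and return the new path.

    If backup_dir is not given, the copy is placed next to the original.
    """
    src = Path(checkpoint)
    dest_dir = Path(backup_dir) if backup_dir else src.parent
    dest_dir.mkdir(parents=True, exist_ok=True)
    # Prefix the name with the current time so earlier backups are not replaced.
    stamp = time.strftime("%Y%m%d-%H%M%S")
    dest = dest_dir / f"{stamp}-{src.name}"
    shutil.copy2(src, dest)  # copy2 also preserves the file's modification time
    return dest


# Example usage (paths are assumptions, adjust them to your setup):
#   backup_checkpoint("models/cat_dog/model_best.pth.tar", "models/cat_dog/backups")
```

Running this after each long training run keeps the best checkpoint safe even if a later short run starts from scratch and writes a new model_best.pth.tar.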
