Hi all,
@AastaLLL @dusty_nv
Firstly, big thank you guys for helping me to figure it out, and everything you created here to make it easier for people like me (with no background) to hoop on the inference\DL\AI train :)
@AastaLLL - yes, I followed the steps to the point (only customized the batch size in order to make it easier on my 2GB nano).
I did not get any errors.
@dusty_nv
The accuracy was around 67% after each epoch , however it was the same exact number, something like 67.513 (I don’t remember exactly) - which I found weird.
I did run your 100 epochs onnx file and it recognized the cats\dogs pretty well (except the Siamese cats, which he thought were dogs).
However, I have an update - meanwhile I followed the ROS2 installation (here too, I am learning as I go) as my final goal is an autonomous robot, and for that I also did the steps of “building the project from source” - which I did not do until now as I worked exclusively with the container, and after that I successfully trained 40 epochs cat_dog model which do recognizes cats and dogs pretty well (and presents both labels accordingly).
While building the project from source, I noticed that I was missing a lot of CUDA drivers (?) or something like that and it took a good 40 minutes to install it - maybe this have something to do with the fix.
Clarification: I did the ROS2 installation before I saw you reply of trying to delete the *.engine file, and after running the training again (and it worked) I did not delete anything, however, I did create a brand new folder for this training as I gave it a different name (so I believe the *.engine file was newly created for this run).
I thought that building the container was enough for the inference recognition projects, but maybe I misunderstood the guide\YouTube tutorials?
Soon I will collect my own dataset for detection project, hope I won’t run again to one label problem :)
Thank you all!!!