Please provide the following information when requesting support.
• Hardware (RGX 3080)
• Network Type (Classification)
• TLT Version format_version: 2.0
toolkit_version: 3.22.05
published_date: 05/25/2022()
• Training spec file
• How to reproduce the issue ?
I am experiencing an error similar to this topic:
“When I run tao classification train” the container stops and I get the following error:
As mentioned in the included topic some images seem to be broken. You can google this error message to find ways to locate the images. You may also try to remove all the formatted&splitted datasets and execute the “prepare dataset” again.
It turns out the issue may not be completely solved.
I can run “tao classification train” on the 2-class, 1004 image database.
it completes 1 epoch but at image 1003 of the 2nd epoch it throws the “UnidentifiedImageError”:
I ran “$ ls -la” in the “train” folder and found the following hidden files, in addition to the two dataset folders.
The contents of “.” and “…” are as follows:
The database was created on a Mac, which, I have now learnt, will put this hidden file in any folder it creates. I have deleted it, but the same error persists.
Every file opens as an image.
I have tried running a PIL-based script, which revealed the hidden copies with the original names, but now doesn’t flag anything.
The next thing to try would be to save each .jpg as a .png and retry. Do you have any other suggestion?
I have checked using the bisection method (extensively, though not exhaustively) and tao classification train seems to have a problem with every image. I can run you through the exact procedure I followed if you need that, but it is very repetitive. What criterion is it using for identification? These are all .jpgs that open and have no visible artefacts. Is it possible there is another explanation for this behaviour?
May I also repeat that these datasets run on a Jetson Nano and were used that way in earlier tests.
As a sanity check I will try some other images in two classes to see if they will work.
To keep things as simple as possible I cleared the folders from my “train” directory.
I created 2 new folders, one called “cat” and the other “dog”.
I put the image above in each.
When I run “tao classification train” I get the same UIError.
Here is the single image from the “cat” folder"
It was also the single image in the “dog” folder.
I have now used the utility jpeginfo to verify all files.
The image above is [OK].
Here is a printout of part of the results of the contents of the “damage” folder:
So, you have run two kinds of training dataset, right?
For “cat” and “dog” training, is the training working?
For “damage” and “healthy” training, is the training working?
For “damage” and “healthy” training, is the training working?
no
In both cases, and in all bisection stages, the PIL.UnidentifiedImageError is thrown on the penultimate frame (see earlier posts in this thread). As per my post #18, I have now verified all .jpgs (even the cat).