PIL.UnidentifiedImageError: cannot identify image file <_io.BytesIO

pddarrell · September 24, 2022, 3:28pm

Please provide the following information when requesting support.

• Hardware (RGX 3080)
• Network Type (Classification)
• TLT Version format_version: 2.0
toolkit_version: 3.22.05
published_date: 05/25/2022()
• Training spec file
• How to reproduce the issue ?

I am experiencing an error similar to this topic:

“When I run tao classification train” the container stops and I get the following error:

There are 2 classes and these are the files:
Uploading: Screenshot from 2022-09-24 16-01-55.png…
Uploading: Screenshot from 2022-09-24 16-02-25.png…
I have checked the files thumbnails for each and all the files are .jpg image files e.g:
Uploading: Screenshot from 2022-09-24 16-03-18.png…

Please advise

yingliu · September 25, 2022, 1:50pm

As mentioned in the included topic some images seem to be broken. You can google this error message to find ways to locate the images. You may also try to remove all the formatted&splitted datasets and execute the “prepare dataset” again.

Morganh · September 25, 2022, 3:37pm

Please refer to the debug method and solution in TAO - Custom Mask RCNN - Dataset Convert error - #3 by Morganh and TAO - Custom Mask RCNN - Dataset Convert error - #4 by IainA

pddarrell · October 5, 2022, 4:07pm

Hi again.

I am struggling to re-purpose your solution to the mask-RCNN topic, where you suggested using the following to check if an image can be identified:

I have tried the following and I am getting a syntax error:

Please give me some very simple instructions on how to check the 1004 .jpgs which are in two folders inside the “train” directory.

Thank you

Morganh · October 5, 2022, 4:50pm

Hi,
Why did you run “tao tao_vod run /bin/bash” ? There is no tao_vod.

Can you use "tao mask_rcnn run /bin/bash " ?

pddarrell · October 5, 2022, 6:38pm

I was running it because that is the notebook I am using.

I have solved the problem.

It turns out my machine was keeping copies of the files with their original names as hidden files. I discovered this by using the script found here:

I then just had to run “rm ._*” to remove the rogue files.

Thank you

Morganh · October 6, 2022, 5:11am

Glad to know the issue is fixed.

pddarrell · October 6, 2022, 9:51am

It turns out the issue may not be completely solved.

I can run “tao classification train” on the 2-class, 1004 image database.
it completes 1 epoch but at image 1003 of the 2nd epoch it throws the “UnidentifiedImageError”:

I ran “$ ls -la” in the “train” folder and found the following hidden files, in addition to the two dataset folders.
The contents of “.” and “…” are as follows:

part of the result of “cat .DS_Store” is as follows:

cat ._.DS_Store gives:

Are these what are causing the error?
Do I fix this in tao_mounts?

Please advise

Morganh · October 6, 2022, 1:40pm

You can try to delete .DS_Store.

pddarrell · October 6, 2022, 3:45pm

Hi, that did not fix it:

Is there any reason not to delete ._.DS_store too?

Morganh · October 6, 2022, 3:48pm

Why there is ._.DS_store file in your training data folder?
You need to check it.
If it is not necessary, please delete it.

pddarrell · October 6, 2022, 4:36pm

The database was created on a Mac, which, I have now learnt, will put this hidden file in any folder it creates. I have deleted it, but the same error persists.

Does the 1003/1004 have any significance?

Morganh · October 6, 2022, 4:44pm

I think there still be something wrong in some files under your training data folder.
Please check further.

pddarrell · October 6, 2022, 4:55pm

Every file opens as an image.
I have tried running a PIL-based script, which revealed the hidden copies with the original names, but now doesn’t flag anything.
The next thing to try would be to save each .jpg as a .png and retry. Do you have any other suggestion?

Morganh · October 6, 2022, 4:57pm

Suggest you to check your training data folder via bisection method.

pddarrell · October 7, 2022, 3:25pm

I have checked using the bisection method (extensively, though not exhaustively) and tao classification train seems to have a problem with every image. I can run you through the exact procedure I followed if you need that, but it is very repetitive. What criterion is it using for identification? These are all .jpgs that open and have no visible artefacts. Is it possible there is another explanation for this behaviour?

May I also repeat that these datasets run on a Jetson Nano and were used that way in earlier tests.

As a sanity check I will try some other images in two classes to see if they will work.
Screenshot from 2022-10-07 18-24-25
To keep things as simple as possible I cleared the folders from my “train” directory.
I created 2 new folders, one called “cat” and the other “dog”.
I put the image above in each.
When I run “tao classification train” I get the same UIError.

I don’t think its the .jpgs

Please advise

Morganh · October 8, 2022, 3:34pm

So, do you mean you can still meet “PIL.UnidentifiedImageError” even with very few images?
Can you share some of them?

pddarrell · October 8, 2022, 4:21pm

Here is the single image from the “cat” folder"

It was also the single image in the “dog” folder.

I have now used the utility jpeginfo to verify all files.
The image above is [OK].
Here is a printout of part of the results of the contents of the “damage” folder:

Here is a printout of part of the results of the contents of the “healthy” folder:

Every file is [OK]
Here is an example from the “damage” folder:

Here is an example from the “healthy” folder:

(these images are confidential)

I do not think the .jpgs are the problem here.

However the “damage” image does not currently show in the response when published. Here is a screen grab of it in the preview window.

Please advise

Morganh · October 8, 2022, 4:27pm

So, you have run two kinds of training dataset, right?
For “cat” and “dog” training, is the training working?
For “damage” and “healthy” training, is the training working?

pddarrell · October 8, 2022, 4:36pm

No

For “damage” and “healthy” training, is the training working?
no

In both cases, and in all bisection stages, the PIL.UnidentifiedImageError is thrown on the penultimate frame (see earlier posts in this thread). As per my post #18, I have now verified all .jpgs (even the cat).