I have already annotated 900 pictures in CVAT, and most of them are annotated as shown below:
Inside the bounding box for the face I added 2 extra bounding boxes, which are meant to detect the nose and mouth. However, after training on the dataset (30 epochs) and running the model, detectnet is unable to box (detect) the nose and mouth, as shown in the picture below:
The next issue is that multiple bounding boxes appear for a single detection, as shown in the picture above. There are 2, and sometimes 3 or more, overlapping bounding boxes for “No_Mask”.
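In case it helps with diagnosing the overlapping boxes, here is a minimal sketch of how the overlap between two of them could be measured as intersection over union (IoU). The function name `iou` and the coordinates are my own placeholders, not detectnet output, and I am assuming boxes given as (left, top, right, bottom) in pixels.

```python
def iou(box_a, box_b):
    """Intersection over union of two (left, top, right, bottom) boxes."""
    # Corners of the overlapping rectangle, if the boxes overlap at all
    left = max(box_a[0], box_b[0])
    top = max(box_a[1], box_b[1])
    right = min(box_a[2], box_b[2])
    bottom = min(box_a[3], box_b[3])

    # Width or height becomes 0 when the boxes do not overlap
    inter = max(0.0, right - left) * max(0.0, bottom - top)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Made-up coordinates for two overlapping "No_Mask" boxes
box_1 = (100, 80, 300, 320)
box_2 = (120, 90, 310, 330)
print(f"IoU = {iou(box_1, box_2):.2f}")
```

A value close to 1 would mean the boxes cover almost the same region, which is what I seem to be seeing with the duplicate “No_Mask” boxes.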
For now, I have no idea what causes these issues, but there are a few things that I think might contribute:
- Low-quality images used for the dataset
- Not enough pictures in my dataset
- Inaccurate annotations
- CVAT parameters not assigned properly, as shown in the picture below. I do not know whether I need to enter anything for ‘Overlap size’ and ‘Segment size’ in order for the re-trained model to detect the mouth and nose. Previously, I did not assign any value to these parameters.
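To check the annotation-related points above, one thing I could do is count how many boxes of each label actually ended up in the exported annotations. Below is a minimal sketch, assuming the CVAT export is in Pascal VOC format (one XML file per image with `<object><name>` entries). The helper names, the folder argument, and the `Nose` and `Mouth` label names are placeholders of mine.

```python
import xml.etree.ElementTree as ET
from collections import Counter
from pathlib import Path


def count_labels(xml_texts):
    """Count <object><name> entries across Pascal VOC annotation documents."""
    counts = Counter()
    for text in xml_texts:
        root = ET.fromstring(text)
        for obj in root.iter("object"):
            name = obj.findtext("name")
            if name:
                counts[name.strip()] += 1
    return counts


def count_labels_in_dir(annotations_dir):
    """Count labels in every .xml file of a folder (the path is hypothetical)."""
    paths = sorted(Path(annotations_dir).glob("*.xml"))
    return count_labels(p.read_text(encoding="utf-8") for p in paths)


# Tiny inline example; the label names are placeholders, not my real label set
sample = """<annotation>
  <object><name>No_Mask</name></object>
  <object><name>Nose</name></object>
  <object><name>Mouth</name></object>
</annotation>"""
print(count_labels([sample]))
```

If the nose and mouth labels turned out to be missing or far less frequent than the face labels, that would point to the annotations or the export rather than the training itself.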
I hope someone can help and suggest something to solve this problem :c