Please try more experiments.
1. Make sure the anchor box sizes are close to the objects’ sizes. In your config file,
your anchors are as below, so they can cover the small objects (87/4, 54/4).
Still, I suggest checking your small objects’ sizes further, to see whether more experiments with different anchor ratios or scales are needed.
array([[[ 8. , 8. ],
[ 5.656854, 11.313708],
[[16. , 16. ],
[[32. , 32. ],
[45.254833, 22.627417]]], dtype=float32)
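As a rough way to check anchor coverage, the sketch below rebuilds anchors from scales and ratios and finds the one that best matches an object size. It is only an illustration: the scales (8, 16, 32) and ratios (1, 0.5, 2) are inferred from the values shown above, the (87/4, 54/4) pair is treated as one object's side lengths, and the helper names `make_anchors`, `shape_iou` and `best_anchor` are hypothetical, not part of TLT.

```python
import math

def make_anchors(scales, ratios):
    """Return a pair of anchor side lengths for every scale/ratio combination.

    Assumed convention: sides are scale*sqrt(ratio) and scale/sqrt(ratio),
    which reproduces the values in the array above.
    """
    return [(s * math.sqrt(r), s / math.sqrt(r)) for s in scales for r in ratios]

def shape_iou(box, anchor):
    """IoU of two boxes that share the same center (side lengths only)."""
    inter = min(box[0], anchor[0]) * min(box[1], anchor[1])
    union = box[0] * box[1] + anchor[0] * anchor[1] - inter
    return inter / union

def best_anchor(box, anchors):
    """Pick the anchor whose shape overlaps the box the most."""
    return max(anchors, key=lambda a: shape_iou(box, a))

# Assumed scales and ratios, inferred from the anchor values in the post.
anchors = make_anchors(scales=[8.0, 16.0, 32.0], ratios=[1.0, 0.5, 2.0])

# Example object size taken from the post, read as two side lengths.
obj = (87 / 4, 54 / 4)
match = best_anchor(obj, anchors)
print("best anchor:", match, "shape IoU:", round(shape_iou(obj, match), 3))
```

Running the same check over all ground-truth boxes in the dataset would show whether many small objects have a low best-match IoU, which is the case where other ratios or scales are worth trying.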
2. Try larger backbones, such as resnet34 or vgg19.
3. Try other networks in TLT as well to see if there is any improvement.
- Try ssd, with lower anchor ratios too.
- Try detectnet_v2. In detectnet_v2, set a lower minimum_bounding_box_height (try 3), lower minimum_height and minimum_width (try 0), and a lower minimum_detection_ground_truth_overlap (try 0.3).
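For reference, the detectnet_v2 settings mentioned above would sit roughly as in the spec fragment below. This is a partial sketch, assuming the usual detectnet_v2 spec layout; the class name "car" is a placeholder for your own class, and all other required fields are omitted.

```
postprocessing_config {
  target_class_config {
    key: "car"  # placeholder class name, replace with yours
    value {
      clustering_config {
        minimum_bounding_box_height: 3
      }
    }
  }
}
evaluation_config {
  minimum_detection_ground_truth_overlap {
    key: "car"
    value: 0.3
  }
  evaluation_box_config {
    key: "car"
    value {
      minimum_height: 0
      minimum_width: 0
    }
  }
}
```

Merge these values into the matching sections of your existing spec file rather than replacing the sections wholesale.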