Several comments here.
- Please use ImageNet-pretrained weight. As mentioned in the blog, you need to train classification models on the ImageNet 2012 classification dataset. Then this ImageNet-pretrained weights can be a starting point to train your YoloV4 model. Pretrained weights trained on the ImageNet dataset tend to provide good accuracy for object detection.
- Please finetune below parameters, for example:
freeze_blocks: 0, comment out this: #freeze_blocks: 0
weight: 3e-5 → weight: 3e-6 - We will have focal loss for yolov4, which will also improve mAP, as we tested.
- In Tao documentation, there is not recommended backbone. End user can modify to any backbone. It just set a typical backbone.