YOLOv4 accuracy difference between TAO and Darknet

Several comments here.

  1. Please use ImageNet-pretrained weights. As mentioned in the blog, you need to train a classification model on the ImageNet 2012 classification dataset; the resulting ImageNet-pretrained weights can then serve as the starting point for training your YOLOv4 model. Weights pretrained on ImageNet tend to provide good accuracy for object detection.
  2. Please tune the following parameters, for example:
    freeze_blocks: 0 → comment this line out: #freeze_blocks: 0
    weight: 3e-5 → weight: 3e-6
  3. We will add focal loss for YOLOv4, which also improves mAP in our tests.
  4. The TAO documentation does not recommend a particular backbone. It only sets a typical backbone as an example, and end users can change it to any backbone.
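
For item 2, the two changes might look like this in the training spec. This is only a sketch: it assumes the usual layout of a TAO YOLOv4 spec file, with freeze_blocks under yolov4_config and weight inside the regularizer block of training_config. The L1 regularizer type is an assumption as well, and all other fields are left out. Please check the names against your own spec file before copying anything.

```
yolov4_config {
  # other fields unchanged
  # Commented out so that no backbone blocks are frozen during training.
  #freeze_blocks: 0
}
training_config {
  # other fields unchanged
  regularizer {
    type: L1      # assumed; keep whatever type your spec already uses
    weight: 3e-6  # lowered from 3e-5
  }
}
```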