I trained (detect-net-v2 with resnet-18) on single class containing 1,60,000 training images.
The Mean average precision (MAP) I got is almost 73%. while on same data in yolov4 I got 91% MAP (Mean average precision).
I Know that increase in layers is directly related to Mean average precision and accuracy, but I am working on deep-stream. I need (FPS+good detection), If I will increase layers so, my FPS will move down.