Low batch size during training

For bs in Faster-rcnn, refer to Faster RCNN ResNet-101 Problems - #10 by Morganh
ResNet101 is a huge backbone and can not fit into a single GPU with a large batch size like 16.

For SSD/Yolo, the bs can be set above 32.