Low batch size during training

newtume.123 · October 12, 2020, 11:37am

I just noticed something strange. Why is the batch size so small during training? With Faster RCNN (resnet50-101) I can only fit 1 image on a 2080 Ti at res=320. And for small detectors like SSD/Yolo with MobileNet, the batch size is about 16-32, which also seems quite small for such simple models.

Keep in mind that I unfroze all blocks. Here is my preprocessing config:

augmentation_config {
  preprocessing {
    output_image_width: 320
    output_image_height: 320
    output_image_channel: 3
    crop_right: 320
    crop_bottom: 320
    min_bbox_width: 1.0
    min_bbox_height: 1.0
  }

Morganh · October 13, 2020, 3:31am

For the batch size in Faster RCNN, refer to Faster RCNN ResNet-101 Problems - #10 by Morganh.
ResNet101 is a huge backbone and cannot fit into a single GPU with a large batch size such as 16.

For SSD/Yolo, the batch size can be set above 32.
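
As a rough sketch of where this is set: in the SSD/Yolo training specs, the per-GPU batch size normally sits in the training_config block. The field name batch_size_per_gpu and the value 32 below are assumptions taken from the sample spec files, so check them against the spec shipped with your toolkit version, and lower the value if the GPU runs out of memory.

training_config {
  # assumed field name; the other training_config fields are omitted here
  batch_size_per_gpu: 32
}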
