I have questions regarding some config fields for MaskRCNN:
data_config: it’s not clear to me whether I need to resize the input images to match image_size myself, or whether the resizing is done by TLT. The blog post Training Instance Segmentation Models Using Mask R-CNN on the NVIDIA Transfer Learning Toolkit says:
"Input images are resized and padded to image_size while keeping the aspect ratio."
To me, this indicates that TLT will resize the input images to match image_size. I also checked create_coco_tf_record.py: no resizing takes place before the TFRecord conversion, so the input images from COCO are not resized in advance.
I just want to double-check that TLT does the resizing (for the image, bbox, and mask annotations) as part of the training pipeline.
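To make the question concrete, here is a minimal sketch of what I understand "resized and padded while keeping the aspect ratio" to mean. This is only my own illustration, not TLT's actual code: the function names resize_and_pad_geometry and scale_coco_bbox are hypothetical, and I am assuming padding is added at the bottom and right and that bboxes are in COCO [x, y, width, height] format.

```python
def resize_and_pad_geometry(height, width, target_height, target_width):
    """Compute the scale and padding needed to fit an image into the target size.

    Assumption: the image is scaled by a single factor (aspect ratio preserved)
    so that it fits inside the target, then padded at the bottom and right.
    """
    scale = min(target_height / height, target_width / width)
    new_height = min(target_height, round(height * scale))
    new_width = min(target_width, round(width * scale))
    pad_bottom = target_height - new_height
    pad_right = target_width - new_width
    return scale, new_height, new_width, pad_bottom, pad_right


def scale_coco_bbox(bbox, scale):
    """Scale a COCO-style [x, y, width, height] box by the same factor as the image."""
    return [value * scale for value in bbox]


# Example: a 100x200 (height x width) image fitted into a 400x400 target.
scale, new_h, new_w, pad_bottom, pad_right = resize_and_pad_geometry(100, 200, 400, 400)
scaled_box = scale_coco_bbox([10, 20, 30, 40], scale)
```

In this example the width is the limiting side, so the image is scaled by 2 to 200x400 and padded with 200 rows at the bottom. If TLT does something like this internally, the bbox and mask annotations would need the same scale applied, which is the part I want to confirm.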
eval_samples = number of samples for evaluation -> is this the number of images from the training set to use for evaluation, or the size of the validation set? As an aside, are the losses printed during training computed from the training set or the validation set?
gt_mask_size = ground truth mask size -> could you please explain how I should set this value?