Hi,
I have questions regarding some config fields for maskRCNN.
In data_config
, it’s not clear to me if I need to resize input images to match image_size
or resizing is done by TLT, from this blog Training Instance Segmentation Models Using Mask R-CNN on the NVIDIA Transfer Learning Toolkit:
Input images are resized and padded to
image_size
while keeping the aspect ratio.
To me, this indicates that TLT will resize the input images to match image_size
, I also check out download_and_preprocess_coco.sh
and create_coco_tf_record.py
, no resizing takes place before the tf_record conversion so the input image from COCO are not resized in advance.
I just want to double check that TLT will do the resizing (for image, bbox & mask annotation) as part of the training pipeline.
eval_samples = number of samples for evaluation → is this the number images from the training set to use for evaluation or the size of the valuation set? As an aside question, does the losses print out during training computed from the training set or the validation set?
gt_mask_size = ground truth mask size, would you please explain how do I set this value?