Poor metric results after retraining maskrcnn using TLT notebook

Accelerated Computing Intelligent Video Analytics TAO Toolkit

Morganh September 10, 2020, 6:15am 17

For 2gpus, please try to trigger training as below spec. Per the latest result from Nvidia internal team, training with 2 gpus(V100), the AP can get 33.2 in the end.

seed: 123
use_amp: False
warmup_steps: 50000
checkpoint: “/workspace/tlt-experiments/mask_rcnn/resnet50.hdf5”
learning_rate_steps: “[360000, 540000]”
learning_rate_decay_levels: “[0.1, 0.01]”
total_steps: 720000
train_batch_size: 2
eval_batch_size: 8
num_steps_per_eval: 60000
momentum: 0.9
l2_weight_decay: 0.00002
warmup_learning_rate: 0.00001
init_learning_rate: 0.005

Training from scratch using TAO for maskrcnn

Maskrcnn.ipynb - followed notebook and ended up with poor (almost untrained) network from instructions

Mask R-CNN hangs during training using custom made tfrecords

Maskrcnn.ipynb - followed notebook and ended up with poor (almost untrained) network from instructions

Low accuracy for MS COCO dataset in tao maskrcnn model training

Topic		Replies	Views
MaskRCNN Input to reshape is a tensor with 3135248 values, but the requested shape has 2691200 TAO Toolkit	38	1124	May 9, 2023
Training doesn't converge for Mapillary Vistas Dataset training with MaskRCNN TAO Toolkit	47	1722	June 16, 2022
Maskrcnn.ipynb - followed notebook and ended up with poor (almost untrained) network from instructions TAO Toolkit	13	760	October 12, 2021
Faster RCNN ResNet-101 Problems TAO Toolkit	20	1125	October 12, 2021
Input to reshape is a tensor with 3067968 values, but the requested shape has 2691200 TAO Toolkit inception	2	19	January 16, 2025
Training Custom FasterRCNN resnet50 Object detection issue TAO Toolkit	9	1126	October 12, 2021
Training Instance Segmentation Models Using Mask R-CNN on the NVIDIA Transfer Learning Toolkit Technical Blog	3	1024	August 18, 2021
Error while re-training with custom dataset using tlt file- FasterRCNN TAO Toolkit	5	364	June 26, 2023
Mask rcnn poor results TAO Toolkit	4	1037	October 12, 2021
Faster RCNN on TLT 3.0 not learning the same as TLT 2.0 TAO Toolkit	15	1022	October 12, 2021

Poor metric results after retraining maskrcnn using TLT notebook

Related topics