Training Retinanet

orene.elmaleh · May 18, 2020, 6:46am

Hi,

I am facing some difficulties while trying to train Retinanet with my own Dataset.
Here the snapshot describing the error encountered:

In the training specification txt file, I changed the size of input image such that it corresponds to my data (640x352).

I would be pleased if someone could help me to fix this.

Morganh · May 18, 2020, 6:59am

There should be some issues from your tfrecords.
Please share the full log and spec when you run tlt-dataset-convert.

orene.elmaleh · May 18, 2020, 7:24am

The specification file retinanet_tfrecords_kitti_trainval.txt:

  root_directory_path: "/workspace/tlt-experiments/data/training"
  image_dir_name: "image_2"
  label_dir_name: "label_2"
  image_extension: ".jpg"
  partition_mode: "random"
  num_partitions: 2
  val_split: 14
  num_shards: 10
}
image_directory_path: "/workspace/tlt-experiments/data/training"

And the log of the cmd tlt-dataset-convert:

Morganh · May 18, 2020, 3:01pm

Your val image is too little, only 6.
Please refer to Training detectnet_v2 Issue

val_images >= num_shards
train_images >= num_shards

orene.elmaleh · May 18, 2020, 6:31pm

Thanks a lot, that just works!

Topic		Replies	Views
Invalid loss, terminating training TAO Toolkit	5	675	October 12, 2021
Resnet18 Object Detection Image Resolution Problem TAO Toolkit	6	1446	October 12, 2021
ZeroDivisionError when training peoplenet TAO Toolkit	10	591	October 12, 2021
Training error on resnet18 TAO Toolkit	9	695	October 12, 2021
Image size- DetectNet_v2 TAO Toolkit tao , inception	4	898	January 21, 2023
Retraining peoplenet model for detecting face and person only TAO Toolkit	4	360	October 12, 2021
Trafficcamnet detect car very low accuracy TAO Toolkit	18	805	January 4, 2022
When I use TLT, I get the following error TAO Toolkit	2	311	October 12, 2021
Dataset_convert error in tao TAO Toolkit	3	448	September 16, 2022
Error when using TLT TAO Toolkit	2	361	October 12, 2021

Training Retinanet

Related topics