i’m trying to train the YOLO v3 with tlt
i use 1080x720 input image for train
but i know the yolov3 input was 608 or 416
can i train with 1080x720 images?
and should i change the big_anchor_shape,mid_anchor_shape,… in train config file related to the input image size??
and i wonder which size should i use when i import my custom yolo v3 model to the deepstream
should i reszie to 608? or 1080x720 to the nvinfer?
Firstly, please check if your spec.txt is exactly the spec file during the training. |'m asking this because you set below in the spec
output_image_width: 1280
output_image_height: 960
but below training log just shows that you set 608x608.
Input (InputLayer) (64, 3, 608, 608) 0
Second,
If you want to train a 608x608 model, please double check you have already resized your images/labels to 608x608 offline.
Run kmeans.py to get the proper anchors_shapes against the label files.
Finetune the hyper-parameters. For example, set lower bs, finetune max_learning_rate.
Also,you can run small part of training dataset to finetune the hyper-parameters.