I plan to train a YOLOv4 model using the TAO Toolkit, but I have a few questions regarding the preprocessing of my images.
-
I will use the model on an HD video stream (1920x1080). Is it still OK to make the model's input dimensions smaller than that to shorten training time (for example, training the model with 1364x768 images)?
-
If my model's input dimensions have a different aspect ratio than 16:9, will the model see distorted images when I run inference on the HD video? Can this affect performance? (My understanding is that it will.)
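To make sure the question is clear, here is a small sketch of what I mean by distortion (the 1024x768 input dim is just a made-up example):

```python
# Sketch of the distortion I mean: scaling a 1920x1080 frame to a
# hypothetical non-16:9 input dim (1024x768 here, i.e. 4:3) stretches
# the two axes by different factors.
frame_w, frame_h = 1920, 1080   # HD video frame
input_w, input_h = 1024, 768    # made-up model input dim

scale_x = input_w / frame_w     # ~0.53
scale_y = input_h / frame_h     # ~0.71

print(f"x scale: {scale_x:.2f}, y scale: {scale_y:.2f}")
# Different factors, so objects end up squashed horizontally.
```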
-
I know that with YOLOv4, TAO resizes all input images during the augmentation process to match the model's input dimensions (distorting the images if necessary). But if I have images with various resolutions and aspect ratios, my understanding is that I should first make all my images the same resolution as the model's input dimensions, or at the very least give them the same aspect ratio, so that my training images don't get distorted. Is that correct? (A rough sketch of the kind of preprocessing I had in mind is below.)
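For context, one way I was considering to do that beforehand is to crop each image to the model aspect ratio and then resize it, so TAO's own resize no longer distorts anything. Just a rough OpenCV sketch, assuming a 1364x768 input dim; the KITTI label coordinates would obviously need the same crop offset and scale applied:

```python
import cv2

INPUT_W, INPUT_H = 1364, 768   # example model input dim
TARGET_AR = INPUT_W / INPUT_H

def crop_to_aspect_and_resize(src_path, dst_path):
    """Center-crop an image to the model aspect ratio, then resize it to
    the input dim, so no distortion happens during training.
    (The KITTI boxes would need the same crop offset / scale applied.)"""
    img = cv2.imread(src_path)
    h, w = img.shape[:2]
    if w / h > TARGET_AR:                 # too wide: crop the sides
        new_w = int(round(h * TARGET_AR))
        x0 = (w - new_w) // 2
        img = img[:, x0:x0 + new_w]
    else:                                 # too tall: crop top/bottom
        new_h = int(round(w / TARGET_AR))
        y0 = (h - new_h) // 2
        img = img[y0:y0 + new_h, :]
    out = cv2.resize(img, (INPUT_W, INPUT_H), interpolation=cv2.INTER_LINEAR)
    cv2.imwrite(dst_path, out)
```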
-
If yes, and I have an image that is originally smaller (or has one dimension smaller) than the input dimensions, what would be the best thing to do? Upscale the image enough that I can crop a portion the size of the input dimensions (potentially cutting out part of the annotated object), or is it possible to add padding to the image?
To be fair, both feel wrong. Is there a better solution, or should I just not include those images in my dataset?
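To show what I mean by the padding option, this is the kind of preprocessing I was imagining for the smaller images (just a sketch; I assume the KITTI boxes would simply have to be shifted by the same offsets):

```python
import cv2
import numpy as np

INPUT_W, INPUT_H = 1364, 768   # example model input dim

def pad_to_input_dim(src_path, dst_path):
    """Pad an image that is smaller than the input dim onto a constant-color
    canvas instead of upscaling + cropping it. The KITTI boxes would just
    need the (x_off, y_off) shift added to their coordinates."""
    img = cv2.imread(src_path)
    h, w = img.shape[:2]
    assert w <= INPUT_W and h <= INPUT_H, "only meant for smaller images"

    canvas = np.full((INPUT_H, INPUT_W, 3), 128, dtype=np.uint8)  # gray padding
    x_off = (INPUT_W - w) // 2
    y_off = (INPUT_H - h) // 2
    canvas[y_off:y_off + h, x_off:x_off + w] = img
    cv2.imwrite(dst_path, canvas)
    return x_off, y_off
```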