RGB Images of dimensions: 224 X 224 X 3 (W x H x C) Channel Ordering of the Input: NCHW, where N = Batch Size, C = number of channels (3), H = Height of images (224), W = Width of the images (224) Input scale: None Mean subtraction: [103.939, 116.779, 123.68]
so should i reshape my data according to this shape 224 X 224 X 3?
3.yolov4 gives many options for pretrained model is resnet18 is vehical-net?
For your case, please check which model size you want to train. See output_width and output_height you set in the training spec. Then, resize the bboxes in all the labels files. Then run kmeans to get the anchors.
The vehicleTypeNet is a classification model. It is not related to yolov4 model. May I know why you ask VehicleTypenet?
I cannot understand. May I know what are the “options” . The pretrained models in ngc are not related to vehicles. They are trained via OpenImage dataset.
These are pretrained models for different backbones. Please consider the combination of mAP and fps. It depends on your requirement. You can have a try for resnet18.