Which detection model will give more accuracy for aerial view image detection?

Hi samjith888,
As mentioned previously, you need to run experiments. To improve accuracy on small objects, the most common trick is to use a smaller set of anchors. The anchor sizes should be similar to the small objects’ sizes. Anchor ratios can be kept unchanged.
You can also start by training only two classes instead of all 5. Training with fewer classes helps narrow down the problem.
Mssing_P 25x23
Extra_P 26x13

Note that for the above sizes, since you change the resolution from 4096x2160 to 1024x544, you need to make the anchor sizes cover
25/4 x 23/4
26/4 x 13/4
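That is just a uniform downscale; as a quick sketch (class names and sizes are from the thread, and the factor is approximated as 4 on both axes since 4096/1024 = 4 and 2160/544 ≈ 3.97):

```python
# Object sizes in the original 4096x2160 images, taken from the thread
objects = {"Mssing_P": (25, 23), "Extra_P": (26, 13)}

# After resizing to 1024x544, both axes shrink by roughly 4x
factor = 4.0
scaled = {name: (w / factor, h / factor) for name, (w, h) in objects.items()}
print(scaled)  # {'Mssing_P': (6.25, 5.75), 'Extra_P': (6.5, 3.25)}
```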

How can I change the anchor sizes to cover the above object sizes?

anchor_box_config {
  scale: 8.0
  scale: 16.0
  scale: 32.0
  ratio: 1.0
  ratio: 0.5
  ratio: 2.0
}
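For context, here is a quick sketch (not from the thread) of the sizes these default scales generate, assuming anchors are built as w = scale·√ratio, h = scale/√ratio:

```python
import math

# Default scales/ratios from the config above (assumption about the
# anchor construction: w = scale*sqrt(ratio), h = scale/sqrt(ratio))
scales = [8.0, 16.0, 32.0]
ratios = [1.0, 0.5, 2.0]
anchors = [(s * math.sqrt(r), s / math.sqrt(r)) for s in scales for r in ratios]

# Every anchor has area scale**2, so the smallest is 8x8 = 64 px^2,
# roughly 3x the area of a 6.5x3.25 object after downscaling.
smallest = min(anchors, key=lambda wh: wh[0] * wh[1])
print(smallest)  # (8.0, 8.0)
```

This is why the default scales are too coarse for ~6-pixel objects and need to be lowered.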

Refer to the way in https://devtalk.nvidia.com/default/topic/1069737/transfer-learning-toolkit/which-detection-model-will-give-more-accuracy-for-arial-view-image-detection-/post/5420836/#5420836,
for the two small objects,
Mssing_P 25x23
Extra_P 26x13

since you change from 4096x2160 to 1024x544, the object sizes become
6.25x5.75
6.5x3.25

You can try

anchor_box_config {
  scale: 4.0
  scale: 4.6
  scale: 5.0
  ratio: 1.0
  ratio: 0.5
  ratio: 2.0
}

It can cover anchor sizes like
4x4, 2.828x5.656, 5.656x2.828 (scale 4.0)
4.6x4.6, 3.25x6.5, 6.5x3.25 (scale 4.6)
5x5, 3.535x7.07, 7.07x3.535 (scale 5.0)
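The list above can be re-derived in a few lines, assuming anchors are built as w = scale·√ratio, h = scale/√ratio:

```python
import math

scales = [4.0, 4.6, 5.0]
ratios = [1.0, 0.5, 2.0]

# Assumed anchor construction: w/h == ratio and w*h == scale**2
anchors = [(round(s * math.sqrt(r), 3), round(s / math.sqrt(r), 3))
           for s in scales for r in ratios]
print(anchors)  # nine (w, h) pairs, matching the sizes listed above up to rounding
```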

Can I use images with different resolutions for training?

Sure, you can change your original resolution to another one.
Then calculate the new pixel range of the small objects and set a better anchor_box_config accordingly.

Hi Morganh,

I meant that I have a dataset consisting of images with different resolutions (e.g. 4120x2240, 800x450, 1080x120, 300x250, etc.). So can I use this dataset for training? Or does TLT only accept uniformly sized images?

For the detectnet_v2 and SSD networks, all of the images must be resized offline to the final training size.
For faster-rcnn, you don’t need to resize the images.
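For the offline resize, here is a minimal sketch, assuming Pillow is installed and .jpg images (the folder names and function are hypothetical, not part of TLT):

```python
from pathlib import Path
from PIL import Image  # assumption: Pillow is installed

TARGET = (1024, 544)  # final training size from the thread


def resize_folder(src: Path, dst: Path, target=TARGET) -> int:
    """Resize every .jpg in src to the training size; returns the count."""
    dst.mkdir(parents=True, exist_ok=True)
    n = 0
    for img_path in sorted(src.glob("*.jpg")):
        with Image.open(img_path) as im:
            im.resize(target, Image.BILINEAR).save(dst / img_path.name)
        n += 1
    return n
```

Remember that the label files must be updated in the same pass: multiply x coordinates by target_w/orig_w and y coordinates by target_h/orig_h.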

See chapter 2 of the TLT user guide for more details.