NVIDIA TAO - detectnet_v2 - 0mAP problem

gglrthiru · October 17, 2021, 7:04am

whenever i train the detectnet_v2 architecture in visdrone dataset, I get 0mAP problem. I thought that this is because of wrong class_maps but, the classmaps are fine. The model isn’t learning anything. I have provided the link to drive where you can find files: tao_mounts.json, model_log, training_spec file

• Hardware RTX 2070 SUPER

• Network Type: DetectNet_v2

• TLT Version (Please run “tlt info --verbose” and share “docker_tag” here) 3.21.08

• Training spec file(If have, please share here): issues - Google Drive

• How to reproduce the issue ? (This is for errors. Please share the command line and the detailed log here.)

Download the visdrone dataset and convert it to tfrecords using dataset_convert tool
After converting, train the detectnet_v2 model on it, this causes a 0mAP problem and the model doesn’t learn anything

I followed the steps from the NVIDIA TAO documentation to train the detectnet_v2 model

Morganh · October 18, 2021, 7:39am

See DetectNet_v2 - NVIDIA Docs
For detectnet_v2 network,

The train tool does not support training on images of multiple resolutions. However, the dataloader does support resizing images to the input resolution defined in the specification file. This can be enabled by setting the enable_auto_resize parameter to true in the augmentation_config module of the spec file.

Does visdrone dataset have training images of multiple resolutions ?

gglrthiru · October 18, 2021, 8:01am

yeah it has images of multiple resolutions but i have enabled auto_resize parameter

Morganh · October 18, 2021, 8:03am

For detectnet_v2 network, the train tool does not support training on images of multiple resolutions.
Please resize images/labels offline to the same resolution.

gglrthiru · October 18, 2021, 8:11am

actually auto_resize does this right, IDK correct me if I’m wrong!

gglrthiru · October 18, 2021, 1:06pm

I tried this way too. But, the model didn’t learn anything and the same problem persists

Morganh · October 18, 2021, 3:46pm

Seems that the objects in visdrone dataset are very small. In this case, please modify evaluation_box_config and run evaluation again.

  evaluation_box_config {
    key: "car"
    value {
      minimum_height: 20
      maximum_height: 9999
      minimum_width: 10
      maximum_width: 9999
    }
  }

  evaluation_box_config {
    key: "pedestrian"
    value {
      minimum_height: 20
      maximum_height: 9999
      minimum_width: 10
      maximum_width: 9999
    }
  }

to

  evaluation_box_config {
    key: "car"
    value {
      minimum_height: 4
      maximum_height: 9999
      minimum_width: 4
      maximum_width: 9999
    }
  }

  evaluation_box_config {
    key: "pedestrian"
    value {
      minimum_height: 4
      maximum_height: 9999
      minimum_width: 4
      maximum_width: 9999
    }
  }

gglrthiru · October 19, 2021, 6:13am

No, I tried it just now and I got the same results, the model didn’t learn anything

The loss will be very less from the start and after finishing, if we check the status.json file, it will show 0mAP and 0 precision in all objects that the model was trained.

Morganh · October 19, 2021, 6:23am

Seems that the training is normal but evaluation is not.
Please change minimum_bounding_box_height to a lower value too. For example,

postprocessing_config {
  target_class_config {
    key: "car"
    value {
      clustering_config {
        clustering_algorithm: NMS
        coverage_threshold: 0.005
        nms_iou_threshold: 0.5
        nms_confidence_threshold: 0.5
        minimum_bounding_box_height: 4
      }
    }
  }

gglrthiru · October 19, 2021, 6:38am

I changed it but still the same problem. I have a doubt, does the box coords in the label txt files needs to be normalized.

This is the output after changing the post_process_config, still the model doesnt learn.

gglrthiru · October 19, 2021, 6:40am

The status.json file report for first two epochs

Morganh · October 19, 2021, 6:53am

As mentioned above, please resize the images and also labels.
After done, please check if the bboxes are correct.

system · November 9, 2021, 1:18am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
NVIDIA TAO - detectnet_v2 - 0mAP problem TAO Toolkit tensorrt , tensorflow , python , tao	4	607	November 9, 2021
0.0 average precision during a detectnet_v2 training TAO Toolkit	10	493	September 28, 2023
Getting 0 mAP for detectnet_v2 model over 150 epochs TAO Toolkit	14	55	January 11, 2025
mAP training several classes = 0.0 and so low with data custom using detectnet_v2 (resnet_18)) TAO Toolkit	33	482	February 1, 2024
Mean average precision of 0.00 for detectnet_v2 using Tao Toolkit TAO Toolkit	5	734	February 27, 2023
mAP=0 error TAO Toolkit tensorrt , ai-training	7	1371	October 12, 2021
How to train TAO Toolkit models on COCO Dataset? TAO Toolkit	8	608	May 23, 2023
Accuracy not improving even after changing the input dim of DetectnetV2 Tao TAO Toolkit tao , jetson	2	9	March 14, 2025
Error detectnet_V2 train with TAO : dbscan_min_samples: 0.05' TAO Toolkit tao	4	388	November 7, 2023
Used the pascalvoc dataset to train with detectnet_V2, but the accuracy is low TAO Toolkit	15	585	July 6, 2022

NVIDIA TAO - detectnet_v2 - 0mAP problem

Related topics