Detectnet_v2 training failure using Tao Toolkit on DeepStream

Please provide the following information when requesting support.

• Hardware (T4/V100/Xavier/Nano/etc): T1000
• Network Type (Detectnet_v2/Faster_rcnn/Yolo_v4/LPRnet/Mask_rcnn/Classification/etc): Detectnet_v2
• TLT Version (Please run “tlt info --verbose” and share “docker_tag” here)
• Training spec file(If have, please share here)
• How to reproduce the issue ? (This is for errors. Please share the command line and the detailed log here.)

I am training the detectnet_v2 network, but the log shows that the network is not learning: the mAP remains at 0.0% across all 120 epochs. Below are the training log file and the .txt configuration file used for training.

log_treinamento_02_06.txt (452.0 KB)
detectnet_v2_treinamento_fragmaq_spec.txt (9.3 KB)

The resolution of the training images is 960 x 544. I already tried training at 640 x 640, but that didn’t work either.

Can you run evaluation again with a lower minimum_bounding_box_height, for example 20?
What is the average object size in the training images? Is it possible to share an example? Also, are the objects too small?
Also, can you share the log from when you generated the tfrecords files? Since you are training 5 classes and there are only 120 training images in total, I need to check how many samples each class has in the evaluation dataset and the training dataset.
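To get those per-class counts directly from the label files, a minimal sketch could look like this (assuming KITTI-format .txt labels, where the class name is the first field of each line; the path in the example is hypothetical):

```python
import os
from collections import Counter

def count_class_instances(label_dir):
    """Count object instances per class across KITTI-format label files."""
    counts = Counter()
    for fname in os.listdir(label_dir):
        if not fname.endswith(".txt"):
            continue
        with open(os.path.join(label_dir, fname)) as f:
            for line in f:
                fields = line.split()
                if fields:  # first KITTI field is the class name
                    counts[fields[0].lower()] += 1
    return counts

# Example (path is hypothetical):
# print(count_class_instances("/home/renato/treinamento_feira_objects/labels"))
```

Running this over both the training and evaluation label folders shows immediately whether any of the 5 classes is under-represented.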

I managed to carry out the training with a 960 x 544 image dataset and with the following configuration:

detectnet_v2_treinamento_fragmaq_spec.txt (9.9 KB)

Soon after, I ran another training with a different dataset, keeping the same configuration parameters, and it didn’t work: every epoch returns 0 mAP for each object class. This is the training result:

log_train.txt (762.1 KB)

The dataset used for training has 950 training images and 424 test images, all at a resolution of 960 x 544. What do you think could be the cause? I already tried training with a pre-trained resnet10 network, but it didn’t give good results.

Could you set a lower minimum_bounding_box_height? For example, minimum_bounding_box_height: 4. Then run evaluation again.
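For reference, this parameter lives in the clustering_config for each target class in the postprocessing section of the training spec; a fragment could look like the one below (the class name and the other clustering values here are illustrative, not taken from your spec):

```
postprocessing_config {
  target_class_config {
    key: "car"
    value {
      clustering_config {
        coverage_threshold: 0.005
        dbscan_eps: 0.15
        dbscan_min_samples: 0.05
        minimum_bounding_box_height: 4
      }
    }
  }
}
```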

Hello, I ran a new training with minimum_bounding_box_height = 4, and the training didn’t work, so I trained again with this parameter set to 2; the resulting log is in the .txt file. If you have any other parameters to change and test, let me know.

log_train_16_06.txt (635.1 KB)
detectnet_v2_treinamento_fragmaq_spec.txt (9.9 KB)

May I know the tfrecords logs? Also, could you please share several training images along with their label files? Thanks.

Below is the log of the conversion of the images to tfrecords, and a .zip file with the images and labels (in .json format):

log_dataset_convert.txt (190.9 KB)
train.zip (48.0 MB)

The images were resized to 960 x 544, along with their labels.
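As a side note, the label coordinates must be scaled by the same factors as the image, or the boxes will no longer match the objects. A minimal stdlib-only sketch of that scaling step (assuming KITTI-format labels, with the 2D bbox xmin, ymin, xmax, ymax in fields 5–8; the function name is mine, not from TAO):

```python
def scale_kitti_labels(label_text, orig_w, orig_h, target_w=960, target_h=544):
    """Scale KITTI bbox fields from the original image size to the target size."""
    sx, sy = target_w / orig_w, target_h / orig_h
    out_lines = []
    for line in label_text.splitlines():
        fields = line.split()
        if len(fields) >= 8:
            # KITTI 2D bbox: xmin, ymin, xmax, ymax at 0-based indices 4-7
            for i, s in zip(range(4, 8), (sx, sy, sx, sy)):
                fields[i] = f"{float(fields[i]) * s:.2f}"
        out_lines.append(" ".join(fields))
    return "\n".join(out_lines)
```

For example, a box (100, 200, 300, 400) in a 1920 x 1088 image becomes (50, 100, 150, 200) after resizing to 960 x 544.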

Can you run evaluation with the same dataset used for training, to narrow down the issue?
I will also check further on my side.

Change to:

  validation_data_source: {
    tfrecords_path: "/home/renato/treinamento_feira_objects/dataset_convert/-fold-000-of-001-shard-*"
    image_directory_path: "/home/renato/treinamento_feira_objects"
  }

Also, please run the experiments in the 4.0.1 docker to narrow down:
docker run --runtime=nvidia -it --rm nvcr.io/nvidia/tao/tao-toolkit:4.0.1-tf1.15.5 /bin/bash
Then run the training command: detectnet_v2 train xxx

There has been no update from you for a while, so we assume this is no longer an issue and are closing this topic. If you need further support, please open a new one. Thanks.

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.