Error when using tao tool to train detectnet_v2 detection model

18981275647 · January 17, 2022, 11:44am

It still doesn’t save and the process just gets killed. I uploaded my data and configuration; can you help me see if it works?
test.tar.gz (1.0 MB)

It can be found normally on the configuration and the original dataset I uploaded, but not on the dataset. I don’t know the data there. Is this because my dataset has already appeared?

18981275647 · January 17, 2022, 12:50pm

I found it is caused by running out of memory, but for resnet18 training batch_size_per_gpu: 1 is already the lowest setting.

The computer’s system memory also fills up, and then the program is killed.

Morganh · January 17, 2022, 2:05pm

Can you upload your training spec file?

18981275647 · January 18, 2022, 2:24am

detectnet_v2_train_resnet18_kitti.txt (5.2 KB)

I found out last night that switching to the resnet10 model did not help either. It also overflowed both GPU memory and system memory, and the program was killed.

Morganh · January 18, 2022, 2:49am

In your case, training on 1070ti, please try to train a smaller model.
Change

output_image_width: 960
output_image_height: 544

to

output_image_width: 480
output_image_height: 272

Also, set below in augmentation_config

enable_auto_resize: true

18981275647 · January 18, 2022, 7:42am

After enabling the enable_auto_resize parameter, are the width and height set earlier not needed? Or is it meaningless?

Morganh · January 18, 2022, 7:47am

It is needed.
See https://docs.nvidia.com/tao/tao-toolkit/text/object_detection/detectnet_v2.html#training-the-model

For detectnet_v2, it does not support training on images of multiple resolutions. So, firstly, make sure all the training images have the same resolution.

Then if you train a model as below and set enable_auto_resize: true
output_image_width: 480
output_image_height: 272

then, it will resize your training images to 480x272.
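
As a rough aid for the "same resolution" requirement above, here is a minimal Python sketch that groups the PNG files in a folder by resolution. It is not part of TAO; the function names `png_size` and `group_by_resolution` are hypothetical, and it assumes the training images are PNG files (JPEG images would need a different reader).

```python
import struct
from collections import defaultdict
from pathlib import Path

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def png_size(path):
    """Read (width, height) from the IHDR chunk at the start of a PNG file."""
    with open(path, "rb") as f:
        header = f.read(24)
    if len(header) < 24 or header[:8] != PNG_SIGNATURE or header[12:16] != b"IHDR":
        raise ValueError(f"{path} is not a valid PNG file")
    # Width and height are two big-endian 32-bit integers at bytes 16..23.
    return struct.unpack(">II", header[16:24])


def group_by_resolution(image_dir):
    """Map each (width, height) found in image_dir to the PNG file names that have it."""
    groups = defaultdict(list)
    for path in sorted(Path(image_dir).glob("*.png")):
        groups[png_size(path)].append(path.name)
    return dict(groups)
```

If the returned dictionary has more than one key, the dataset mixes resolutions and the images would need to be brought to a single size before training.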

18981275647 · January 18, 2022, 8:25am

Thanks, I went through the docs and found this trick, I’ll try it again in the evening, thanks again!

After the images are automatically resized, will that affect the label annotations and make detection inaccurate?

Morganh · January 18, 2022, 8:32am

No.
Just need to make sure your original images/labels are correct.
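
To illustrate why a resize need not make correct labels inaccurate, here is a small Python sketch of the geometry. It is only an illustration, not TAO's actual implementation: it assumes the box coordinates are scaled by the same width and height factors as the image, and the function name `scale_bbox` is hypothetical.

```python
def scale_bbox(bbox, src_size, dst_size):
    """Scale a box (xmin, ymin, xmax, ymax) from src_size to dst_size.

    Both sizes are (width, height) in pixels. Assumes the labels are scaled
    by the same factors as the image (an assumption, not confirmed TAO code).
    """
    sx = dst_size[0] / src_size[0]  # horizontal scale factor
    sy = dst_size[1] / src_size[1]  # vertical scale factor
    xmin, ymin, xmax, ymax = bbox
    return (xmin * sx, ymin * sy, xmax * sx, ymax * sy)


# A box on a 960x544 image mapped to the 480x272 training size.
print(scale_bbox((100, 200, 300, 400), (960, 544), (480, 272)))
# -> (50.0, 100.0, 150.0, 200.0)
```

The box covers the same relative region of the image before and after the resize, so the labels stay consistent as long as the original annotations are correct.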

18981275647 · January 19, 2022, 2:16pm

OK, but retraining fails with an error.

detectnet_v2_retrain_resnet18_kitti.txt (5.3 KB)

18981275647 · January 20, 2022, 3:06am

Can you help me take a look again?

Morganh · January 20, 2022, 3:53am

Please run with a new result folder.

18981275647 · January 22, 2022, 4:53am

Thank you.

system · February 5, 2022, 4:54am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.
