Error when using tao tool to train detectnet_v2 detection model

Please provide the following information when requesting support.

• Hardware 1070ti
• Network Type Detectnet_v2/
• TLT Version 3.x

Error when using tao tool to train detectnet_v2 detection model
Not found: /home/ncp/tao/cv_samples_v1.2.0/detectnet_v2/dataset/images/
/home/ncp/tao/cv_samples_v1.2.0/detectnet_v2/dataset/images/smoke_17982.png; No such file or directory

There is no png file in my dataset, I don’t know what is wrong, I have completed the tf format conversion
tensorflow.python.framework.errors_impl.NotFoundError: 2 root error(s) found.
(0) Not found: /home/ncp/tao/cv_samples_v1.2.0/detectnet_v2/dataset/images//home/ncp/tao/cv_samples_v1.2.0/detectnet_v2/dataset/images/smoke_17982.jpg; No such file or directory

I’m sure the file path is correct

Many topics are asking this.
Please check the your ~/.tao_mounts.json.
The path xxx should be a path inside the docker when your run command "tao detectnet_v2 xxx "

prototxt file

tao.json

spec tran.txt

What is the command ? Can you share?

tao detectnet_v2 train -k nvidia -r /home/ncp/tao/cv_samples_v1.2.0/detectnet_v2/result/ -e /home/ncp/tao/cv_samples_v1.2.0/detectnet_v2/specs/detectnet_v2_train_resnet18_kitti.txt

Can you run below and share the log?
$ tao detectnet_v2 run ls /home/ncp/tao/cv_samples_v1.2.0/detectnet_v2/specs/detectnet_v2_train_resnet18_kitti.txt


No relevant logs found

The problem I found is that he automatically added an images directory after the directory, but I didn’t add it, the system automatically added an extra images directory

Not found: /home/ncp/tao/cv_samples_v1.2.0/detectnet_v2/dataset/images/images/smoke_25325.jpg

According to above error log, the image path is not available.
Please run below command to login the docker and then debug by yourself.
$ tao detectnet_v2 run /bin/bash
# detectnet_v2 train -k nvidia -r /home/ncp/tao/cv_samples_v1.2.0/detectnet_v2/result/ -e /home/ncp/tao/cv_samples_v1.2.0/detectnet_v2/specs/detectnet_v2_train_resnet18_kitti.txt

still so
Not found: /home/ncp/tao/cv_samples_v1.2.0/detectnet_v2/dataset/images/images/smoke_25325.jpg; No such file or directory
You look at my config file above, isn’t it there?

data format

Please run below command to login the docker and then debug by yourself.
$ tao detectnet_v2 run /bin/bash

Seems that you did not login.

I created this directory manually again but it still doesn’t work

Can you check under
@6a4b7055d229:/workspace$ ls /home/ncp

After I change the path to the config file, it doesn’t save the file anymore, but the program freezes for 1 minute and then just kills


this is file
test.prototxt (332 Bytes)
detectnet_v2_train_resnet18_kitti.txt (5.2 KB)

Hey,
I suggest you to run all steps inside the docker for generating tfrecords and training.

$ tao detectnet_v2 run /bin/bash (login the docker)

Then inside the docker, firstly, please make sure all your dataset are available.
@6a4b7055d229:/workspace$ ls /home/ncp/tao/cv_samples_v1.2.0/dataset/images/ (make sure you find your dataset inside the docker)

Then, generate tfrecords.
@6a4b7055d229:/workspace$ detectnet_v2 dataset-convert xxx

Then, run training
@6a4b7055d229:/workspace$ detectnet_v2 train xxx

or not foun error

then look this

Hi,
Have you found your training images inside the docker?

yes am sure,you can take a look at this screenshot of mine, the last time I ran the training, it was possible to start training, but as soon as it was started, it was killed directly

also, I can upload a few training images, label files and configuration files for you, can you help me?

If you already find your training images, please try to generate the tfrecord files again inside the docker.
I am afraid there is something wrong in this step.