Error in TAO-Toolkit while training

aksinghsubscriptions · December 16, 2021, 1:47pm

I am trying to train ActionRecognitionNet inside the TAO Toolkit container by following the NVIDIA blog post for ActionRecognitionNet.

I started the container with the following command on my personal machine:

docker run --net=host -ti -v /var/run/docker.sock:/var/run/docker.sock --gpus=all -e DISPLAY=$DISPLAY nvcr.io/nvidia/tao/tao-toolkit-pyt:v3.21.11-py3

Inside the container, I was able to follow the Jupyter notebook mentioned in the blog up to the training step. When I run the following command:

tao action_recognition train \
    -e /workspace/specs/train_rgb_3d_finetune.yaml \
    -r $RESULTS_DIR/rgb_3d_ptm \
    -k $KEY \
    model_config.rgb_pretrained_model_path=$RESULTS_DIR/pretrained/actionrecognitionnet_vtrainable_v1.0/resnet18_3d_rgb_hmdb5_32.tlt \
    model_config.rgb_pretrained_num_classes=5

I get the following error:

Traceback (most recent call last):
  File "/opt/conda/bin/tao", line 8, in <module>
    sys.exit(main())
  File "/opt/conda/lib/python3.8/site-packages/tlt/entrypoint/entrypoint.py", line 113, in main
    local_instance.launch_command(
  File "/opt/conda/lib/python3.8/site-packages/tlt/components/instance_handler/local_instance.py", line 296, in launch_command
    docker_logged_in(required_registry=self.task_map[task].docker_registry)
  File "/opt/conda/lib/python3.8/site-packages/tlt/components/instance_handler/utils.py", line 129, in docker_logged_in
    data = load_config_file(docker_config)
  File "/opt/conda/lib/python3.8/site-packages/tlt/components/instance_handler/utils.py", line 64, in load_config_file
    assert os.path.exists(config_path), (
AssertionError: Config path must be a valid unix path. No file found at: /root/.docker/config.json. Did you run docker login?

Note that I can run docker login nvcr.io on my host system, but I can't do the same inside this container. When I try, I get this error:

root@predator:/workspace/tlt/samples# docker login nvcr.io
bash: docker: command not found

• Hardware Platform (Jetson / GPU) 1050Ti
• DeepStream Version 6.0
• JetPack Version (valid for Jetson only)
• TensorRT Version
• NVIDIA GPU Driver Version (valid for GPU only) 470.42.01
• Issue Type( questions, new requirements, bugs) bug
• How to reproduce the issue ? (This is for bugs. Including which sample app is using, the configuration files content, the command line used and other details for reproducing)
• Requirement details( This is for new requirement. Including the module name-for which plugin or for which sample application, the function description)

Morganh · December 17, 2021, 6:47am

You did not launch TAO through the tao launcher. Instead, you started the TAO docker container directly with the command below:

docker run --net=host -ti -v /var/run/docker.sock:/var/run/docker.sock --gpus=all -e DISPLAY=$DISPLAY nvcr.io/nvidia/tao/tao-toolkit-pyt:v3.21.11-py3

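So inside the container, please run action_recognition train directly instead of tao action_recognition train. As the traceback shows, the tao command is the launcher entry point, which checks for a Docker login (/root/.docker/config.json) before starting a container. That check is not needed when you are already inside the container.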
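For reference, here is a rough sketch of what the training call from the original post might look like once the tao prefix is dropped. The flags and paths are copied from the question. The default values for RESULTS_DIR and KEY are placeholders, and the run helper with its DRY_RUN switch is a hypothetical convenience added here so the command can be inspected before it is executed. None of these are part of TAO itself.

```shell
#!/bin/sh
# Placeholders (assumptions): the notebook normally sets these already.
RESULTS_DIR="${RESULTS_DIR:-/results}"
KEY="${KEY:-your_key}"

# Hypothetical helper: print the command when DRY_RUN=1 (the default here),
# execute it when DRY_RUN=0.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# Same arguments as in the question, but without the leading "tao".
run action_recognition train \
  -e /workspace/specs/train_rgb_3d_finetune.yaml \
  -r "$RESULTS_DIR/rgb_3d_ptm" \
  -k "$KEY" \
  model_config.rgb_pretrained_model_path="$RESULTS_DIR/pretrained/actionrecognitionnet_vtrainable_v1.0/resnet18_3d_rgb_hmdb5_32.tlt" \
  model_config.rgb_pretrained_num_classes=5
```

With DRY_RUN left at its default the script only prints the command. Setting DRY_RUN=0 inside the container would actually start training.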

system · January 4, 2022, 2:26am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.
