AssertionError: Config path must be a valid unix path. TAO TOOLKIT TFRECORD CONVERT

user82614 · February 10, 2023, 11:54am

• Hardware (A6000
• Network Type (Detectnet_v2)
• TLT Version 4 ? (Please run “tlt info --verbose” and share “docker_tag” here - this doesn’t work )

Python Version 3.7
• Docker version > 20

Hello all,

I’ve been following the ‘getting started’ with tao toolkit along with the setup video. I’ve opened up detectnetv2 from the tao_launcher_starter_kit and was going through the script until I reached a problem around tfrecord convert :

Creating a new directory for the output tfrecords dump.

print(“Converting Tfrecords for kitti trainval dataset”)
!mkdir -p $LOCAL_DATA_DIR/tfrecords && rm -rf $LOCAL_DATA_DIR/tfrecords/*
!tao detectnet_v2 dataset_convert
-d $SPECS_DIR/detectnet_v2_tfrecords_kitti_trainval.txt
-o $DATA_DOWNLOAD_DIR/tfrecords/kitti_trainval/kitti_trainval

The output I get from this is:

Converting Tfrecords for kitti trainval dataset
Traceback (most recent call last):
File “/home/cymru/miniconda3/bin/tao”, line 8, in
sys.exit(main())
File “/home/cymru/miniconda3/lib/python3.7/site-packages/tlt/entrypoint/tao.py”, line 116, in main
args[1:]
File “/home/cymru/miniconda3/lib/python3.7/site-packages/tlt/components/instance_handler/local_instance.py”, line 296, in launch_command
docker_logged_in(required_registry=self.task_map[task].docker_registry)
File “/home/cymru/miniconda3/lib/python3.7/site-packages/tlt/components/instance_handler/utils.py”, line 137, in docker_logged_in
data = load_config_file(docker_config)
File “/home/cymru/miniconda3/lib/python3.7/site-packages/tlt/components/instance_handler/utils.py”, line 74, in load_config_file
“No file found at: {}. Did you run docker login?”.format(config_path)
AssertionError: Config path must be a valid unix path. No file found at: /home/cymru/.docker/config.json. Did you run docker login?

So I have tried the following:

docker login nvcr.io - the log in was sucessfull
I ran the above in both virtual environment and root user - I still get the same error output.
I tried locating the .docker/config.json file and I cannot see it, I’m not sure if it even exists

May I kindly have help in understanding what I need to do?

Thank you :)

Morganh · February 10, 2023, 4:28pm

Refer to TAO BodyPoseNet - Did you run docker login? - #3 by taoonvision and Error in Generating TFrecords for yolov4 - #24 by Morganh

user82614 · February 13, 2023, 9:23am

Hello,

To provide a bit of context, I’m trying to run tao tookit using the launcher cli (stated as option 1 on this website : TAO Toolkit Getting Started | NVIDIA NGC)

Regarding the links you have sent me:

I looked at the first link regarding using sudo chown.

The message I get after running the first command is:

chown: cannot access ‘/home/cymru/.docker’: No such file or directory

The second link I assume is looking at running tao toolkit directly from a container. But I assume I do not need to do this if I’m using the getting-started juypter notebook and downloading the tao toolkit via pip3?

UPDATE: I got rid of the message by following the docker ce post installation steps to run without sudo then I did the docker login nvcr.io again. It then finally generated the .docker folder in my home directory for me. It’s now carrying on as normal.

user82614 · February 13, 2023, 2:39pm

A quick question,

How do i find the tao docker instance?

Because i do not have any containers running when i go through the launcher cli option in this getting started notwbook. I typed ‘docker container ls’ and I don’t see the tao toolkit running , so I’m just wondering how I can find the directory for the tao docker. I know i specified it in the tao_mounts.json file but don’t know where to find it.

Thanks

Morganh · February 14, 2023, 2:45am

You can find the latest docker in
TAO Toolkit | NVIDIA NGC and old docker in TAO Toolkit for Computer Vision | NVIDIA NGC

For your error log, please try to add -v /var/run/docker.sock:/var/run/docker.sock

For example,
$docker run --runtime=nvidia -it -v /var/run/docker.sock:/var/run/docker.sock --entrypoint "" nvcr.io/nvidia/tao/tao-toolkit:4.0.0-tf1.15.5

user82614 · February 14, 2023, 10:13am

Thanks - but do I really need to run that command? Since I was able to download and train the model successfully without needing to do that.

I have a different problem this morning when I came back to my computer. I’m now evaluating the trained model and I get the following output:

2023-02-14 09:39:05,194 [INFO] root: Registry: ['nvcr.io']
2023-02-14 09:39:05,233 [INFO] tlt.components.instance_handler.local_instance: Running command in container: nvcr.io/nvidia/tao/tao-toolkit:4.0.0-tf1.15.5
Docker instantiation failed with error: 500 Server Error: Internal Server Error ("failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'
nvidia-container-cli: initialization error: nvml error: driver/library version mismatch: unknown")

so… I’ve typed nvidia-smi and I get the following output:

Failed to initialize NVML: Driver/library version mismatch

and I ran sudo docker run --rm --runtime=nvidia --gpus all nvidia/cuda:11.6.2-base-ubuntu20.04 nvidia-smi and I get the following output:

docker: Error response from daemon: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'
nvidia-container-cli: initialization error: nvml error: driver/library version mismatch: unknown.

Funnily enough this has never happened before as it was working perfectly until now.

I’ve also ran dpkg -l grep nvidia and ls -l /usr/lib/x86_64-linux-gnu/*nvidia and a whole other commands to understand the version number i have. Please see the log file attached.

command_log (15.2 KB)

Do you mind explaining what went wrong and which commands I need to run through to get this up and running again please?

Thanks so much for your help

Morganh · February 14, 2023, 4:40pm

Suggest to re-install driver.

sudo apt purge nvidia-driver-520
sudo apt autoremove
sudo apt autoclean

sudo apt install nvidia-driver-520

user82614 · February 20, 2023, 2:01pm

Thank you, it’s working now :)

system · March 6, 2023, 2:02pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Issue with Docker instantiation while converting Tfrecords for KITTI trainval dataset in TAO Toolkit TAO Toolkit docker	4	405	April 8, 2024
AssertionError: Config path must be a valid unix path. No file found at: /root/.docker/config.json. Did you run docker login? TAO Toolkit tao	11	1975	July 6, 2022
Error in TAO-Toolkit while training TAO Toolkit	2	1112	January 4, 2022
Tlt.components.docker_handler.docker_handler: Stopping container TAO Toolkit	19	1202	July 6, 2022
Error while pulling container tao-toolkit:5.0.0-tf1.15.5 TAO Toolkit	10	457	March 9, 2024
AssertionError: Config path must be a valid unix path. No file found at: /root/.docker/config.json. Did you run docker login? TAO Toolkit	8	559	July 13, 2023
docker.errors.ImageNotFound after follow "nvidia/tao/cv_samples:v1.4.1" TAO Toolkit	12	454	November 13, 2022
Docker instantiation fails when running "tao detectnet_v2" on Xavier NX Jetson AGX Xavier docker	5	556	October 5, 2022
Docker - No such container TAO Toolkit	7	61	March 10, 2025
TAO 5.3 docker error - Not supported URL scheme http+docker (requests 2.31.0) TAO Toolkit	5	798	July 14, 2024

AssertionError: Config path must be a valid unix path. TAO TOOLKIT TFRECORD CONVERT

Creating a new directory for the output tfrecords dump.

Related topics