Docker instantiation failed with error (TAO Toolkit - Yolo_v4_tiny)

abdel-karim.abdel-karim · March 7, 2023, 9:13am

Hello,
I was trying to follow the TAO Toolkit Quick Start Guide and want to train a yolo-v4-tiny network.
I’ve opened Jupyter notebook and I’ve finished the re-training procedure of the network, everything was fine yet when I’ve runned the Evaluate retrained model section :

!tao yolo_v4_tiny evaluate -e $SPECS_DIR/yolo_v4_tinu_retrain_kitti.txt
-m $USER_EXPERIMENT_DIR/experiment_dir_retrain/weights/yolov4_cspdarknet_tiny_epoch_$EPOCH.tlt
-k $KEY

I’ve received the following output :

2023-03-07 10:05:42, 107 [INFO] root: Registry: [‘nvcr.io’]
2023-03-07 10:05:42, 218 [INFO] tlt.components.instance_handler.local_instance: Running command in container: nvcr.io/nvidia/tao/tao-toolkit:4.0.0-tf1.15.5
2023-03-07 10:05:42, 317 [WARNING] tlt.components.docker_handler.docker_handler: Docker will run the commands as root. If you would like to retain your local host permissions, please add the “user”: “UID:GID” in the DockerOptions portion of the “/home/user/.tao_mounts.json” file. You can obtain your users UID and GID by using the “id -u” and “id -g” commands on the terminal.
Docker instantiation failed with error: 500 Server Error: Internal Server Error (“failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status1, stdout: , stderr: Auto-detected mode as ‘legacy’
nvidia-container-cli: initialization error: nvml error: driver/library version mismatch: unknown”)

Thank you for your help!

Morganh · March 7, 2023, 2:04pm

$ sudo apt purge nvidia* libnvidia*
$ sudo apt install nvidia-driver-520 nvidia-container-toolkit

Refer to Problems of telemetry using detectnet_v2 using tao toolkit - #57 by usuario3602

system · March 21, 2023, 2:05pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Docker instantiation failed with error TAO Toolkit	8	437	September 5, 2023
No CUDA-capable device is detected - yolov4 TAO Toolkit	10	195	August 16, 2024
TAO 5.3 docker error TAO Toolkit cuda , yolo , tao	7	430	June 25, 2024
TAO yolov4_tiny training fails with error TAO Toolkit	4	584	February 2, 2023
No CUDA-capable device is detected TAO Toolkit cuda , tao	9	98	February 17, 2025
Run Tao inside docker TAO Toolkit docker , tao	4	1754	February 20, 2022
Not supported URL scheme http+docker TAO Toolkit	5	1368	June 11, 2024
Dataset_convert tool is running properly but the TFrecords aren't getting created in output folder TAO Toolkit	32	1690	May 10, 2022
Yolo_v4_tiny randomly stops docker container during second or third validation phase with no errors TAO Toolkit yolo	20	921	August 29, 2022
Docker instantiation fails when running "tao detectnet_v2" on Xavier NX Jetson AGX Xavier docker	5	567	October 5, 2022

Docker instantiation failed with error (TAO Toolkit - Yolo_v4_tiny)

Related topics