Error running tao container image

Please provide the following information when requesting support.

• Hardware (T4/V100/Xavier/Nano/etc)
• Network Type (Detectnet_v2/Faster_rcnn/Yolo_v4/LPRnet/Mask_rcnn/Classification/etc)
• TLT Version (Please run “tlt info --verbose” and share “docker_tag” here)
• Training spec file(If have, please share here)
• How to reproduce the issue ? (This is for errors. Please share the command line and the detailed log here.)

I am running the newest version of TAO on a T4 machine. Everything was working fine until today, when I suddenly started getting the error below:

Simple command to reproduce: “docker run

Error happens on my RTX3060 machine as well.

Trying to run a training session from the launcher with “tao classification …” causes the container to exit instantly.

Did something change?

br, Mathias

I can reproduce this issue. We are looking into it.

Please try the workaround below.

$ docker run --runtime=nvidia -it --rm --entrypoint "" /bin/bash
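For anyone scripting this workaround, here is a minimal Python sketch of the same command built as an argument list. The function name `build_docker_command` and the `image` parameter are hypothetical, and the image tag is deliberately left to the caller since it depends on your TAO install. The point it illustrates is that `--entrypoint` is followed by an explicit empty string, which overrides the entrypoint baked into the image.

```python
def build_docker_command(image, shell="/bin/bash"):
    """Build the workaround command as an argument list.

    `image` is whatever TAO container image you normally run; it is a
    caller-supplied value here, not something defined in this thread.
    """
    return [
        "docker", "run",
        "--runtime=nvidia",
        "-it", "--rm",
        # The empty string overrides the image's built-in entrypoint.
        "--entrypoint", "",
        image,
        shell,
    ]
```

The resulting list can be passed to `subprocess.run(cmd, check=True)`. Using a list rather than a single shell string keeps the empty entrypoint argument from being dropped by shell quoting.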

The 2nd workaround is for users who want to use the tao launcher instead of “docker run”.

  1. Add "entrypoint": "" to ~/.tao_mounts.json
          "entrypoint": "" ,
          "shm_size": "16G",
  2. Modify lib/python3.6/site-packages/tao/components/docker_handler/ . This file should be available when you install nvidia-tao. Change

VALID_DOCKER_ARGS = ["user", "ports", "shm_size", "ulimits", "privileged", "network"]

to

VALID_DOCKER_ARGS = ["user", "ports", "shm_size", "ulimits", "privileged", "network", "entrypoint"]
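If you would rather not edit ~/.tao_mounts.json by hand, step 1 can be sketched in Python roughly as follows. This assumes the docker options (where "shm_size" lives) sit under a top-level "DockerOptions" key; check your own file and adjust the key if it differs. The helper name `add_empty_entrypoint` is made up for this example.

```python
import json
from pathlib import Path


def add_empty_entrypoint(mounts_path):
    """Set "entrypoint": "" in the docker options of a TAO mounts file."""
    path = Path(mounts_path).expanduser()
    config = json.loads(path.read_text())

    # Assumption: options such as "shm_size" are stored under "DockerOptions".
    options = config.setdefault("DockerOptions", {})
    options["entrypoint"] = ""

    # Write the file back, keeping all other settings untouched.
    path.write_text(json.dumps(config, indent=4) + "\n")
    return config
```

Calling `add_empty_entrypoint("~/.tao_mounts.json")` would update the file in place, so keep a backup copy first. Step 2 (the `VALID_DOCKER_ARGS` change) still has to be done separately.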


Thanks for the swift reply,

Workarounds are good,

