Followed all the steps to successfully install tltv3 and the launcher to a virtual enviroment with python version 3.6.13 and docker version 19.03.6. I configured the launcher with mount and docker options and created a spec file for a pretrained resnet18 classification model downloaded from ngc. Then i tried to run a tlt classification (command shown below), it appears the launcher creates a docker container instance which then immediately stops and closes (output shown below). I tried running with different spec files and pretrained models but the container always stops right after being launched regardless of what arguments are given. I also did a ‘tlt classification run ls’ to make sure my paths are mounting correctly and everything lines up in my spec file and my tlt command. Also tried giving a path to the the --log_file argument but I get the same output of the docker container stopping to stdout and no file writes out. Hope you can help, let me know if you need any other info from me!
tlt classification train -e /workspace/experiments/configs/resnet18.txt -k tlt_encode -r /workspace/experiments/output/3152021 --gpus 1
2021-03-16 23:23:24,444 [INFO] tlt.components.docker_handler.docker_handler: Stopping container.