Please provide the following information when requesting support.
• Hardware (tesla P4)
• Network Type (Detectnet_v2)
• TLT Version (Configuration of the TAO Toolkit Instance
task_group: [‘model’, ‘dataset’, ‘deploy’]
format_version: 3.0
toolkit_version: 5.1.0
published_date: 10/10/2023)
• Training spec file(-rw-rw-r-- 1 glueck glueck 3384 Dis 21 2022 detectnet_v2_train_resnet18_kitti.txt
-rw-rw-r-- 1 glueck glueck 303 Dis 21 2022 detectnet_v2_tfrecords_kitti_trainval.txt
-rw-rw-r-- 1 glueck glueck 3378 Dis 21 2022 detectnet_v2_retrain_resnet18_kitti.txt
-rw-rw-r-- 1 glueck glueck 3372 Dis 21 2022 detectnet_v2_retrain_resnet18_kitti_qat.txt
-rw-rw-r-- 1 glueck glueck 1456 Dis 21 2022 detectnet_v2_inference_kitti_tlt.txt
-rw-rw-r-- 1 glueck glueck 1485 Dis 21 2022 detectnet_v2_inference_kitti_etlt.txt
-rw-rw-r-- 1 glueck glueck 1498 Dis 21 2022 detectnet_v2_inference_kitti_etlt_qat.txt)
• How to reproduce the issue ? (This is for errors. Please share the command line and the detailed log here.)
Creating a new directory for the output tfrecords dump.
print(“Converting Tfrecords for kitti trainval dataset”)
!mkdir -p $LOCAL_DATA_DIR/tfrecords && rm -rf $LOCAL_DATA_DIR/tfrecords/*
!tao model detectnet_v2 dataset_convert
-d $SPECS_DIR/detectnet_v2_tfrecords_kitti_trainval.txt
-o $DATA_DOWNLOAD_DIR/tfrecords/kitti_trainval/kitti_trainval
-r $USER_EXPERIMENT_DIR/
Converting Tfrecords for kitti trainval dataset
2023-12-07 16:39:57,697 [TAO Toolkit] [INFO] root 160: Registry: [‘nvcr.io’]
2023-12-07 16:39:57,829 [TAO Toolkit] [INFO] nvidia_tao_cli.components.instance_handler.local_instance 360: Running command in container: nvcr.io/nvidia/tao/tao-toolkit:5.0.0-tf1.15.5
2023-12-07 16:39:57,870 [TAO Toolkit] [INFO] nvidia_tao_cli.components.docker_handler.docker_handler 275: Printing tty value True
2023-12-07 08:39:58.722931: I tensorflow/stream_executor/platform/default/dso_loader.cc:50] Successfully opened dynamic library libcudart.so.12
2023-12-07 08:39:58,774 [TAO Toolkit] [WARNING] tensorflow 40: Deprecation warnings have been disabled. Set TF_ENABLE_DEPRECATION_WARNINGS=1 to re-enable them.
Using TensorFlow backend.
2023-12-07 08:40:00,255 [TAO Toolkit] [WARNING] tensorflow 43: TensorFlow will not use sklearn by default. This improves performance in some cases. To enable sklearn export the environment variable TF_ALLOW_IOLIBS=1.
2023-12-07 08:40:00,291 [TAO Toolkit] [WARNING] tensorflow 42: TensorFlow will not use Dask by default. This improves performance in some cases. To enable Dask export the environment variable TF_ALLOW_IOLIBS=1.
2023-12-07 08:40:00,295 [TAO Toolkit] [WARNING] tensorflow 43: TensorFlow will not use Pandas by default. This improves performance in some cases. To enable Pandas export the environment variable TF_ALLOW_IOLIBS=1.
2023-12-07 08:40:01,659 [TAO Toolkit] [WARNING] matplotlib 500: Matplotlib created a temporary config/cache directory at /tmp/matplotlib-na53nkbb because the default path (/.config/matplotlib) is not a writable directory; it is highly recommended to set the MPLCONFIGDIR environment variable to a writable directory, in particular to speed up the import of Matplotlib and to better support multiprocessing.
2023-12-07 08:40:01,876 [TAO Toolkit] [INFO] matplotlib.font_manager 1633: generated new fontManager
WARNING:tensorflow:Deprecation warnings have been disabled. Set TF_ENABLE_DEPRECATION_WARNINGS=1 to re-enable them.
Using TensorFlow backend.
WARNING:tensorflow:TensorFlow will not use sklearn by default. This improves performance in some cases. To enable sklearn export the environment variable TF_ALLOW_IOLIBS=1.
2023-12-07 08:40:03,490 [TAO Toolkit] [WARNING] tensorflow 43: TensorFlow will not use sklearn by default. This improves performance in some cases. To enable sklearn export the environment variable TF_ALLOW_IOLIBS=1.
WARNING:tensorflow:TensorFlow will not use Dask by default. This improves performance in some cases. To enable Dask export the environment variable TF_ALLOW_IOLIBS=1.
2023-12-07 08:40:03,526 [TAO Toolkit] [WARNING] tensorflow 42: TensorFlow will not use Dask by default. This improves performance in some cases. To enable Dask export the environment variable TF_ALLOW_IOLIBS=1.
WARNING:tensorflow:TensorFlow will not use Pandas by default. This improves performance in some cases. To enable Pandas export the environment variable TF_ALLOW_IOLIBS=1.
2023-12-07 08:40:03,532 [TAO Toolkit] [WARNING] tensorflow 43: TensorFlow will not use Pandas by default. This improves performance in some cases. To enable Pandas export the environment variable TF_ALLOW_IOLIBS=1.
2023-12-07 08:40:04,010 [TAO Toolkit] [INFO] nvidia_tao_tf1.cv.common.logging.logging 197: Log file already exists at /workspace/tao-experiments/detectnet_v2/status.json
2023-12-07 08:40:04,010 [TAO Toolkit] [INFO] root 2102: Starting Object Detection Dataset Convert.
2023-12-07 08:40:04,011 [TAO Toolkit] [INFO] root 2102: [Errno 2] No such file or directory: ‘/home/glueck/getting_started_v5.0.0/notebooks/tao_launcher_starter_kit/penang_port/specs/detectnet_v2_tfrecords_kitti_trainval.txt’
Traceback (most recent call last):
File “/usr/local/lib/python3.8/dist-packages/nvidia_tao_tf1/cv/detectnet_v2/scripts/dataset_convert.py”, line 168, in
raise e
File “/usr/local/lib/python3.8/dist-packages/nvidia_tao_tf1/cv/detectnet_v2/scripts/dataset_convert.py”, line 137, in
main()
File “/usr/local/lib/python3.8/dist-packages/nvidia_tao_tf1/cv/detectnet_v2/scripts/dataset_convert.py”, line 128, in main
with open(expand_path(args.dataset_export_spec), “r”) as f:
FileNotFoundError: [Errno 2] No such file or directory: ‘/home/glueck/getting_started_v5.0.0/notebooks/tao_launcher_starter_kit/penang_port/specs/detectnet_v2_tfrecords_kitti_trainval.txt’
Execution status: FAIL
2023-12-07 16:40:11,061 [TAO Toolkit] [INFO] nvidia_tao_cli.components.docker_handler.docker_handler 337: Stopping container.
but i have the spec file
ls -l /home/glueck/getting_started_v5.0.0/notebooks/tao_launcher_starter_kit/penang_port/specs/
total 28
-rw-rw-r-- 1 glueck glueck 1498 Dis 21 2022 detectnet_v2_inference_kitti_etlt_qat.txt
-rw-rw-r-- 1 glueck glueck 1485 Dis 21 2022 detectnet_v2_inference_kitti_etlt.txt
-rw-rw-r-- 1 glueck glueck 1456 Dis 21 2022 detectnet_v2_inference_kitti_tlt.txt
-rw-rw-r-- 1 glueck glueck 3372 Dis 21 2022 detectnet_v2_retrain_resnet18_kitti_qat.txt
-rw-rw-r-- 1 glueck glueck 3378 Dis 21 2022 detectnet_v2_retrain_resnet18_kitti.txt
-rw-rw-r-- 1 glueck glueck 303 Dis 21 2022 detectnet_v2_tfrecords_kitti_trainval.txt
-rw-rw-r-- 1 glueck glueck 3384 Dis 21 2022 detectnet_v2_train_resnet18_kitti.txt
cat /home/glueck/getting_started_v5.0.0/notebooks/tao_launcher_starter_kit/penang_port/specs/detectnet_v2_tfrecords_kitti_trainval.txt
kitti_config {
root_directory_path: “/workspace/tao-experiments/Data/Train”
image_dir_name: “Images”
label_dir_name: “Labels”
image_extension: “.jpeg”
partition_mode: “random”
num_partitions: 2
val_split: 20
num_shards: 10
}
image_directory_path: “/workspace/tao-experiments/Data/Train”