Please provide the following information when requesting support.
• Hardware (T4/V100/Xavier/Nano/etc)
Azure VM A100
• Network Type (Detectnet_v2/Faster_rcnn/Yolo_v4/LPRnet/Mask_rcnn/Classification/etc)
Detectnet_v2
• TLT Version (Please run “tlt info --verbose” and share “docker_tag” here)
• Training spec file(If have, please share here)
• How to reproduce the issue ? (This is for errors. Please share the command line and the detailed log here.)
I am running a VM instance in Azure, set up following the instructions here. I'm trying to perform transfer learning on a custom dataset using the Detectnet_v2 mobilenet_v1 model, preferably, but I can use resnet18 since the Jupyter notebook appears to be configured for it already. My custom dataset uses the required names and structure, and I have also provided a testing directory, like this:
$LOCAL_PROJECT_DIR/data/training/image_2
$LOCAL_PROJECT_DIR/data/training/label_2
$LOCAL_DATA_DIR/testing/image_2
Everything seems to be going right, although I have Python 3.8.10, which I believe should still work (I could be wrong). I skip the dataset download section, but I still run the verification step, and the images are in the correct path.
The problem comes when I try to prepare TFRecords from the dataset.
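For reference, the layout check described above can be sketched in Python as follows. This is only an illustration, not part of the notebook: the helper name `find_unpaired` is hypothetical, and it assumes KITTI-style labels, i.e. one `.txt` file per image with the same file stem, with `.jpg` images as in the conversion spec.

```python
from pathlib import Path


def find_unpaired(training_dir, image_ext=".jpg", label_ext=".txt"):
    """Return (images_without_labels, labels_without_images) as sorted stem lists.

    Hypothetical helper: assumes the KITTI-style layout used in this post,
    i.e. <training_dir>/image_2 and <training_dir>/label_2.
    """
    root = Path(training_dir)
    # Collect file stems (names without extension) from each directory.
    image_stems = {p.stem for p in (root / "image_2").glob(f"*{image_ext}")}
    label_stems = {p.stem for p in (root / "label_2").glob(f"*{label_ext}")}
    # Set differences give the files that have no counterpart.
    missing_labels = sorted(image_stems - label_stems)
    missing_images = sorted(label_stems - image_stems)
    return missing_labels, missing_images
```

Calling `find_unpaired` on the local training directory should return two empty lists when every image has a matching label file.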
cell:
print("TFrecords conversion spec file for kitti training")
!cat $LOCAL_SPECS_DIR/detectnet_v2_tfrecords_kitti_trainval.txt
output:
TFrecords conversion spec file for kitti training
kitti_config {
  root_directory_path: "/workspace/tao-experiments/data/training"
  image_dir_name: "image_2"
  label_dir_name: "label_2"
  image_extension: ".jpg"
  partition_mode: "random"
  num_partitions: 2
  val_split: 14
  num_shards: 10
}
image_directory_path: "/workspace/tao-experiments/data/training"
cell:
# Creating a new directory for the output tfrecords dump.
print("Converting Tfrecords for kitti trainval dataset")
!mkdir -p $LOCAL_DATA_DIR/tfrecords && rm -rf $LOCAL_DATA_DIR/tfrecords/*
!tao detectnet_v2 dataset_convert \
-d $SPECS_DIR/detectnet_v2_tfrecords_kitti_trainval.txt \
-o $DATA_DOWNLOAD_DIR/tfrecords/kitti_trainval/kitti_trainval
output:
Converting Tfrecords for kitti trainval dataset
usage: tao [-h] {list,stop,info,dataset,deploy,model} ...
tao: error: invalid choice: 'detectnet_v2' (choose from 'list', 'stop', 'info', 'dataset', 'deploy', 'model')
It makes the tfrecords directory but does not create the records.
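The error message has the shape of a standard argparse subcommand rejection: the first positional argument is not among the registered choices. A minimal Python sketch of that behaviour follows. It assumes, without confirmation from anything in this post, that the launcher is built on argparse subparsers. Only the subcommand names are taken from the log; the function names and structure are hypothetical.

```python
import argparse

# Subcommand names copied from the usage line in the log above.
TOP_LEVEL_COMMANDS = ("list", "stop", "info", "dataset", "deploy", "model")


def build_parser():
    """Build a stand-in parser with the same top-level choices (hypothetical)."""
    parser = argparse.ArgumentParser(prog="tao")
    subparsers = parser.add_subparsers(dest="command")
    for name in TOP_LEVEL_COMMANDS:
        subparsers.add_parser(name)
    return parser


def try_parse(argv):
    """Return the accepted subcommand, or a note if argparse rejects argv."""
    parser = build_parser()
    try:
        return parser.parse_args(argv).command
    except SystemExit as exc:
        # argparse prints the usage and error to stderr, then exits.
        return f"rejected (exit code {exc.code})"
```

With this stand-in, `try_parse(["detectnet_v2", "dataset_convert"])` is rejected at argument parsing, before any work starts. That would be consistent with the `mkdir` step succeeding while no records are written.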
cell:
!ls -rlt $LOCAL_DATA_DIR/tfrecords/kitti_trainval/
output:
ls: cannot access '/home/azureuser/tao/data/tfrecords/kitti_trainval/': No such file or directory