Yolov3 worklfow or incorrect calibration file for int8 inference

ilias.rmouque1 · July 5, 2023, 8:07am

Please provide the following information when requesting support.

• Hardware (T4/V100/Xavier/Nano/etc) I train on rtx 4090 and I do inference on AGX Xavier
• Network Type (Detectnet_v2/Faster_rcnn/Yolo_v4/LPRnet/Mask_rcnn/Classification/etc) Yolov3
• TLT Version (Please run “tlt info --verbose” and share “docker_tag” here)

dockers:
nvidia/tao/tao-toolkit-tf:
v3.21.11-tf1.15.5-py3:
docker_registry: nvcr.io
tasks:
1. augment
2. bpnet
3. classification
4. dssd
5. emotionnet
6. efficientdet
7. fpenet
8. gazenet
9. gesturenet
10. heartratenet
11. lprnet
12. mask_rcnn
13. multitask_classification
14. retinanet
15. ssd
16. unet
17. yolo_v3
18. yolo_v4
19. yolo_v4_tiny
20. converter
v3.21.11-tf1.15.4-py3:
docker_registry: nvcr.io
tasks:
1. detectnet_v2
2. faster_rcnn
nvidia/tao/tao-toolkit-pyt:
v3.21.11-py3:
docker_registry: nvcr.io
tasks:
1. speech_to_text
2. speech_to_text_citrinet
3. text_classification
4. question_answering
5. token_classification
6. intent_slot_classification
7. punctuation_and_capitalization
8. action_recognition
v3.22.02-py3:
docker_registry: nvcr.io
tasks:
1. spectro_gen
2. vocoder
nvidia/tao/tao-toolkit-lm:
v3.21.08-py3:
docker_registry: nvcr.io
tasks:
1. n_gram
format_version: 2.0
toolkit_version: 3.22.02
published_date: 02/28/2022

• Training spec file(If have, please share here)

random_seed: 42
yolov3_config {
big_anchor_shape: “[(114.94, 60.67), (159.06, 114.59), (297.59, 176.38)]”
mid_anchor_shape: “[(42.99, 31.91), (79.57, 31.75), (56.80, 56.93)]”
small_anchor_shape: “[(15.60, 13.88), (30.25, 20.25), (20.67, 49.63)]”
matching_neutral_box_iou: 0.7
arch: “resnet”
nlayers: 18
arch_conv_blocks: 2
loss_loc_weight: 0.8
loss_neg_obj_weights: 100.0
loss_class_weights: 1.0
freeze_bn: false
#freeze_blocks: 0
force_relu: false
}
training_config {
batch_size_per_gpu: 8
num_epochs: 80
enable_qat: true
checkpoint_interval: 10
learning_rate {
soft_start_annealing_schedule {
min_learning_rate: 1e-6
max_learning_rate: 1e-4
soft_start: 0.1
annealing: 0.5
}
}
regularizer {
type: L1
weight: 3e-9
}
optimizer {
adam {
epsilon: 0.001
beta1: 0.9
beta2: 0.999
amsgrad: false
}
}
pretrain_model_path: “/workspace/tao-experiments/yolov3/pretrained_resnet18/pretrained_object_detection_vresnet18/resnet_18.hdf5”
}
eval_config {
average_precision_mode: SAMPLE
batch_size: 8
matching_iou_threshold: 0.5
}
nms_config {
confidence_threshold: 0.001
clustering_iou_threshold: 0.5
top_k: 200
force_on_cpu: True
}
augmentation_config {
hue: 0.1
saturation: 1.5
exposure:1.5
vertical_flip:0
horizontal_flip: 0.5
jitter: 0.3
output_width: 1280
output_height: 1280
output_channel: 3
randomize_input_shape_period: 0
}
dataset_config {
data_sources: {
tfrecords_path: “/workspace/tao-experiments/data/tfrecords/kitti_trainval/*”
image_directory_path: “/workspace/tao-experiments/try-6”
}
include_difficult_in_training: true
image_extension: “jpg”
target_class_mapping {
key: “cachalot”
value: “cachalot”
}
target_class_mapping {
key: “jet”
value: “jet”
}
target_class_mapping {
key: “rorqual”
value: “rorqual”
}
target_class_mapping {
key: “bateau”
value: “bateau”
}
target_class_mapping {
key: “globicephale”
value: “globicephale”
}
target_class_mapping {
key: “queue_cachalot”
value: “queue_cachalot”
}
validation_fold: 0
}

• How to reproduce the issue ? (This is for errors. Please share the command line and the detailed log here.)

Hello,
I think I have a problem with the generation of my calibration file in TAO. I want to do inference on deepstream in int8 using this yolo v3 model (key = tlt_encode). When I do inference both in fp32 and fp16 everything works well, but when I use int8 it gives pretty bad results.

I’m pretty sure it comes from the calibration file. To generate it I export the model I train using tao-converter and then I use tao-deploy to generate the calibration file, both step on my RTX4090.

Can you tell me if my workflow is correct and if so can you check my calibration file and tell me what’s wrong ?

Thank you for your answer

Morganh · July 5, 2023, 8:42am

To export the model, you can use yolo_v3 export xxx. It will generate .etlt model based on .tlt model.

Please note that tao-converter will generate tensorrt engine based on .etlt model. It will not generate .etlt model.

In latest docker, you can use tao-deploy to generate tensorrt engine instead of tao-converter.

For the incorrect calibration, please try to use entire training dataset when you run tao-deploy yolo_v3 gen_trt_engine to generate a new cal.bin.

I also observe that you are using an old version of TAO. In this case, you can use
tao yolo_v3 export to generate a new cal.bin.

ilias.rmouque1 · July 5, 2023, 9:57am

Thank you for this very quick answer !

Sorry I mixed things up I used tao export to generate the etlt and then tao deploy for the cal.bin.

Here is my code for both (I used containers to be sure to be up to date).

For the export
docker run -it --rm --gpus all --ipc=host --ulimit memlock=-1 --ulimit stack=67108864
-v .:/workspace nvcr.io/nvidia/tao/tao-toolkit:4.0.1-tf1.15.5
yolo_v3 export -e /workspace/specs/experiment_spec.json
-m /workspace/yolo_v3/experiment_dir_final_2/weights/yolov3_resnet18_epoch_080.tlt
-o /workspace/yolo_v3/experiment_dir_final_2/resnet18_detector_qat.etlt
-k tlt_encode
–static_batch_size 1
–gen_ds_config

For the cal.bin generation

docker run -it --rm --gpus all --ipc=host --ulimit memlock=-1 --ulimit stack=67108864
-v .:/workspace nvcr.io/nvidia/tao/tao-toolkit:4.0.0-deploy
yolo_v3 gen_trt_engine
-e /workspace/specs/experiment_spec.json
-m /workspace/yolo_v3/experiment_dir_final_2/resnet18_detector_qat.etlt
-k tlt_encode
–cal_image_dir /workspace/try-6/train
–data_type int8
–cal_cache_file /workspace/yolo_v3/experiment_dir_final_2/cal.bin
–force_ptq
–engine_file /workspace/yolo_v3/experiment_dir_final_2/trt.engine.int8

As you can see I’m using the train directory as source for the calibration file.

Morganh · July 5, 2023, 3:49pm

Please add below and retry. xxx means the total images in the training images folder.
--batches xxx
--batch_size 1

ilias.rmouque1 · July 6, 2023, 10:04am

Thanks for the answer,

I tried it and the results are even worse … Like I get only one detection in the corner of my image and the same detection for all.

Here’s the obtained calibration file

Morganh · July 6, 2023, 2:10pm

There is no update from you for a period, assuming this is not an issue anymore. Hence we are closing this topic. If need further support, please open a new one. Thanks

Can you official yolo_v3 notebook to check if it works? It will train against KITTI dataset.

system · July 24, 2023, 7:54am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Unable to export QAT yolov3 in int8 TAO Toolkit	7	549	April 25, 2023
TLT YOLOv3 Int8 can not detect anything TAO Toolkit	17	1688	October 12, 2021
Inference YOLO_v4 int8 mode doesn't show any bounding box TAO Toolkit	31	2540	November 12, 2021
Convert TAO Yolov4 model to DLA engine fails TAO Toolkit	22	1662	March 1, 2022
Error when training YOLOV3 with TAO TAO Toolkit	5	553	May 20, 2022
How to generate the correct engine with tensorrt for Yolov3 TAO Toolkit	8	989	July 22, 2023
Cannot produce calibration binary for INT8 compression with YOLOv4 TAO Toolkit	2	290	June 2, 2023
Unable to deploy TAO 4.0.1 yolov4 model on deepstream6.0 TAO Toolkit deepstream	43	1077	August 18, 2023
Error while converting model using TAO TAO Toolkit	32	796	October 27, 2021
TAO converter - INT8 engine generated with YOLOV4(CSPDarknet53) gives wrong predictions(0 mAP) for models trained with fish-eye datasets TAO Toolkit	20	2033	December 22, 2021

Yolov3 worklfow or incorrect calibration file for int8 inference

Related topics