The “resnet34_peoplenet_int8.etlt” is a quantized model.
See PeopleNet | NVIDIA NGC: for the quantized INT8 model, a third quantization-aware training (QAT) phase is carried out. Regularization is not included in the second and third phases. The quantized models share the same structure as the pruned model; however, they have been trained with Quantization Aware Training and are intended for INT8 deployment.
So “resnet34_peoplenet_int8.etlt” is meant for INT8 deployment.
I understand that the second command generates an etlt file together with an INT8 calibration file. But are the etlt files generated by these two commands the same? In other words, can the etlt file generated by the second command be used for FP32/FP16 deployment, and vice versa?