Most accurate TLT model

Hi, I searched about it in TLT blog and docs but I didn’t found a mAP/latency comparisson between backbones/models available in TLT. Which backbone + model should have more accuracy (regardless of performance/latency)? It would be the Faster RCNN + EfficientNet B1? Has plans to make an public comparisson table between models?

In TLT user guide, it provides mAP and FPS information for some Purpose-built models. They are mostly based on TLT detectnet_v2 network. See Overview — TAO Toolkit 3.22.05 documentation .
The mAP result usually varies due to different input_size, dataset, pretrained models, networks, backbones, etc. For example, pretrained weights trained on the ImageNet dataset tend to provide good accuracy for object detection. But we cannot release the pretrained models trained on ImageNet. So, we write some blogs to show the steps to achieve this accuracy with TLT. For example, https://developer.nvidia.com/blog/preparing-state-of-the-art-models-for-classification-and-object-detection-with-tlt/ .
For your case, if you want to find a model which has more accuracy (regardless of performance/latency), you can consider following blog to get pretrained model trained on the ImageNet dataset and then try TLT yolo_v4 or retinanet.

1 Like

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.