NVIDIA TAO 5.5 Release : New Foundation Models and Training Capabilities

TomNVIDIA · August 28, 2024, 9:53pm

Highlights from this release:

Explore new foundation and multi-modal models:
- Grounding-DINO—Open vocabulary object detection with fine-tuning
- Mask-GroundingDINO—Open vocabulary instance segmentation with fine-tuning
- NV-CLIP—Foundation model for image and text embedding
- BEVFusion—Sensor fusion model combining image and lidar data for 3D understanding with fine-tuning
- SEGIC—In-context segmentation on any object based on visual prompting.
- FoundationPose—Six DoF object pose estimation for any novel objects
- Mask2Former—State-of-the-art instance and panoptic segmentation model with fine-tuning
Automatically create label datasets for object detection and segmentation using text prompts.
Knowledge distillation—Create smaller efficient and accurate networks from distilling knowledge of larger networks.

Morganh · August 29, 2024, 2:44am

Source code can be found in the bottom of link.

Topic		Replies	Views
New Foundational Models and Training Capabilities with NVIDIA TAO 5.5 Technical Blog	1	25	August 27, 2024
ANNOUNCEMENT: General Availability of TAO Toolkit Version 5.0 TAO Toolkit	1	450	July 26, 2023
Announcing the new version of the NVIDIA TAO Toolkit v3.22.05 TAO Toolkit	1	1447	June 6, 2022
Develop and Optimize Vision AI Models for Trillions of Devices with NVIDIA TAO Technical Blog	0	320	December 6, 2023
New Release: NVIDIA TAO 5.2 Technical Blog	1	305	January 5, 2024
Announcing general availability for Transfer Learning Toolkit 2.0 TAO Toolkit	2	770	August 4, 2020
Fine Tuning Retail Object Detection Models provided in NGC TAO Toolkit ngc	9	37	November 20, 2024
A few questions regarding TAO 5.0 TAO Toolkit	9	621	August 3, 2023
Nvidia tao 툴킷 5.0으로 최신 비전 ai 모델 개발 워크플로우에 액세스하세요 Technical Blog - South Korea korean	0	473	July 27, 2023
Upcoming webinar: Low-code AI model development with the NVIDIA TAO Toolkit TAO Toolkit	1	684	April 26, 2022