Distillation training

Please provide the following information when requesting support.

• Hardware (T4)
• Network Type (Dino and Yolo)

• How to reproduce the issue?
I am trying to get a small model that can be deployed efficiently on Jetson devices, like the Yolo models, but I want it to be as accurate as bigger models like Dino.

There is a distillation approach now available for Dino, as shown in the tutorial below.

I also saw that these code bases are available:
NVIDIA general distillation modules:

NVIDIA dino distillation models

I want to do distillation training, using a Dino model as the teacher and a Yolo model as the student.
Is this currently possible in the TAO pipeline?

Or how could this be implemented inside the TAO repositories?

Or is there any other way, for example using external Yolo training code together with the Dino training pipeline in TAO, to achieve this?
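
To make the question concrete, below is roughly the kind of external teacher-student distillation loop I have in mind. This is only a minimal PyTorch sketch with placeholder classification-style models and illustrative loss weights; none of it is TAO code, and a real detector setup would distill detection outputs and features rather than plain class logits.

```python
# Minimal, generic knowledge-distillation sketch (not TAO code).
# TeacherNet / StudentNet are placeholders standing in for a DINO-style
# teacher and a YOLO-style student; the real models would come from
# their respective training code bases.
import torch
import torch.nn as nn
import torch.nn.functional as F

class TeacherNet(nn.Module):           # placeholder for a large Dino-style teacher
    def __init__(self, num_classes=80):
        super().__init__()
        self.backbone = nn.Sequential(nn.Conv2d(3, 64, 3, padding=1), nn.ReLU(),
                                      nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.head = nn.Linear(64, num_classes)

    def forward(self, x):
        feat = self.backbone(x)
        return self.head(feat), feat   # logits + feature for feature distillation

class StudentNet(nn.Module):           # placeholder for a small Yolo-style student
    def __init__(self, num_classes=80):
        super().__init__()
        self.backbone = nn.Sequential(nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
                                      nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.head = nn.Linear(16, num_classes)
        self.adapter = nn.Linear(16, 64)    # project student features to teacher width

    def forward(self, x):
        feat = self.backbone(x)
        return self.head(feat), self.adapter(feat)

def distillation_step(teacher, student, optimizer, images, labels,
                      temperature=4.0, alpha=0.5, beta=0.1):
    """One training step: task loss + soft-label KD loss + feature-matching loss."""
    teacher.eval()
    with torch.no_grad():
        t_logits, t_feat = teacher(images)   # frozen teacher forward pass

    s_logits, s_feat = student(images)

    task_loss = F.cross_entropy(s_logits, labels)
    kd_loss = F.kl_div(F.log_softmax(s_logits / temperature, dim=-1),
                       F.softmax(t_logits / temperature, dim=-1),
                       reduction="batchmean") * temperature ** 2
    feat_loss = F.mse_loss(s_feat, t_feat)

    loss = (1 - alpha) * task_loss + alpha * kd_loss + beta * feat_loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

if __name__ == "__main__":
    teacher, student = TeacherNet(), StudentNet()
    optimizer = torch.optim.AdamW(student.parameters(), lr=1e-4)
    images = torch.randn(4, 3, 64, 64)            # dummy batch for illustration
    labels = torch.randint(0, 80, (4,))
    print(distillation_step(teacher, student, optimizer, images, labels))
```

In the TAO case, the teacher forward pass would presumably come from the Dino pipeline and the student from external Yolo training code, and wiring those two together is the part I am unsure about.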

Appreciate your support on this

Currently in DINO, you can use (DINO + ResNet50) as the student. Please try to follow the notebook to run distillation. I am afraid this student may not reach the expected fps when comparing to YOLOv4.
The performance table can be found in Overview - NVIDIA Docs.
More info can be found in https://arxiv.org/pdf/2004.10934

There has been no update from you for a while, so we assume this is no longer an issue and are closing this topic. If you need further support, please open a new one. Thanks