In the NVIDIA article "Improve Accuracy and Robustness of Vision AI Apps with Vision Transformers and NVIDIA TAO", the implied message is to use SegFormer for accuracy and robustness.
Yet I trained both the TAO 5 UNet and SegFormer examples out of the box and got very different results:
In my previous-to-last post I explained the very bad results I was getting with my custom dataset and SegFormer on TAO 4, which led me to abandon the idea of using SegFormer with my data because it would not converge despite running for several days…
@Morganh's answer then was to add more data, but these custom images are very expensive, and I don't want to spend a lot of money generating more of them just to find out that I am in the same place.
The question: under what dataset or training conditions can I expect better results from SegFormer?
Also, in your current experiment for SegFormer there is no pretrained model. You can download the mit_b5 version of the pretrained model from NGC and set it in the training spec file.
The pretrained model can be found in
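For reference, a minimal sketch of how the downloaded weights might be wired into the training spec. The key names (model.pretrained_model_path, model.backbone.type) and the local path here are assumptions based on the TAO 5 SegFormer spec layout, so verify them against the spec file shipped with your TAO version; the NGC CLI's ngc registry model download-version command can fetch the checkpoint once you have its registry path.

```yaml
# Hypothetical excerpt of a TAO 5 SegFormer training spec (train_isbi.yaml).
# Exact key names may differ between TAO releases; check your version's docs.
model:
  backbone:
    type: "mit_b5"   # backbone variant matching the NGC checkpoint
  pretrained_model_path: /workspace/pretrained/mit_b5.pth  # local path to weights downloaded from NGC
```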
```
yaml.scanner.ScannerError: while scanning for the next token
found character '\t' that cannot start any token
  in "/specs/train_isbi.yaml", line 18, column 3
```