SegFormer Pretrained Models - Different Architectures

Although I can find pre-trained SegFormer models in the NGC Catalog, the model architectures differ from the ones in the paper (B0-B5).
In the training configuration schema, these are the backbone options:
- mit_b0
- mit_b1
- mit_b2
- mit_b3
- mit_b4
- mit_b5
- fan_tiny_8_p4_hybrid
- fan_large_16_p4_hybrid
- fan_small_12_p4_hybrid
- fan_base_16_p4_hybrid
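As a side note, a typo in this field fails inside the container, so it can help to validate the choice up front. A minimal pure-Python sketch; the `BACKBONES` set simply mirrors the schema options above, and `check_backbone` is an illustrative helper, not part of the TAO API:

```python
# Backbone options accepted by the SegFormer training schema (as listed above).
BACKBONES = {
    "mit_b0", "mit_b1", "mit_b2", "mit_b3", "mit_b4", "mit_b5",
    "fan_tiny_8_p4_hybrid", "fan_small_12_p4_hybrid",
    "fan_base_16_p4_hybrid", "fan_large_16_p4_hybrid",
}

def check_backbone(name: str) -> str:
    """Raise early, with the list of valid names, instead of failing in-container."""
    if name not in BACKBONES:
        raise ValueError(
            f"unknown backbone {name!r}; choose from {sorted(BACKBONES)}"
        )
    return name

print(check_backbone("mit_b0"))  # mit_b0
```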

The models in the catalog are FAN models: Pre-trained SegFormer ImageNet weights | NVIDIA NGC

What’s the correspondence of these models to the ones described in the paper? (For instance, is the tiny FAN model equivalent to B0?)
Where can I find the pre-trained models for mit_b(0-5) backbones?

You can find the mit_b5 pretrained model in CitySemSegformer | NVIDIA NGC
The fan_tiny backbone is not equivalent to B0.

I was able to reach the pretrained models for mit_b0 to mit_b5 through the GitHub repo: GitHub - NVlabs/SegFormer: Official PyTorch implementation of SegFormer.
However, the download link in that repo is no longer available.

The ones that I downloaded from this repo earlier (mit_b0 and mit_b1) were working with tao 4.0.0-pyt; however, they throw an error with the current version, 5.0.0-pyt.

Are you planning to share pretrained models for these backbones, or is this FAN structure a permanent update to the SegFormer architecture?

For the GitHub repo: GitHub - NVlabs/SegFormer: Official PyTorch implementation of SegFormer, please try downloading from the ( google drive | onedrive ) links.

As I indicated earlier, when I choose the backbone option 'mit_b0' and use the mit_b0.pth file from the link you provided, I receive the following error (in tao v4 there was no problem):


:245: UserWarning: Log file already exists at /workspace/tao-experiments/SegFormer_TAO/results/train/status.json
2023-09-11 07:17:51,363 - mmseg - INFO - Start logging for Training
2023-09-11 07:17:51,474 - mmseg - INFO - Label Id 0: Train Id 0
2023-09-11 07:17:51,474 - mmseg - INFO - Label Id 1: Train Id 1
2023-09-11 07:17:51,474 - mmseg - INFO - Label Id 2: Train Id 2
2023-09-11 07:17:51,474 - mmseg - INFO - Label Id 3: Train Id 3
2023-09-11 07:17:51,474 - mmseg - INFO - Label Id 4: Train Id 4
2023-09-11 07:17:51,474 - mmseg - INFO - Label Id 5: Train Id 5
2023-09-11 07:17:51,474 - mmseg - INFO - Completed Data Module Construction
Concatenating datasets
2023-09-11 07:17:51,606 - mmseg - INFO - Loaded 10140 images
Error executing job with overrides: ['results_dir=/workspace/tao-experiments/SegFormer_TAO/results']
An error occurred during Hydra's exception formatting:
AssertionError()
Traceback (most recent call last):
File "/usr/local/lib/python3.8/dist-packages/hydra/_internal/utils.py", line 254, in run_and_report
assert mdl is not None
AssertionError

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "</usr/local/lib/python3.8/dist-packages/nvidia_tao_pytorch/cv/segformer/scripts/train.py>", line 3, in
File "", line 176, in
File "", line 107, in wrapper
File "/usr/local/lib/python3.8/dist-packages/hydra/_internal/utils.py", line 389, in _run_hydra
_run_app(
File "/usr/local/lib/python3.8/dist-packages/hydra/_internal/utils.py", line 452, in _run_app
run_and_report(
File "/usr/local/lib/python3.8/dist-packages/hydra/_internal/utils.py", line 296, in run_and_report
raise ex
File "/usr/local/lib/python3.8/dist-packages/hydra/_internal/utils.py", line 213, in run_and_report
return func()
File "/usr/local/lib/python3.8/dist-packages/hydra/_internal/utils.py", line 453, in
lambda: hydra.run(
File "/usr/local/lib/python3.8/dist-packages/hydra/_internal/hydra.py", line 132, in run
_ = ret.return_value
File "/usr/local/lib/python3.8/dist-packages/hydra/core/utils.py", line 260, in return_value
raise self._return_value
File "/usr/local/lib/python3.8/dist-packages/hydra/core/utils.py", line 186, in run_job
ret.return_value = task_function(task_cfg)
File "", line 172, in main
File "", line 162, in main
File "", line 124, in run_experiment
File "", line 85, in build_segmentor
File "", line 51, in build
File "/usr/local/lib/python3.8/dist-packages/mmcv/utils/registry.py", line 72, in build_from_cfg
raise type(e)(f'{obj_cls.__name__}: {e}')
KeyError: "EncoderDecoder: 'state_dict'"
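The final `KeyError: 'state_dict'` suggests the 5.0.0-pyt loader looks up a `state_dict` key inside the loaded checkpoint dict, while the older GitHub mit_b0.pth may store the weights at the top level. A minimal re-wrapping sketch, assuming that layout mismatch is the cause; plain dicts stand in for what `torch.load`/`torch.save` would read and write, and `ensure_state_dict` is a hypothetical helper, not a TAO API:

```python
def ensure_state_dict(ckpt):
    """Wrap a raw weight dict in the {'state_dict': ...} layout that a
    loader expecting ckpt['state_dict'] can consume. In practice the
    checkpoint would be read with torch.load(path) and written back
    with torch.save(wrapped, new_path)."""
    if isinstance(ckpt, dict) and "state_dict" in ckpt:
        return ckpt  # already in the expected layout
    return {"state_dict": ckpt}

# Simulated raw checkpoint (real entries would be torch tensors):
raw = {"backbone.patch_embed1.proj.weight": [0.0]}
wrapped = ensure_state_dict(raw)
print("state_dict" in wrapped)  # True
```

Whether the rewrapped file then loads cleanly still depends on the layer names matching what the 5.0.0-pyt backbone expects.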

The mit_b0 backbone introduced in tao 5.0.0-pyt doesn't match the ones in the GitHub repo.
Where else can I find these backbones?

Will sync with the internal team to check whether there are B0/B1/B2/B3/B4 pretrained models.
