Skip to content

Mix ViT in_channels restriction #823

Closed
@ghost

Description

What's the reasoning behind limiting the Mix Visual Transformer encoder #632 to 3 input channels?

https://github.com/qubvel/segmentation_models.pytorch/blob/master/segmentation_models_pytorch/encoders/mix_transformer.py#L468

I couldn't spot anything in the paper or the original SegFormer implementation.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions