This model zoo file is a short version of Mask2Former's model zoo file.
This file documents a collection of models reported in our paper. All numbers were obtained on Big Basin servers with 8 NVIDIA V100 GPUs and NVLink, except for the Swin-L models, which were trained with 16 NVIDIA V100 GPUs.
- The "Name" column contains a link to the config file. Running `train_net.py --num-gpus 8` with this config file will reproduce the model (except for the Swin-L models, which are trained on 16 NVIDIA V100 GPUs with distributed training across two nodes).
- The "model id" column is provided for ease of reference. To let you check downloaded file integrity, every model on this page contains its md5 prefix in its file name.
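
The md5-prefix check described above can be sketched in a few lines of Python. This is a minimal sketch, not a tool shipped with the models. It assumes the prefix is the trailing run of hexadecimal characters after the last underscore in the file name (for example `model_final_<prefix>.pkl`), which this page does not spell out. The helper names `md5_hex`, `prefix_from_name`, and `check_md5_prefix` are hypothetical.

```python
import hashlib
import re
from pathlib import Path


def md5_hex(path, chunk_size=1 << 20):
    """Return the hex md5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def prefix_from_name(path):
    """Extract the md5 prefix embedded in the file name, or None.

    Assumption: the name looks like "<anything>_<hex prefix>.<ext>".
    """
    match = re.search(r"_([0-9a-f]{6,32})$", Path(path).stem)
    return match.group(1) if match else None


def check_md5_prefix(path):
    """True if the file's md5 digest starts with the prefix in its name."""
    prefix = prefix_from_name(path)
    if prefix is None:
        raise ValueError(f"no md5 prefix found in file name: {path}")
    return md5_hex(path).startswith(prefix)
```

Calling `check_md5_prefix` on the path of a downloaded checkpoint returns `False` when the file is truncated or corrupted, in which case the file should be downloaded again.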
Our paper also uses ImageNet pretrained models that are not part of Detectron2; please refer to tools to get those pretrained models.
All models available for download through this document are licensed under the Creative Commons Attribution-NonCommercial 4.0 International License.
Name | Backbone | Epochs | PQ | AP | mIoU | Model id | Download |
---|---|---|---|---|---|---|---|
Mask2Former | Swin-T | 50 | 53.2 | 43.3 | 63.2 | 48558700_1 | model |
Mask2Former | Swin-S | 50 | 54.6 | 44.7 | 64.2 | 48558700_3 | model |
Mask2Former | Swin-B | 50 | 55.1 | 45.2 | 65.1 | 48558700_5 | model |
Mask2Former | Swin-B (IN21k) | 50 | 56.4 | 46.3 | 67.1 | 48558700_7 | model |
Mask2Former (200 queries) | Swin-L (IN21k) | 100 | 57.8 | 48.6 | 67.4 | 47429163_0 | model |