Skip to content

Latest commit

 

History

History
93 lines (81 loc) · 4.46 KB

MODEL_ZOO.md

File metadata and controls

93 lines (81 loc) · 4.46 KB

Mask2Former Model Zoo and Baselines

This model zoo file is a short version of Mask2Former's model zoo file.

Introduction

This file documents a collection of models reported in our paper. All numbers were obtained on Big Basin servers with 8 NVIDIA V100 GPUs & NVLink (except Swin-L models are trained with 16 NVIDIA V100 GPUs).

How to Read the Tables

  • The "Name" column contains a link to the config file. Running train_net.py --num-gpus 8 with this config file will reproduce the model (except Swin-L models are trained with 16 NVIDIA V100 GPUs with distributed training on two nodes).
  • The model id column is provided for ease of reference. To check downloaded file integrity, any model on this page contains its md5 prefix in its file name.

Third-party ImageNet Pretrained Models

Our paper also uses ImageNet pretrained models that are not part of Detectron2, please refer to tools to get those pretrained models.

License

All models available for download through this document are licensed under the Creative Commons Attribution-NonCommercial 4.0 International License.

COCO Model Zoo

Panoptic Segmentation

Name Backbone epochs PQ AP mIoU model id download
Mask2Former Swin-T 50 53.2 43.3 63.2 48558700_1 model
Mask2Former Swin-S 50 54.6 44.7 64.2 48558700_3 model
Mask2Former Swin-B 50 55.1 45.2 65.1 48558700_5 model
Mask2Former Swin-B (IN21k) 50 56.4 46.3 67.1 48558700_7 model
Mask2Former (200 queries) Swin-L (IN21k) 100 57.8 48.6 67.4 47429163_0 model