
Amodal Segmentation Based on Visible Region Segmentation and Shape Prior (VRSP-Net)

This is a preview PyTorch implementation of our AAAI 2021 paper:

"Amodal Segmentation Based on Visible Region Segmentation and Shape Prior" (VRSP-Net).

It is based on Detectron2, Facebook AI Research's next-generation software system for detection and segmentation. Details of the framework can be found at https://github.com/facebookresearch/detectron2

Installation

cd Amodal-Segmentation-Based-on-Visible-Region-Segmentation-and-Shape-Prior

We tested this implementation with Python 3.6+, PyTorch 1.4, torchvision 0.5, CUDA 10.1, and fvcore 0.1. Download PyTorch and torchvision from the PyTorch website. You can install the other common packages with:

pip install -r requirements.txt
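
If you need the exact tested versions of PyTorch and torchvision (assuming the default CUDA 10.1 wheels of that release), one option is:

pip install torch==1.4.0 torchvision==0.5.0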

Setup Detectron2:

python -m pip install -e .

See INSTALL.md for how to set up Detectron2; note that it covers the older version of Detectron2 bundled in this repository. If you want to use the latest version of Detectron2, go to the official Detectron2 repository and use its setup files instead.

Install COCO API:

pip install git+https://github.com/philferriere/cocoapi.git#subdirectory=PythonAPI

Metrics

In our paper, we report mAP(Occluded) to evaluate performance on occluded objects (occlusion rate > 0.15). If you want to compute this metric, use the cocoeval.py in detectron2/data/amodal_datasets/pycocotools. Note that this metric can only be applied to methods that predict both amodal and visible masks.
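
For reference, below is a minimal sketch of how such a filter could be computed from COCO-style RLE masks. The occlusion-rate definition (the occluded fraction of the amodal mask) and the annotation keys amodal_rle and visible_rle are assumptions for illustration; the authoritative implementation is the provided cocoeval.py.

import pycocotools.mask as mask_utils

def occlusion_rate(amodal_rle, visible_rle):
    # Occluded fraction of the amodal mask: 1 - |amodal & visible| / |amodal|.
    amodal = mask_utils.decode(amodal_rle).astype(bool)
    visible = mask_utils.decode(visible_rle).astype(bool)
    amodal_area = amodal.sum()
    if amodal_area == 0:
        return 0.0
    return 1.0 - float((amodal & visible).sum()) / float(amodal_area)

def filter_occluded(annotations, threshold=0.15):
    # Keep only instances counted as occluded for mAP(Occluded).
    # "amodal_rle" / "visible_rle" are hypothetical key names.
    return [a for a in annotations
            if occlusion_rate(a["amodal_rle"], a["visible_rle"]) > threshold]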

The evaluator outputs the following tasks: amodal_segm (coarse), amodal2_segm (refined), visible_segm (coarse), visible2_segm (refined), and bbox. The amodal2_segm results correspond to those reported in the paper.

About Detectron2

If you want to know more about how to use the Detectron2 framework, see GETTING_STARTED.md or the Colab Notebook.

Learn more in the documentation.

Download Resources

D2SA dataset

The D2S Amodal dataset can be found at mvtec-d2sa.

KINS dataset

Download the images from the KITTI dataset.

The amodal annotations can be found at the KINS dataset.

COCOA dataset

The COCOA dataset annotation: ftp://guest:GU.205dldo@ftp.softronics.ch/cocoa/COCOA_annotations_detectron.tar.xz

The images of the COCOA dataset are the train2014 and val2014 splits of the COCO dataset. The COCO API (pycocotools) is used for COCO-format data.

Preparation

Set the image path and annotation path in ./detectron2/data/datasets/builtin.py. Note that the entries whose keys end with "visible" are used for visible mask evaluation.
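
As an illustration of the kind of mapping to edit there, a hypothetical sketch is shown below; the actual dict and dataset names in builtin.py may differ, and all paths here are placeholders.

# Hypothetical example of dataset entries in builtin.py:
# dataset name -> (image root, annotation json); all paths are placeholders.
_PREDEFINED_SPLITS_D2SA = {
    "d2sa_train": ("datasets/d2sa/images", "datasets/d2sa/annotations/train.json"),
    "d2sa_val": ("datasets/d2sa/images", "datasets/d2sa/annotations/val.json"),
    # keys ending with "visible" are picked up for visible mask evaluation
    "d2sa_val_visible": ("datasets/d2sa/images", "datasets/d2sa/annotations/val.json"),
}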

Set the output path in the ./detectron2/config/defaults.py:

_C.OUTPUT_DIR = 'your_output_path'

or in the respective .yaml file
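
For example, adding the following line to the config .yaml has the same effect:

OUTPUT_DIR: 'your_output_path'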

Train

cd Amodal-Segmentation-Based-on-Visible-Region-Segmentation-and-Shape-Prior

As an example, we show how to train our method and other baselines on the D2SA dataset.

1. Train our model (ResNet-50 backbone) on the D2SA dataset with the pretrained shape prior auto-encoder and codebook:

python tools/train_net.py --config-file configs/D2SA-AmodalSegmentation/mask_rcnn_R_50_FPN_1x_parallel_CtRef_VAR_SPRef_SPRet_FM.yaml

2. We provide a pretrained auto-encoder (d2sa_recon_net.pth) and codebook (d2sa_codebook.npy). If you want to generate your own pretrained shape prior codebook and auto-encoder, set this in the config .yaml file:

MODEL.ROI_MASK_HEAD.RECON_NET.LOAD_CODEBOOK: False

If you do so, the model will be trained on amodal mask segmentation and amodal mask reconstruction simultaneously. After training, the codebook will be generated.

The input of amodal mask reconstruction has two parts: the ground truth amodal masks and the predicted amodal masks. You can use the parameter MODEL.ROI_MASK_HEAD.RECON_NET.MASK_THS to filter the predicted amodal masks based on their mask IoU; a sketch of this filtering follows below. If you only want to use ground truth masks as the shape prior, set:

MODEL.ROI_MASK_HEAD.RECON_NET.MASK_THS: 1.0
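
For intuition, the sketch below shows the kind of IoU-based filtering this threshold controls (binary masks and the usual mask-IoU definition are assumed here; the actual logic lives in the reconstruction branch of the mask head):

import torch

def filter_predicted_masks(pred_masks, gt_masks, mask_ths):
    # pred_masks, gt_masks: (N, H, W) boolean tensors for matched instances.
    inter = (pred_masks & gt_masks).flatten(1).sum(1).float()
    union = (pred_masks | gt_masks).flatten(1).sum(1).float()
    iou = inter / union.clamp(min=1)
    # Only predictions overlapping the ground truth strongly enough are kept
    # as reconstruction inputs; with mask_ths = 1.0 no prediction passes, so
    # only ground truth masks serve as the shape prior.
    return pred_masks[iou > mask_ths]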

Hint:

Since the .yaml files with "SPRef" (Shape Prior Refinement) in their names use the shape prior during training, you must already have the shape prior codebook and auto-encoder before training. Consequently, the codebook-generation procedure above does not apply to the .yaml files with "SPRef".

We recommend using our pretrained shape prior codebook and auto-encoder directly.

3. Train Mask R-CNN (ResNet-50 backbone) on the D2SA dataset:

python tools/train_net.py --config-file configs/D2SA-AmodalSegmentation/mask_rcnn_R_50_FPN_1x_amodal.yaml

4. Train ORCNN (ResNet-50 backbone) on the D2SA dataset:

python tools/train_net.py --config-file configs/D2SA-AmodalSegmentation/mask_orcnn_R_50_FPN_1x.yaml

Test

If you want to evaluate your saved checkpoints:

python tools/train_net.py --config-file configs/{your_yaml_file} \
    --eval-only MODEL.WEIGHTS {your_OUTPUT_DIR}/model_final.pth
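
If your copy of tools/train_net.py keeps the standard Detectron2 launcher arguments (an assumption; check the script's argument parser), multi-GPU evaluation works the same way:

python tools/train_net.py --config-file configs/{your_yaml_file} --num-gpus 2 \
    --eval-only MODEL.WEIGHTS {your_OUTPUT_DIR}/model_final.pth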

Citation

If you find the code useful in your research, please consider citing the paper. Thanks!

@InProceedings{yuting2021amodal,
author = {Yuting Xiao and Yanyu Xu and Ziming Zhong and Weixin Luo and Jiawei Li and Shenghua Gao},
title = {Amodal Segmentation Based on Visible Region Segmentation and Shape Prior},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2021}
}
