Learning Integral Objects with Intra-Class Discriminator for Weakly-Supervised Semantic Segmentation
Learning Integral Objects with Intra-Class Discriminator for Weakly-Supervised Semantic Segmentation, Junsong Fan, Zhaoxiang Zhang, Chunfeng Song, Tieniu Tan, CVPR2020 [paper].
The previous class activation map (CAM) based approach learns inter-class boundaries, which focus on the difference between different foreground classes. It may not be optimal for the weakly supervised segmentation problem because the target object and the background share the same classes in the same image. To alleviate this problem, we propose an ICD approach to learning per-category intra-class boundaries between the foreground objects and the background, which is more appropriate for weakly supervised segmentation problems.
The proposed approach is end-to-end and can be trained together with the CAM branch in a single round. The pseudo-masks derived by the proposed ICD is more complete than the CAM based results.
- Python 3.7, MXNet 1.3.1, Numpy, OpenCV, pydensecrf
- NVIDIA GPU
Prepare the data and pretrained parameters:
- Download the VOC 2012 dataset, which should contain
VOC2012/JPEGImages
andVOC2012/Annotations
. - Download the ImageNet pretrained parameters for the VGG16 backbone and put it in the folder
ICD/data/pretrained
- Download the pre-computed superpixels here, untar it, and put it into the folder
ICD/data/superpixels
. You can also generate them by yourself, to do this, please follow the following commands:
cd external/superpixel
make -j8 ## relies on OpenCV, edit Makefile if necessary.
./makeSuperpixel ../../data/VOC2012/train_aug.txt Your/VOC2012/JPEGImages ./data/VOC 75
cd ../..
python generate_superpixel.py
Train ICD and generate pseudo-masks:
python run_icd.py --gpus 0,1 --image-root Your/VOC2012/JPEGImages --annotation-root Your/VOC2012/Annotations --superpixel-root ./data/superpixels/voc_superpixels
The above script contains codes for training and generating the seeds. By default, the seeds will be generated at ./snapshot/icd/results/seeds/crf
. After obtaining the seeds, you can take them as pseudo labels and train any segmentation models on the training set, for example, the DeepLab-v2 used in the paper. Codes to train the Deeplab-v2 models will also be released soon.
Generate seeds with saliency maps:
To generate seeds with the saliency, you need pre-computed saliency maps firstly. We adopt the same saliency maps used by the OAA project, and the saliency maps can be downloaded here. To generate seeds with saliency maps, use the --use-sal
flag, e.g.,
python run_icd.py --image-root Your/VOC2012/JPEGImages --annotation-root Your/VOC2012/Annotations --use-sal --only-gen-seeds
Seeds will be placed in ./snapshot/icd/results/seeds/sal
by default.
@InProceedings{Fan_2020_CVPR,
author = {Fan, Junsong and Zhang, Zhaoxiang and Song, Chunfeng and Tan, Tieniu},
title = {Learning Integral Objects With Intra-Class Discriminator for Weakly-Supervised Semantic Segmentation},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2020}
}