PointRend-Paddle

PointRend

1 Introduction

Paddle version of Paper“PointRend: Image Segmentation as Rendering(CVPR2020)”.

This project uses Baidu's paddlepaddle framework to reproduce the CVPR2020 paper's model PointRend. Note: only the semantic segmentation experiment of Semantic FPN + PointRend on the cityscapes dataset is done here, excluding the instance segmentation experiment of Maskrcnn + Pointrend. The correctness of PointRend based on paste reproduction is verified.

The project relies on the paddleseg tool.

PointRend With Seg Architecture:

Paper: PointRend: Image Segmentation as Rendering

2 Metrics

Model	mIOU
SemanticFPN+PointRend(paper-Pytorch)	78.5
SemanticFPN+PointRend(ours-Paddle)	78.78

3 Dataset

The dataset is Cityscapes

The size of dataset: There are 19 categories, and 5000 images are of 1024*2048 pixels in width and height
- Training set: 2975 images
- Validation set: 500 images
- Test set: 1525 images

data should be located at data/

data/
├── cityscapes
│   ├── gtFine
│   │   ├── test
│   │   ├── train
│   │   └── val
│   ├── leftImg8bit
│   │   ├── test
│   │   │   ├── berlin
│   │   │   ├── ...
│   │   │   └── munich
│   │   ├── train
│   │   │   ├── aachen
│   │   │   ├── ...
│   │   │   └── zurich
│   │   └── val
│   │       ├── frankfurt
│   │       ├── lindau
│   │       └── munster
│   ├── train.txt
│   ├── val.txt
│   ├── test.txt

.txt format style like as follow:

leftImg8bit/test/mainz/mainz_000001_036412_leftImg8bit.png,gtFine/test/mainz/mainz_000001_036412_gtFine_labelTrainIds.png

which can achieved by use PaddleSeg's create_dataset_list.py(need to clone PaddleSeg from PaddleSeg's git repo firstly):

python PaddleSeg/tools/create_dataset_list.py ./data/cityscapes/ --type cityscapes --separator ","

4 Environment

Hardwares: XPU, GPU, CPU
Framework:
- PaddlePaddle >= 2.0.2

5 Quick Start

The project is developed based on Paddleseg. Except that train.py is modified, other val.py and predict.py are the same as Paddleseg. The model and user-defined loss function definitions are located in the paddleseg/models directory.

install(cmd line)

pip install -r requirements.txt

step1: clone

# clone this repo(Note: maybe need to checout branch after git clone)
git clone git@github.com:CuberrChen/PointRend-Paddle.git

Step2: Training

The training adopts the warmup learning rate strategy opened by default and the momentum optimizer. See line 181 in train.py. If closed, use the policy in .yml .

# V100*4
export CUDA_VISIBLE_DEVICES=0,1,2,3 
python -m paddle.distributed.launch train.py --config configs/pointrendfpn/pointrend_resnet101_os8_cityscapes_512×1024_80k.yml --num_workers=16 --use_vdl --do_eval --save_interval 1000 --save_dir output

# single V100 (I haven't tried it yet, so you need to adjust the learning rate, iters and batchsize according to the specific configuration)

python train.py --config configs/pointrendfpn/pointrend_resnet101_os8_cityscapes_512×1024_80k.yml--num_workers 4 --use_vdl --do_eval --save_interval 1000 --save_dir output --batch_size 4

Step3: Eval

The default path of the pre training model is'output/best_model/model.pdparams'

# eval  
CUDA_VISIBLE_DEVICES=0 
python val.py --config configs/pointrendfpn/pointrend_resnet101_os8_cityscapes_512×1024_80k.yml --model_path output/best_model/model.pdparams

Use Pre-trained Models to Infer

The Pre-trained model is used to test the image, For specific use, please refer to Paddleseg doc

The use example is as follows:

# Use Pre-trained Models to Infer
python predict.py \
       --config configs/pointrendfpn/pointrend_resnet101_os8_cityscapes_512×1024_80k.yml \
       --model_path output/best_model/model.pdparams \
       --image_path data/xxx/JPEGImages/0003.jpg \
       --save_dir output/result

6 Code Structure and Explanation

6.1 Code Structure

├── README.md
├── README_EN.md
├── images/ # save images for README
├── data/ #data path
├── paddleseg/ # paddleseg tool include models/loss definition
├── utils/ # tools
├── lr_scheduler/ # scheduler defined by self
├── output/ # output path
├── run.sh # AIStudio 4 card training shell  
├── ...
├── train.py 
├── eval.py 
└── predict.py

6.2 Parameter Explanation

For specific parameter settings (mainly the modification of config file), please refers to Paddleseg doc

The only thing to note here is that the parameters of warmup are temporarily viewed in train.py.

the parameter setting of the model(You can enter parameter values in the config file) should refer to paddleseg/models/pointrendseg.py. Users need to refer to the paper to know the meaning of this part.

6.3 Training Process

One GPU Training

# single V100 (I haven't tried it yet, so you need to adjust the learning rate, iters and batchsize according to the specific configuration)

python train.py --config configs/pointrendfpn/pointrend_resnet101_os8_cityscapes_512×1024_80k.yml--num_workers 4 --use_vdl --do_eval --save_interval 1000 --save_dir output --batch_size 4

Multiple GPUs Training

# V100*4
export CUDA_VISIBLE_DEVICES=0,1,2,3 
python -m paddle.distributed.launch train.py --config configs/pointrendfpn/pointrend_resnet101_os8_cityscapes_512×1024_80k.yml --num_workers=16 --use_vdl --do_eval --save_interval 1000 --save_dir output

7 Model Information

Please refer to the following list to check other models’ information

Information Name	Description
Announcer	xbchen
Time	2021.08
Framework Version	Paddle 2.0.2
Application Scenario	Image Segmentation
Supported Hardwares	XPU GPU CPU
Download Links	PointRendFPN: code：33h7
Online Running	AIStudio shell
Online Running	AIStudio notebook

8 Customization

Special thanks to the platform and resources provided by Baidu paddle.

SemanticFPN+PointRend Model analysis：

80000 iter,batch_size=16 for 4 GPUs(4 imgs for per gpu),base_lr=0.01 warmup+poly,SemanticFPN+PointRend with ResNet101 's best mIOU=78.78 at Cityscaps VAL dataset. **Note: the reason for adopting this scheme is that the 4 cards 32g environment provided by aistudio allows 1024 × 512 enter the maximum batch_size can't reach 32(paper's setting). If the memory is enough / multiple cards are used, the parameters provided by the author are recommended. The trained model has a link at the bottom. The training code and train_0.log (78.78 miou completed training log can be find in output/) have been uploaded to the repo

Refrence:

Paper Official PyTorch

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PointRend-Paddle

1 Introduction

2 Metrics

3 Dataset

4 Environment

5 Quick Start

install(cmd line)

step1: clone

Step2: Training

Step3: Eval

Use Pre-trained Models to Infer

6 Code Structure and Explanation

6.1 Code Structure

6.2 Parameter Explanation

6.3 Training Process

One GPU Training

Multiple GPUs Training

7 Model Information

8 Customization

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.idea		.idea
configs		configs
images		images
lr_scheduler		lr_scheduler
output		output
paddleseg		paddleseg
utils		utils
LICENSE		LICENSE
README.md		README.md
README_cn.md		README_cn.md
export.py		export.py
predict.py		predict.py
requirements.txt		requirements.txt
run.sh		run.sh
run_eval.sh		run_eval.sh
run_single.sh		run_single.sh
setup.py		setup.py
train.py		train.py
val.py		val.py
复现心得和遇到的问题及对应解决方案.md		复现心得和遇到的问题及对应解决方案.md

License

CuberrChen/PointRend-Paddle

Folders and files

Latest commit

History

Repository files navigation

PointRend-Paddle

1 Introduction

2 Metrics

3 Dataset

4 Environment

5 Quick Start

install(cmd line)

step1: clone

Step2: Training

Step3: Eval

Use Pre-trained Models to Infer

6 Code Structure and Explanation

6.1 Code Structure

6.2 Parameter Explanation

6.3 Training Process

One GPU Training

Multiple GPUs Training

7 Model Information

8 Customization

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages