This is a PyTorch implementation of Foveation for Segmentation of Ultra-High Resolution Images, which extends our prior publication at MICCAI 2020.
- Updates
- Introduction
- Foveation Module
- Citation
- Install dependencies
- Code Summary
- Direct-Executable
- Deploy-Your-Model
- Deploy-Foveation-Module-Locally
- Performance
- Acknowledgements
- MICCAI 2020 talk on YouTube is available HERE
- The pre-processed Gleason2019 dataset is shared HERE
- We use configuration files to store most options that were previously passed through the argument parser. The definitions of the options are detailed in config/defaults.py (TO-BE-UPDATED); a rough usage sketch is given below.
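As a rough sketch of how these options are consumed (assuming a yacs-style CfgNode layout, as in the upstream semantic-segmentation-pytorch codebase; verify the exact key names against config/defaults.py):

```python
# Assumed usage of the config object -- key names below are illustrative,
# check config/defaults.py for the definitive list of options.
from config import cfg

cfg.merge_from_file("config/foveation-cityscape-hrnetv2.yaml")  # load a provided config file
cfg.merge_from_list(["TRAIN.num_epoch", 10])                    # override individual options
print(cfg.TRAIN.num_epoch)
```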
We demonstrate that the trade-off between field of view (FoV) and resolution affects segmentation performance on ultra-high resolution images, and that its influence varies spatially according to local patterns in different areas. On three publicly available high-resolution image datasets, we show that our proposed Foveation Module combines the strengths of models trained with patches of different fixed FoV/resolution trade-offs and improves segmentation performance. Below is a validation segmentation example on Cityscapes comparing ours against two fixed trade-off configurations:
The Foveation Module is a learnable dataloader which, for a given ultra-high resolution image, adaptively chooses the appropriate configuration (FoV/resolution trade-off) of the input patch to feed to the downstream segmentation model at each spatial location of the image, as illustrated below:
We demonstrate that our Foveation Module can effectively learn the spatial distribution of the FoV/resolution trade-off, as illustrated below:
Conceptually, the Foveation Module has the effect of adjusting the patch-wise resolution at each location to an optimal state for the downstream task (segmentation in this case), as illustrated below:
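To make the trade-off concrete, the following illustrative PyTorch snippet (not part of the repository; the image, location, and patch sizes are made up) shows how two patches with the same tensor size can carry either a wide FoV at low effective resolution or a narrow FoV at native resolution:

```python
import torch
import torch.nn.functional as F

# Illustrative only: a fake single-channel "ultra-high resolution" image.
image = torch.rand(1, 1, 2048, 2048)
cy, cx = 1024, 1024          # a spatial location of interest
patch_size = 256             # fixed patch size fed to the segmentation model

def extract_patch(img, cy, cx, fov):
    """Crop a (fov x fov) window centred at (cy, cx) and resize it to patch_size.
    Large fov -> wide context but downsampled; small fov -> narrow context at full resolution."""
    half = fov // 2
    crop = img[..., cy - half:cy + half, cx - half:cx + half]
    return F.interpolate(crop, size=(patch_size, patch_size),
                         mode='bilinear', align_corners=False)

wide_low_res    = extract_patch(image, cy, cx, fov=1024)  # large FoV / low resolution
narrow_full_res = extract_patch(image, cy, cx, fov=256)   # small FoV / native resolution
# The Foveation Module learns, per location, which such configuration to feed downstream.
```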
If you use this code for your research, please cite our paper:
@misc{jin2020foveation,
  title={Foveation for Segmentation of Ultra-High Resolution Images},
  author={Chen Jin and Ryutaro Tanno and Moucheng Xu and Thomy Mertzanidou and Daniel C. Alexander},
  year={2020},
  eprint={2007.15124},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}
@inproceedings{jin2020foveation,
  title={Foveation for Segmentation of Mega-Pixel Histology Images},
  author={Jin, Chen and Tanno, Ryutaro and Xu, Moucheng and Mertzanidou, Thomy and Alexander, Daniel C},
  booktitle={International Conference on Medical Image Computing and Computer-Assisted Intervention},
  pages={561--571},
  year={2020},
  organization={Springer}
}
Install dependencies with one of the following options:
- Method 1: Pip installation:
python -m pip install -r requirements.txt
- Method 2: Conda installation:
conda env create -f environment.yml
conda activate Fov_Seg
This code was tested with Python 3.7, PyTorch 1.2, and CUDA 11.0.
We provide three ways for users to easily run/deploy Foveation Segmentation: Direct-Executable, Deploy-Your-Model, and Deploy-Foveation-Module-Locally.
- Direct-Executable: the full code is directly executable to reproduce the paper results, and is also suitable for running large-scale experiments on customized data in a multi-GPU environment, with Synchronized Batch Normalization deployed.
- Deploy-Your-Model: our Foveation Module can be attached to any segmentation model. Our framework includes a set of state-of-the-art segmentation models (PSPNet/UPerNet/Unet/HRNet) to choose from. It is also straightforward to deploy a personalized segmentation model in this framework; instructions are given.
- Deploy-Foveation-Module-Locally: a Simple-Demo segmentation example to illustrate how to implement the Foveation Module in your own code.
- Download example ultra-high resolution datasets from the following table, as used in the paper, or prepare ultra-high resolution images of your choice (e.g. medical/aerial/camera-array/material images) for multi-class segmentation.
Dataset (with Link) | Content | Resolution (pixels) | Number of Classes |
---|---|---|---|
Cityscapes | urban scenes | 2048x1024 | 19 |
DeepGlobe | aerial scenes | 2448x2448 | 6 |
Gleason2019 | histopathological | 5000x5000 | 4 |
- Your directory tree should look like this:
$SEG_ROOT/data
├── cityscapes
│ ├── annotations
│ │ ├── testing
│ │ ├── training
│ │ └── validation
│ └── images
│ ├── testing
│ ├── training
│ └── validation
- Prepare the data list files following the .odgt files in ./data/Cityscape/. A script ./data/write_MIT_format.m is provided to generate .odgt files; the PATH and suffix need to be customized. An assumed example of the .odgt format is sketched below.
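For reference, each line of an .odgt file is a JSON record pointing to an image/annotation pair, following the convention of the upstream semantic-segmentation-pytorch repository; the field names, paths, and sizes below are an assumed example, so check the provided .odgt files in ./data/Cityscape/ for the exact format:

```python
# Assumed .odgt convention (one JSON object per line) -- verify against
# the files shipped in ./data/Cityscape/ before generating your own.
import json

records = [
    {"fpath_img": "images/training/example_01.png",
     "fpath_segm": "annotations/training/example_01.png",
     "width": 2048, "height": 1024},
]
with open("train_example.odgt", "w") as f:
    for r in records:
        f.write(json.dumps(r) + "\n")
```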
Train a model by selecting the GPUs ($GPUS) and the configuration file ($CFG) to use. During training, checkpoints are by default saved in the ckpt folder.
python3 train_fove.py --gpus $GPUS --cfg $CFG
- To choose which GPUs to use, you can specify either --gpus 0-7 or --gpus 0,2,4,6.
- You can start with our provided configurations (Cityscapes example):
python3 train_fove.py --gpus GPUS --cfg config/foveation-cityscape-hrnetv2.yaml
- Alternatively, you can quick-start with a provided bash script:
bash quick_start_bash/cityscape_gumbel_softmax_1_10th.sh
- You can also override options on the command line, for example:
python3 train_fove.py TRAIN.num_epoch 10
- Evaluate a trained model on the validation set: run eval_multipro.py with the VAL.* arguments configured.
- You can start with our provided configurations:
python3 eval_multipro.py --gpus GPUS --cfg config/foveation-cityscape-hrnetv2.yaml
- Alternatively, you can quick-start with a provided bash script:
bash quick_start_bash/deepglob_argmax_st_1_10th_eval.sh
- The segmentation models in our framework are split into an encoder and a decoder, where encoders are usually modified directly from classification networks and decoders consist of final convolutions and upsampling. The following architectures are supported:
Encoder | Decoder |
---|---|
MobileNetV2dilated | C1 (one convolution module) |
ResNet18/ResNet18dilated | C1_deepsup (C1 + deep supervision trick) |
ResNet50/ResNet50dilated | Pyramid Pooling Module |
ResNet101/ResNet101dilated | PPM_deepsup (PPM + deep supervision trick) |
Unet | UPerNet (Pyramid Pooling + FPN head) |
HRNetV2 (W48) | - |
- To deploy your personalized model, first prepare your model following models/attention_u_net.py and put it under models/;
- Then simply add your model to the build_encoder and build_decoder functions in models/models.py, as sketched below.
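A minimal sketch of what that registration might look like, assuming build_encoder dispatches on an architecture-name string (the actual signature and existing branches in models/models.py may differ; MyEncoder and models/my_encoder.py are hypothetical placeholders for your own model):

```python
# Hypothetical sketch -- the real build_encoder/build_decoder in models/models.py
# may have different signatures and more branches. MyEncoder stands in for the
# model file you added under models/.
from models.my_encoder import MyEncoder  # hypothetical: your new encoder

def build_encoder(arch='my_encoder', fc_dim=2048):
    if arch == 'my_encoder':
        return MyEncoder(fc_dim=fc_dim)      # new branch registering your model
    raise Exception('Architecture undefined: ' + arch)
```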
A Simple-Demo segmentation example is given as follows:
- The dataset for the Simple-Demo is prepared at data/CITYSCAPE_single_example
- Quick-test the Simple-Demo with: python3 train_sim_demo.py
- Implement the Foveation Module in your own code following train_sim_demo.py, which includes:
- import our libs:
from config import cfg
from dataset import TrainDataset, b_imresize
from models import ModelBuilder, SegmentationModule, FovSegmentationModule
from train_fove import checkpoint_last, train, create_optimizers
from eval import eval_during_train
- add the Foveation Module to the network builder, training, and evaluation (i.e. the lines following if cfg.MODEL.foveation: in train_sim_demo.py); a conceptual sketch is given below
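For intuition only, the following self-contained sketch (not the repository's implementation; all layer sizes and names are made up) shows the core idea behind such a module: a small network predicts, per spatial location, a categorical distribution over K candidate FoV/resolution configurations and samples from it with Gumbel-Softmax so that the discrete choice stays differentiable (cf. the Gumbel-Softmax variant "Ours-GSM" below):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyFoveationModule(nn.Module):
    """Toy illustration: predict a soft choice over K patch configurations
    at every spatial location of a downsampled view of the image."""
    def __init__(self, in_channels=3, num_configs=5):
        super().__init__()
        self.score = nn.Sequential(
            nn.Conv2d(in_channels, 16, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(16, num_configs, 1),  # one logit per candidate configuration
        )

    def forward(self, low_res_image, tau=1.0, hard=False):
        logits = self.score(low_res_image)                    # (B, K, H, W)
        # Gumbel-Softmax keeps the (approximately) discrete selection differentiable.
        return F.gumbel_softmax(logits, tau=tau, hard=hard, dim=1)

# Usage: per-location weights over 5 candidate FoV/resolution trade-offs.
fov_module = ToyFoveationModule()
weights = fov_module(torch.rand(1, 3, 64, 128), hard=True)
print(weights.shape)  # torch.Size([1, 5, 64, 128])
```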
- Segmentation performance measured in IoU/mIoU (%) on Cityscapes. For class names: All refers to the average over all classes, and the single-class codes are as follows: Ro.-road, Sw.-sidewalk, Bu.-building, W.-wall, F.-fence, P.-pole, T.L.-traffic light, R.S.-traffic sign, V.-vegetation, Te.-terrain, Sk.-sky, P.-person, Ri.-rider, C.-car, Tru.-truck, Bus-bus, Tra.-train, M.-motorcycle, Bi.-bicycle.
Class | All | Ro. | Sw. | Bu. | W. | F. | P. | T.L. | R.S. | V. | Link |
---|---|---|---|---|---|---|---|---|---|---|---|
Baseline-Patch 1 | 67.7 | 97.6 | 80.8 | 89.8 | 49.7 | 51.4 | 53.9 | 39.0 | 63.3 | 89.6 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 2 | 68.0 | 97.3 | 81.0 | 89.5 | 41.9 | 52.2 | 51.5 | 32.9 | 66.0 | 89.2 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 3 | 69.8 | 97.5 | 81.2 | 90.3 | 47.0 | 55.8 | 54.2 | 46.8 | 68.6 | 90.0 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 4 | 70.6 | 97.0 | 82.1 | 89.8 | 47.3 | 53.4 | 57.6 | 56.9 | 72.1 | 90.3 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 5 | 70.2 | 95.0 | 84.4 | 90.3 | 45.6 | 56.2 | 63.4 | 61.4 | 76.5 | 91.4 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Random | 50.3 | 94.1 | 68.1 | 80.7 | 36.5 | 40.2 | 18.8 | 13.3 | 31.8 | 79.6 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Average | 56.6 | 96.3 | 73.6 | 85.7 | 31.6 | 40.4 | 39.0 | 19.8 | 49.8 | 86.5 | OneDrive/BaiduYun(Access Code:123) |
Ensemble | 72.5 | 97.7 | 83.8 | 91.1 | 51.7 | 58.2 | 58.9 | 50.9 | 72.6 | 90.9 | OneDrive/BaiduYun(Access Code:123) |
Ours-Mean | 76.1 | 98.2 | 84.9 | 92.3 | 55.2 | 61.4 | 64.5 | 71.4 | 80.1 | 92.5 | OneDrive/BaiduYun(Access Code:123) |
Ours-Mode | 71.4 | 97.5 | 80.8 | 90.8 | 45.4 | 55.3 | 55.6 | 56.5 | 67.7 | 90.7 | OneDrive/BaiduYun(Access Code:123) |
Ours-GSM | 70.5 | 97.7 | 82.2 | 89.1 | 44.8 | 55.4 | 64.5 | 66.1 | 76.2 | 91.8 | OneDrive/BaiduYun(Access Code:123) |
Class | Te. | Sk. | P. | Ri. | C. | Tru. | Bus | Tra. | M. | Bi. | Link |
---|---|---|---|---|---|---|---|---|---|---|---|
Baseline-Patch 1 | 59.4 | 90.4 | 73.9 | 51.6 | 91.9 | 54.9 | 74.0 | 61.3 | 45.3 | 69.1 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 2 | 55.9 | 90.1 | 75.3 | 55.4 | 92.0 | 54.1 | 80.2 | 74.8 | 43.7 | 69.8 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 3 | 57.4 | 88.8 | 77.1 | 56.9 | 92.8 | 57.4 | 76.8 | 63.6 | 51.5 | 72.5 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 4 | 56.7 | 85.6 | 79.0 | 60.0 | 92.8 | 54.0 | 78.2 | 58.1 | 56.8 | 74.2 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 5 | 59.9 | 68.9 | 81.2 | 58.0 | 93.2 | 53.0 | 69.6 | 50.6 | 59.3 | 76.8 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Random | 50.7 | 58.4 | 53.8 | 34.4 | 79.1 | 49.3 | 46.3 | 37.8 | 31.2 | 51.6 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Average | 47.5 | 89.3 | 65.7 | 43.6 | 89.0 | 47.6 | 45.7 | 38.6 | 23.2 | 61.6 | OneDrive/BaiduYun(Access Code:123) |
Ensemble | 60.9 | 90.9 | 79.4 | 60.1 | 93.2 | 57.1 | 79.6 | 69.0 | 56.4 | 74.8 | OneDrive/BaiduYun(Access Code:123) |
Ours-Mean | 64.2 | 94.1 | 82.8 | 64.0 | 94.5 | 63.6 | 81.0 | 63.3 | 59.2 | 78.2 | OneDrive/BaiduYun(Access Code:123) |
Ours-Mode | 60.8 | 92.1 | 75.5 | 55.8 | 93.1 | 66.0 | 76.4 | 69.9 | 54.5 | 71.6 | OneDrive/BaiduYun(Access Code:123) |
Ours-GSM | 62.1 | 75.1 | 80.8 | 60.8 | 92.8 | 53.4 | 74.8 | 42.6 | 52.9 | 76.6 | OneDrive/BaiduYun(Access Code:123) |
- Segmentation performance measured in IoU/mIoU (%) on DeepGlobe. For class names: All refers to the average over all classes, and the single-class codes are as follows: U.-Urban, A.-Agriculture, R.-Rangeland, F.-Forest, W.-Water, B.-Barren.
Class | All | U. | A. | R. | F. | W. | B. | Link |
---|---|---|---|---|---|---|---|---|
Baseline-Patch 1 | 73.8 | 79.4 | 88.2 | 45.1 | 77.2 | 87.4 | 65.4 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 2 | 73.4 | 79.0 | 88.4 | 43.6 | 77.7 | 85.6 | 66.2 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 3 | 56.4 | 57.0 | 81.4 | 27.8 | 62.3 | 69.1 | 40.7 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 4 | 72.3 | 77.2 | 87.8 | 41.8 | 78.5 | 84.2 | 64.2 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 5 | 71.3 | 76.9 | 87.0 | 39.6 | 79.1 | 83.0 | 62.2 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Random | 46.4 | 56.3 | 77.0 | 20.1 | 41.0 | 45.8 | 38.5 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Average | 73.5 | 78.4 | 88.1 | 45.0 | 80.8 | 82.3 | 66.7 | OneDrive/BaiduYun(Access Code:123) |
Ensemble | 74.3 | 79.4 | 88.7 | 44.5 | 80.0 | 86.3 | 66.8 | OneDrive/BaiduYun(Access Code:123) |
Ours-Mean | 74.0 | 79.5 | 88.7 | 45.2 | 77.6 | 86.4 | 66.6 | OneDrive/BaiduYun(Access Code:123) |
Ours-Mode | 73.2 | 77.5 | 88.2 | 43.4 | 79.9 | 84.4 | 65.5 | OneDrive/BaiduYun(Access Code:123) |
Ours-GSM | 73.8 | 78.4 | 88.5 | 44.7 | 80.1 | 84.4 | 66.8 | OneDrive/BaiduYun(Access Code:123) |
GLnet | 71.6 | - | - | - | - | - | - |
- Segmentation performance measured in IoU/mIoU (%) on Gleason2019.
Class | All | Benign | Grade 3 | Grade 4 | Link |
---|---|---|---|---|---|
Baseline-Patch 1 | 69.3 | 81.0 | 61.8 | 65.0 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 2 | 70.1 | 80.5 | 64.4 | 65.3 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 3 | 70.7 | 81.0 | 64.9 | 66.1 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 4 | 67.3 | 76.4 | 61.4 | 64.0 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Patch 5 | 61.4 | 67.5 | 55.6 | 61.0 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Random | 64.9 | 72.2 | 58.6 | 63.8 | OneDrive/BaiduYun(Access Code:123) |
Baseline-Average | 65.7 | 78.8 | 52.2 | 66.0 | OneDrive/BaiduYun(Access Code:123) |
Ensemble | 71.0 | 80.8 | 65.4 | 66.9 | - |
Ours-Mean | 71.1 | 82.4 | 63.0 | 67.8 | OneDrive/BaiduYun(Access Code:123) |
Ours-Mode | 69.5 | 81.6 | 61.3 | 65.6 | OneDrive/BaiduYun(Access Code:123) |
Ours-GSM | 67.4 | 79.4 | 58.3 | 64.4 | OneDrive/BaiduYun(Access Code:123) |
Clinical experts | 66.9 | 83.9 | 56.4 | 60.3 | - |
- We thank CSAILVision; our code structure is derived from their repository semantic-segmentation-pytorch.