VcT: Visual change Transformer for Remote Sensing Image Change Detection

Here, we provide the pytorch implementation of the paper. For more information, please see our published paper at IEEE TGRS or arXiv. [IEEE TGRS-2023] VcT: Visual change Transformer for Remote Sensing Image Change Detection, Bo Jiang, Zitian Wang, Xixi Wang, Ziyan Zhang, Lan Chen, Xiao Wang, Bin Luo [arXiv] [IEEE]

Abstract

Given two remote sensing images, the goal of visual change detection task is to detect significantly changed areas between them. Existing visual change detectors usually adopt convolutional neural networks (CNNs) or transformers for feature representation learning and focus on learning effective representation for the changed regions between images. Although good performance can be obtained by enhancing the features of the change regions, however, these works are still limited mainly due to the ignorance of mining the unchanged background context information. It is known that one main challenge for change detection is how to obtain the consistent representations for two images involving different variations, such as spatial variation and sunlight intensity. In this work, we demonstrate that carefully mining the common background information provides an important cue to learn the consistent representations for the two images which thus obviously facilitates the visual change detection problem. Based on this observation, we propose a novel visual change transformer (VcT) model for visual change detection problem. To be specific, a shared backbone network is first used to extract the feature maps for the given image pair. Then, each pixel of feature map is regarded as a graph node and the graph neural network (GNN) is proposed to model the structured information for coarse change map prediction. Top-K reliable tokens can be mined from the map and refined by using the clustering algorithm. Then, these reliable tokens are enhanced by first utilizing self/cross-attention (CA) schemes and then interacting with original features via an anchor-primary attention (APA) learning module. Finally, the prediction head is proposed to get a more accurate change map. Extensive experiments on multiple benchmark datasets validated the effectiveness of our proposed VcT model.

Requirements

Python 3.7
pytorch 1.11.0
einops  0.6.0
torch-scatter 2.0.9
scipy 1.7.3
matplotlib  3.5.3

Train

You can find the training script run_cd.sh. You can run the script file by sh run_cd.sh in the command environment.

The dataset path is modified in data_config.py.

The detailed script file run_cd.sh is as follows:

gpus=0
checkpoint_root=checkpoints 
data_name=LEVIR  # dataset name 

img_size=256
batch_size=8
lr=0.01
max_epochs=200  #training epochs
net_G=Reliable_transformer # model name
lr_policy=linear

split=train  # training txt
split_val=val  #validation txt
project_name=CD_${net_G}_${data_name}_b${batch_size}_lr${lr}_${split}_${split_val}_${max_epochs}_${lr_policy}

python main_cd.py --img_size ${img_size} --checkpoint_root ${checkpoint_root} --lr_policy ${lr_policy} --split ${split} --split_val ${split_val} --net_G ${net_G} --gpu_ids ${gpus} --max_epochs ${max_epochs} --project_name ${project_name} --batch_size ${batch_size} --data_name ${data_name}  --lr ${lr}

Evaluate

Checkpoints of our model can be downloaded from: [DropBox] or [Baiduyun (passward: AHUE)]

You can find the evaluation script eval.sh. You can run the script file by sh eval.sh in the command environment.

The detailed script file eval.sh is as follows:

gpus=0
data_name=LEVIR # dataset name
net_G=Reliable_transformer # model name 
split=test # test.txt
project_name=VcT_LEVIR # the name of the subfolder in the checkpoints folder 
checkpoint_name=best_ckpt.pt # the name of evaluated model file 

python eval_cd.py --split ${split} --net_G ${net_G} --checkpoint_name ${checkpoint_name} --gpu_ids ${gpus} --project_name ${project_name} --data_name ${data_name}

Dataset Preparation

Data structure

"""
Change detection data set with pixel-level binary labels；
├─A
├─B
├─label
└─list
"""

A: images of t1 phase;

B:images of t2 phase;

label: label maps;

list: contains train.txt, val.txt and test.txt, each file records the image names (XXX.png) in the change detection dataset.

Our Processed Dataset Download

Download method #1

LEVIR-CD (2.3GB): [DropBox]
WHU-CD (1.82GB): [DropBox]
DSIFN-CD (3.38GB): [DropBox]

Download method #2

BaiduYun: [Baiduyun (passward: AHUE)]

License

Code is released for non-commercial and research purposes only. For commercial purposes, please contact the authors.

Citation

If you use this code for your research, please cite our paper:

@article{jiang2023vct,
  title={VcT: Visual change Transformer for Remote Sensing Image Change Detection},
  author={Jiang, Bo and Wang, Zitian and Wang, Xixi and Zhang, Ziyan and Chen, Lan and Wang, Xiao and Luo, Bin},
  journal={arXiv preprint arXiv:2310.11417},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
VcT_code		VcT_code
figures		figures
README.md		README.md
VcT_Highly Cited Papers.png		VcT_Highly Cited Papers.png
changeDet_Paper_List.md		changeDet_Paper_List.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VcT: Visual change Transformer for Remote Sensing Image Change Detection

Abstract

Requirements

Train

Evaluate

Dataset Preparation

Data structure

Our Processed Dataset Download

License

Citation

About

Uh oh!

Releases

Packages

Languages

Event-AHU/VcT_Remote_Sensing_Change_Detection

Folders and files

Latest commit

History

Repository files navigation

VcT: Visual change Transformer for Remote Sensing Image Change Detection

Abstract

Requirements

Train

Evaluate

Dataset Preparation

Data structure

Our Processed Dataset Download

License

Citation

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages