This is the official PyTorch implementation of our CVPR 2024 paper "Continual Self-supervised Learning: Towards Universal Multi-modal Medical Data Representation Learning".
- CUDA 11.5
- Python 3.8
- PyTorch 1.11.0
- cuDNN 8.3.2.44
- Download the MIMIC-CXR dataset.
- Download the DeepLesion dataset.
- Download the ADNI dataset.
- Download seven TCGA datasets (TCGA-THYM, TCGA-THCA, TCGA-BRCA, TCGA-UCEC, TCGA-UVM, TCGA-OV, and TCGA-MESO).
- Report: Follow MGCA's procedure to pre-process the MIMIC-CXR dataset.
- X-ray: Use `Preprocess/MIMIC_CXR_JPG_Preprocess.py` to pre-process the MIMIC-CXR dataset.
- CT:
  - Use `Preprocess/DL_save_nifti.py` (from the downloaded files) to convert the PNG images to nii.gz format.
  - Use `Preprocess/re_spacing_ITK.py` to resample the CT volumes.
  - Use `Preprocess/splitting_to_patches.py` to extract about 125k sub-volumes; the pre-processed dataset will be saved in `DL_patches_v2/`.
  - Use `Preprocess/DeepLesion_Resize.py` to resize the images.
- MRI:
  - Use `Preprocess/ADNI_Resize.py` to resize the images.
  - Use `Preprocess/ADNI_split_slice.py` to extract about 59k sub-volumes.
- Pathological imaging: Use `Preprocess/TCGA_Preprocess.py` to pre-process the seven TCGA datasets.
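The sub-volume extraction steps above (about 125k CT and 59k MRI sub-volumes) amount to slicing each 3D scan into fixed-depth chunks. A minimal NumPy sketch, assuming a patch depth and stride of 16 slices; the actual settings used by `Preprocess/splitting_to_patches.py` and `Preprocess/ADNI_split_slice.py` may differ:

```python
import numpy as np

def split_into_subvolumes(volume, depth=16, stride=16):
    """Split a (D, H, W) volume into fixed-depth sub-volumes along the
    axial axis. `depth` and `stride` are illustrative values, not the
    repository's actual configuration."""
    subvolumes = []
    for start in range(0, volume.shape[0] - depth + 1, stride):
        subvolumes.append(volume[start:start + depth])
    return subvolumes

# Example: a 64-slice volume yields four non-overlapping 16-slice patches.
ct = np.zeros((64, 256, 256), dtype=np.float32)
patches = split_into_subvolumes(ct)
print(len(patches))  # 4
```

With `stride < depth` the same helper produces overlapping patches, which increases the number of pre-training samples at the cost of disk space.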
- Download `uni-perceiver-base-L12-H768-224size-torch-pretrained.pth`.
- Run `sh run_ssl.sh` for training (4 GPUs with 24 GB memory each). Before running it, modify the relevant paths in the script.
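When initializing from the downloaded Uni-Perceiver checkpoint, entries whose names or shapes do not match the target model (e.g. a classification head) typically need to be filtered out before loading. A hedged sketch of such filtering; the function name is illustrative and not part of this repository, and plain NumPy arrays stand in for the tensors that `torch.load(...)` and `model.state_dict()` would return:

```python
import numpy as np  # stands in for torch tensors in this sketch

def filter_state_dict(checkpoint, model_state):
    """Keep only checkpoint entries whose key and shape match the target
    model, so the remainder can be passed to
    model.load_state_dict(..., strict=False)."""
    ckpt = checkpoint.get("model", checkpoint)  # some checkpoints nest weights
    return {k: v for k, v in ckpt.items()
            if k in model_state and v.shape == model_state[k].shape}

# Example: a mismatched classification head is dropped automatically.
ckpt = {"encoder.weight": np.zeros((768, 768)), "head.weight": np.zeros((1000, 768))}
model = {"encoder.weight": np.zeros((768, 768)), "head.weight": np.zeros((10, 768))}
print(sorted(filter_state_dict(ckpt, model)))  # ['encoder.weight']
```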
- Dataset Links
- Pre-processing Code
- Pre-training Code Release
- Pre-trained Model
- Fine-tuning Code Release
If this code is helpful for your study, please cite:
@inproceedings{ye2024medcoss,
  title={Continual Self-supervised Learning: Towards Universal Multi-modal Medical Data Representation Learning},
  author={Ye, Yiwen and Xie, Yutong and Zhang, Jianpeng and Chen, Ziyang and Wu, Qi and Xia, Yong},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={},
  year={2024},
}
The whole framework is based on MAE, Uni-Perceiver, and MGCA.
Yiwen Ye (ywye@mail.nwpu.edu.cn)