Introduction

This project forms the artifacts for a BSc Dissertation at the University of St Andrews. The report is provided in the repository under DeepLearningForCancerDetectionReport.pdf

Dataset & Running Instructions

The dataset has not been included as it is 6.8 GB. It can be downloaded using the IBM Aspera Connect plugin from the following link: https://wiki.cancerimagingarchive.net/pages/viewpage.action?pageId=101941770

The downloaded dataset should be named “BM_cytomorphology_data” and placed in the submission folder for the following steps.

The submission contains a Dockerfile to create a container containing all the required dependencies. The following commands can be used to build and run the container. The following code should then be all run on the docker container’s command line:

docker build -t model .


docker run -v <replace-with-path-to-the-following-directory>/Deep-Learning-for-Cancer-Detection:/Deep-Learning-for-Cancer-Detection -w /Deep-Learning-for-Cancer-Detection --gpus 1 --shm-size=1g -it -p 8888:8888 --rm model

In the preprocess directory, run the following command to remove the identified corrupted images from the dataset:

python DeleteCorrupted.py

The dataset can then be split into the train, validation, and test subsets by creating 2 directories for the validation and test subsets and running the following command. For the purpose of training and testing the model for execution, the 2 directories should be named “validation” and “test”:

python Split.py

This script will then ask for the train (the BM_cytomorphology_data directory), validation, and test directories to be input.

To create reproducible results, data augmentation was not performed on the fly. To augment images, copy or rename the “BM_cytomorphology_data” directory to “BM_cytomorphology_data_augmented” should be created. The following command can then be run (not this will take a long time):

python AugmentImages.py

To generate explanations to perform the LIME experiment, the following command can be run after creating an explanations/validation directory:

./CreatePerturbations.sh

Before training the optimised model, the following directories should be created. This is used to save the model itself and its training history as a pickle file:

pickle/augmented

The following script can then be run in the docker container’s command line to train the model, note that the expected training directory is “BM_cytomorphology_data_augmented”. Therefore, if the data has not been augmented, it should be renamed to “BM_cytomorphology_data_augmented” anyways.

python OptimisedModel.py

The LIMEResults notebook can be run through JupyterLab in the docker container to produce the LIME experiment results.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
BM_random_search		BM_random_search
LIME_Experiment_Extras		LIME_Experiment_Extras
classification_reports		classification_reports
example_notebooks		example_notebooks
models		models
pickle		pickle
preprocess		preprocess
results		results
tuning_logs		tuning_logs
.gitignore		.gitignore
AugmentImages.py		AugmentImages.py
BasicModel.ipynb		BasicModel.ipynb
ClassificationReportToCSV.py		ClassificationReportToCSV.py
CreatePerturbations.py		CreatePerturbations.py
CreatePerturbations.sh		CreatePerturbations.sh
DeepLearningForCancerDetectionReport.pdf		DeepLearningForCancerDetectionReport.pdf
Dockerfile		Dockerfile
HyperparameterTuning.py		HyperparameterTuning.py
MinimumViableAnalysisModel.ipynb		MinimumViableAnalysisModel.ipynb
OptimisedModel.py		OptimisedModel.py
README.md		README.md
Verify.py		Verify.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Introduction

Dataset & Running Instructions

About

Uh oh!

Releases

Packages

Languages

ejml1/Deep-Learning-for-Cancer-Detection

Folders and files

Latest commit

History

Repository files navigation

Introduction

Dataset & Running Instructions

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages