This repository consists of an attempt to detect and diagnose Alzheimer's using 3D MRI T1 weighted scans from the ADNI database.It contains a data preprocessing pipeline to make the data suitable for feeding to a 3D Convnet or Voxnet followed by a Deep Neural Network definition and an exploration into all the utilities that could be required for such a task.
- Python3 (Os,Matplotlib,Panda,Numpy)
- Scikit-Image
- Nibabel
- Nipype
- FSL(FMRIB Software Library v5.0)
- Tensorflow
- Tflearn
The ideal form in which the brain MRI image data can be sent for training is when it is skull-stripped, resized to a common size and labeled for all the different labels in the classification task.
The first step is to load the data into Numpy arrays for futrher manipulation.The python library Nibabel has been used to access the MRI scan using its image object.The data attribute of this object is used to acquire the image in the form of a numpy array.
FSL is a library used for analysis and manipulation of MRI brain imaging data.Nipype provides an interface to use the FSL library via python code.Thus this is used to skull strip the images in the given code.The frac parameter is used to pass values for the fractional intensity threshold.A smaller value will give a much better estimate of the brain at the cost of lesser stripping.A slice of a sample image has been displayed after skull stripping it with different frac values has been given below
An approximation of the amount of grey matter,white matter and CSF(Cerebrospinal Fluid) can be found using Multi Otsu Histogram Thresholding where the maximum variance is given by the formula:
There's no predefined function for this in python packages therefore the code for it has been written firsthand in python. A sample of the thresholds derived in a skull stripped images are given in the histogram below.
This is a form of global thresholding however better approximations can be made using adaptive and dynamic forms of thresholding where parts of the image are segmented at a time.
All the images are resized to the same dimensions using the predefined skimage transform.resize function. The values of all the pixels are then normalised so that faster training would occur after which the images are grouped with their labels and saved as numpy objects.
A 3D convolutional neural network has been defined using Tflearn which basically serves to provide wrapper functions for the tensorflow framework thus making it easier to create the network.The network uses mini batch gradient descent with batch normalisation for each activation layer.It uses dropout and L2 regularisation to tackle high variance and is optimised by the adam optimiser.
Training a 3D CNN for an end to end task like this is practically possible yet extremely difficult. Through such a procedure if the CNN is very Deep it's likely to overfit and if it's too shallow it's likely to underfit. This is largely due to the scarcity of the data as well as complexity of the problem. However the most successful approach has been to train the network on an 3D autoencoder as mentioned in thes paper: Predicting Alzheimer’s disease: a neuroimaging study with 3D convolutional neural networks.All this has been done on an 8GB RAM CPU laptop along with google colaboratory for network training. Anyhow this project cannot move further due to lack of computational resources and is thus at a standstill for a time in the future.