Skip to content

Latest commit

 

History

History
46 lines (23 loc) · 1.85 KB

README.md

File metadata and controls

46 lines (23 loc) · 1.85 KB

Explainable Ensemble Learning for Alzheimer's Disease Diagnosis

This project aims to develop an explainable multi modality machine learning tool for the diagnosis of the Alzheimer's Disease. Data for this project was collect from the ADNI initiative (http://adni.loni.usc.edu).

We use two types of data:

  1. Tabular data containing neuropsychological exams of a patient
  2. Tabular data containing demographic information of a patient
  3. 3D images of Magnetic Resonance Scans of the brain of a patient

We integrate each data type through separate preprocessing and machine learning pipelines, and join all of them in a final classifier. The name ensemble learning is due to the fact that more than one classifier is used to make intermediate and final predictions.

We provide explanations for predictions at patient level (local explanations) and at the population level (global explanation).

Steps to Run Experiments:

  1. Download ADNIMERGE.csv file from http://adni.loni.usc.edu.

  2. Preprocess ADNIMERGE.csv file.

    Run cognitive_tests_preprocessing.py. Outputfile will be COGNITIVE_DATA_PROCESSED.csv

  3. Preprocess metadata from potential MRIs and ADNIMERGE.csv (metadata_preprocessing.py)

    Run metadata_preprocessing.py and outputfile will indicate the right MRIs files to download and preprocess.

  1. Download MRIs from selected classes and subjects at http://adni.loni.usc.edu. MACRO_GROUPs used were AD,CN and MCI.

  2. Preprocess MRIs (mri_preprocess.py).

  3. Process MRIs (3D to 2D+Augmentation - mri_preparation).

  4. Train/Validate/Test CNNs generating prediction probabilities (mri_train.py).

  5. Train/Validate/Test ML models with cognitive tests (cognitive_train.py).

  6. Train/Validate/Test Ensemble models with cognitive tests (ensemble_train.py).

  7. Results Report