| Literature DB >> 28119818 |
Lauge Sørensen1, Christian Igel2, Akshay Pai1, Ioana Balas2, Cecilie Anker3, Martin Lillholm1, Mads Nielsen1.
Abstract
This paper presents a brain T1-weighted structural magnetic resonance imaging (MRI) biomarker that combines several individual MRI biomarkers (cortical thickness measurements, volumetric measurements, hippocampal shape, and hippocampal texture). The method was developed, trained, and evaluated using two publicly available reference datasets: a standardized dataset from the Alzheimer's Disease Neuroimaging Initiative (ADNI) and the imaging arm of the Australian Imaging Biomarkers and Lifestyle flagship study of ageing (AIBL). In addition, the method was evaluated by participation in the Computer-Aided Diagnosis of Dementia (CADDementia) challenge. Cross-validation using ADNI and AIBL data resulted in a multi-class classification accuracy of 62.7% for the discrimination of healthy normal controls (NC), subjects with mild cognitive impairment (MCI), and patients with Alzheimer's disease (AD). This performance generalized to the CADDementia challenge where the method, trained using the ADNI and AIBL data, achieved a classification accuracy 63.0%. The obtained classification accuracy resulted in a first place in the challenge, and the method was significantly better (McNemar's test) than the bottom 24 methods out of the total of 29 methods contributed by 15 different teams in the challenge. The method was further investigated with learning curve and feature selection experiments using ADNI and AIBL data. The learning curve experiments suggested that neither more training data nor a more complex classifier would have improved the obtained results. The feature selection experiment showed that both common and uncommon individual MRI biomarkers contributed to the performance; hippocampal volume, ventricular volume, hippocampal texture, and parietal lobe thickness were the most important. This study highlights the need for both subtle, localized measurements and global measurements in order to discriminate NC, MCI, and AD simultaneously based on a single structural MRI scan. It is likely that additional non-structural MRI features are needed to further improve the obtained performance, especially to improve the discrimination between NC and MCI.Entities:
Keywords: Alzheimer's disease; Biomarker; Classification; Machine learning; Mild cognitive impairment; Structural MRI
Mesh:
Substances:
Year: 2016 PMID: 28119818 PMCID: PMC5237821 DOI: 10.1016/j.nicl.2016.11.025
Source DB: PubMed Journal: Neuroimage Clin ISSN: 2213-1582 Impact factor: 4.881
Characteristics of the datasets.
| n | Age, years | Male | MMSE | Field strength | |
|---|---|---|---|---|---|
| (mean ± std) | (%) | (mean ± std) | (1.5 T/3 T) | ||
| ADNI standardized dataset | |||||
| All | 504 | 75.3 ± 6.5 | 58.1 | 27.0 ± 2.6 | 504/0 |
| NC | 169 | 76.0 ± 5.1 | 50.9 | 29.2 ± 1.0 | 169/0 |
| MCI | 234 | 74.9 ± 7.0 | 66.7 | 27.1 ± 1.7 | 234/0 |
| AD | 101 | 75.3 ± 7.4 | 50.5 | 23.2 ± 1.9 | 101/0 |
| ADNI HHP subset | |||||
| All | 40 | 74.0 ± 7.6 | 47.5 | 26.3 ± 2.9 | 40/0 |
| NC | 13 | 75.9 ± 6.8 | 46.2 | 28.8 ± 1.1 | 13/0 |
| MCI | 11 | 70.4 ± 7.4 | 54.5 | 27.5 ± 1.2 | 11/0 |
| AD | 16 | 74.9 ± 8.0 | 43.8 | 23.4 ± 2.0 | 16/0 |
| AIBL imaging arm | |||||
| All | 145 | 75.4 ± 7.4 | 46.2 | 27.1 ± 4.0 | 1/144 |
| NC | 88 | 75.2 ± 7.2 | 47.7 | 28.9 ± 1.3 | 1/87 |
| MCI | 29 | 77.5 ± 7.1 | 51.7 | 27.0 ± 2.0 | 0/29 |
| AD | 28 | 73.6 ± 8.1 | 35.7 | 21.2 ± 5.6 | 0/28 |
| CADDementia validation | |||||
| All | 30 | 65.2 ± 7.0 | 43.3 | 0/30 | |
| NC | 12 | 62.3 ± 6.3 | 25.0 | 0/12 | |
| MCI | 9 | 68.0 ± 8.5 | 44.4 | 0/9 | |
| AD | 9 | 66.1 ± 5.2 | 66.7 | 0/9 | |
| CADDementia test | |||||
| All | 354 | 65.1 ± 7.8 | 60.2 | 0/354 | |
MMSE was not available for the CADDementia data.
Overview of individual MRI biomarkers.
| MRI biomarker | ROI segmentation | Training dataset |
|---|---|---|
| Cortical thickness | ||
| Frontal lobe | FreeSurfer | No training |
| Occipital lobe | FreeSurfer | No training |
| Parietal lobe | FreeSurfer | No training |
| Temporal lobe | FreeSurfer | No training |
| Cingulate cortex | FreeSurfer | No training |
| Volumetry | ||
| Amygdala | FreeSurfer | No training |
| Caudate nucleus | FreeSurfer | No training |
| Hippocampus | FreeSurfer | No training |
| Pallidum | FreeSurfer | No training |
| Putamen | FreeSurfer | No training |
| Ventricular | FreeSurfer | No training |
| Whole brain | FreeSurfer | No training |
| Special purpose hippocampus | ||
| NL patch | NL patch | HHP |
| Left hipppocampus shape | FreeSurfer | ADNI + AIBL |
| Right hipppocampus shape | FreeSurfer | ADNI + AIBL |
| Hippocampal texture | FreeSurfer | ADNI + AIBL |
Fig. 1Sketch illustrating the algorithm. (A) A range of individual structural MRI biomarkers capturing different aspects of the scan are extracted. (B) The individual MRI biomarkers are z-score normalized dependent on the age of the subject with respect to each diagnostic group. (C) The z-score normalized MRI biomarkers are entered to a 3-class LDA classifier to produce the combination biomarker score. A hard classification was obtained using fhardLDA(zcombination), and a soft classification was obtained using fsoftLDA(ω|zcombination).
Performance measures.
| CA | True positive fraction | AUC | ||||||
|---|---|---|---|---|---|---|---|---|
| NC | MCI | AD | All | NC | MCI | AD | ||
| ADNI + AIBL | 62.7 | 79.0 | 57.8 | 40.3 | 78.1 | 86.1 | 68.3 | 81.8 |
| CADDementia validation | 73.3 | 91.7 | 44.4 | 77.8 | 83.2 | 86.6 | 68.3 | 95.8 |
| CADDementia test | 63.0 | 96.9 | 28.7 | 61.2 | 78.8 | 86.3 | 63.1 | 87.5 |
10-Fold cross-validation stratified by cohort and diagnostic group.
Results from Bron et al. (2015).
Confusion matrices. Rows are predicted class and columns are true class.
| ADNI+AIBL | CADDementia validation | CADDementia test | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| NC | MCI | AD | NC | MCI | AD | NC | MCI | AD | |||
| NC | 203 | 65 | 9 | NC | 11 | 3 | 0 | NC | 125 | 64 | 15 |
| MCI | 48 | 152 | 68 | MCI | 1 | 5 | 2 | MCI | 3 | 35 | 25 |
| AD | 6 | 46 | 52 | AD | 0 | 1 | 7 | AD | 1 | 23 | 63 |
10-fold cross-validation stratified by cohort and diagnostic group.
results from Bron et al. (2015).
Fig. 2Result of sequential forward feature selection (SFS). (A) Performance as a function of number of selected features. The thin gray lines correspond to each of the 10 training sets in the ADNI+AIBL 10FCV dataset, and the thick black line is the average performance across the 10 training folds. The average curve converges at 10 features indicated by the gray circle. This is the number of features that is used in the subsequent analysis. (B) Frequency of selection when the first 10 features are considered in the SFS. Each feature can be selected maximally 30 times (corresponding to the 3 different z-score versions across the 10 folds). The color-coding corresponds to how early in the SFS procedure a feature is selected, and it ranges from white (1. iteration) to black (10. and last iteration).
Pair-wise Pearson correlation between features. Asterisk marks significance Bonferroni corrected across pair-wise comparisons (p < 0.000008 = 0.001/((162 − 16)/2)). Bold font marks significant correlations of at least 0.5.
| FreeSurfer cortical thickness | FreeSurfer volumetry | Special purpose hippocampus | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| FL | OL | PL | TL | CC | AM | CN | HI | PA | PU | VE | WB | NL patch | Shape (l) | Shape (r) | |
| OL | |||||||||||||||
| PL | |||||||||||||||
| TL | |||||||||||||||
| CC | 0.4* | ||||||||||||||
| AM | 0.3* | 0.4* | 0.4* | 0.3* | |||||||||||
| CN | 0.1 | 0.2* | 0.2* | 0.1 | 0.1 | 0.2* | |||||||||
| HI | 0.4* | 0.3* | 0.4* | 0.4* | 0.1 | ||||||||||
| PA | 0.2* | 0.2* | 0.2* | 0.2* | 0.1 | 0.3* | 0.3* | 0.4* | |||||||
| PU | 0.3* | 0.4* | 0.4* | 0.4* | 0.3* | ||||||||||
| VE | −0.3* | −0.2 | −0.2* | −0.4* | −0.3* | −0.3* | 0.1 | −0.4* | −0.4* | ||||||
| WB | 0.2* | 0.2* | 0.2* | 0.2* | 0.2* | 0.4* | 0.3* | −0.3* | |||||||
| NL patch | 0.3* | 0.2* | 0.2* | 0.3* | 0.1 | 0.4* | 0.4* | −0.4* | 0.4* | ||||||
| Shape (l) | −0.2* | −0.2* | −0.2* | −0.1 | −0.4* | 0.0 | −0.2* | −0.2* | 0.3* | −0.2* | |||||
| Shape (r) | −0.1 | 0.0 | −0.1 | −0.1 | 0.0 | 0.0 | 0.0 | −0.1 | 0.0 | 0.0 | 0.1 | 0.0 | −0.1 | 0.1 | |
| Texture | −0.4* | −0.4* | −0.4* | −0.3* | −0.1 | −0.2* | −0.3* | 0.4* | −0.3* | 0.1 | |||||
Abbreviations: FL, frontal lobe; OL, occipital lobe; PL, parietal lobe; TL, temporal lobe; CC, cingulate cortex; AM, amygdala; CN, caudate nucleus; HI, hippocampus; PA, pallidum; PU, putamen; VE, ventricular; WB, whole brain.
Fig. 3Learning curves with MRI biomarkers as features. (A) LDA (the classifier used in the CADDementia challenge). (B) SVM with a radial Gaussian kernel. (C) Random forest classifier. Note that the training curve is not shown here because the training CA by design is ≈ 100%. (D) kNN classifier. Error bars mark ±standard deviation. The dashed black curve corresponds to the mean ADNI + AIBL validation accuracy of the LDA classifier, i.e., the green curve in (A).