| Literature DB >> 35922851 |
Hugo Alexandre Ferreira1, Diana Prata2,3, Vasco Sá Diogo4,5.
Abstract
BACKGROUND: Early and accurate diagnosis of Alzheimer's disease (AD) is essential for disease management and therapeutic choices that can delay disease progression. Machine learning (ML) approaches have been extensively used in attempts to develop algorithms for reliable early diagnosis of AD, although clinical usefulness, interpretability, and generalizability of the classifiers across datasets and MRI protocols remain limited.Entities:
Keywords: Alzheimer’s disease; Classification; Dementia; Early diagnosis; Graph theory; Machine learning; Mild cognitive impairment; Prognosis
Mesh:
Year: 2022 PMID: 35922851 PMCID: PMC9347083 DOI: 10.1186/s13195-022-01047-y
Source DB: PubMed Journal: Alzheimers Res Ther Impact factor: 8.823
Fig. 1Classification approach for binary classifiers. See the “Machine learning classification” section for a detailed explanation. l-SVM, linear support vector machine; DT, decision tree; RF, random forest; ET, extremely randomized tree; LDA, linear discriminant analysis; LR, logistic regression; LR-SGD, logistic regression with stochastic gradient descent learning; MCC, Matthews correlation coefficient; CV, cross-validation; X̄, average; σ, standard deviation. Green color indicates classifiers using only structural features and blue indicates classifiers using only GT features
Fig. 2Classification approach for multi-class classifiers. See the “Machine learning classification” section for a detailed explanation. CV, cross-validation; SVM, support vector machine. Green color indicates classifiers using only structural features and blue indicates classifiers using only GT features
Summary of diagnostic classification experiments
| Experiment | Training set | Testing set | GT metrics |
|---|---|---|---|
| | ADNI MPRAGE | ADNI MPRAGE | No |
| | ADNI IR-SPGR | ADNI IR-SPGR | No |
| | ADNI MPRAGE | ADNI IR-SPGR | No |
| | ADNI IR-SPGR | ADNI MPRAGE | No |
| | ADNI IR-SPGR and ADNI MPRAGE | ADNI IR-SPGR and ADNI MPRAGE | No |
| | ADNI MPRAGE | ADNI MPRAGE | Yes |
| | ADNI IR-SPGR | ADNI IR-SPGR | Yes |
| | ADNI MPRAGE | ADNI IR-SPGR | Yes |
| | ADNI IR-SPGR | ADNI MPRAGE | Yes |
| | ADNI IR-SPGR and ADNI MPRAGE | ADNI IR-SPGR and ADNI MPRAGE | Yes |
| | ADNI MPRAGE | ADNI MPRAGE | No |
| | OASIS (MPRAGE) | OASIS (MPRAGE) | No |
| | OASIS (MPRAGE) | ADNI MPRAGE | No |
| | ADNI MPRAGE | OASIS (MPRAGE) | No |
| | ADNI MPRAGE and OASIS (MPRAGE) | ADNI MPRAGE and OASIS (MPRAGE) | No |
| | ADNI MPRAGE | ADNI MPRAGE | Yes |
| | OASIS (MPRAGE) | OASIS (MPRAGE) | Yes |
| | OASIS (MPRAGE) | ADNI MPRAGE | Yes |
| | ADNI MPRAGE | OASIS (MPRAGE) | Yes |
| | ADNI MPRAGE and OASIS (MPRAGE) | ADNI MPRAGE and OASIS (MPRAGE) | Yes |
‘GT metrics’ refers to whether graph theory metrics were given as an input to classifiers, regardless of whether the classifier selected them or not. For each of the experiments A1 through A10, 4 classifiers were built (“HC vs. MCI”; “HC vs. AD”; “MCI vs. AD”; and “HC vs. MCI vs. AD”). For each of the experiments B1 through B10, only 1 classifier was built (“HC vs. AD”, as OASIS did not have MCI subjects available)
Descriptive statistics of demographic and clinical variables
| Dataset | Diagnosis | Sex (M/F) | Age (at MRI scanning) | Time from MRI to the most recent diagnosis (months) | |||
|---|---|---|---|---|---|---|---|
| Mean | Standard deviation | Mean | Standard deviation | ||||
| ADNI | 211 | 98/113 | 72.4 | 6.6 | 33.6 | 5.5 | |
| HC stable | 186 | 86/100 | 73.1 | 6.3 | 33.3 | 5.7 | |
| MCI transition to HC | 26 | 13/12 | 67.8 | 7.3 | 36.0 | 0.0 | |
| 188 | 113/75 | 73.1 | 7.5 | 31.3 | 7.9 | ||
| MCI stable | 172 | 102/70 | 72.5 | 7.4 | 31.4 | 7.9 | |
| HC transition to MCI | 16 | 11/5 | 78.9 | 5.7 | 31.1 | 8.0 | |
| 171 | 97/74 | 74.6 | 7.9 | 26.2 | 9.6 | ||
| AD stable | 90 | 50/40 | 75.2 | 7.9 | 20.2 | 7.9 | |
| MCI transition to AD | 83 | 34/47 | 73.9 | 7.9 | 32.8 | 6.5 | |
| OASIS | 439 | 179/260 | 67.4 | 8.1 | 34.6 | 4.2 | |
| 67 | 34/33 | 75.6 | 7.5 | 30.2 | 7.9 | ||
M male, F female
Classifier performance using only morphometric features for classifiers using ADNI subjects with IR-SPGR or MPRAGE scans
| Experiment | Training set | Testing set | Classification task | MCC | BAC | ROC AUC | Sens | Spec | PPV (prevalence) | NPV | PPV | NPV | TN | FP | FN | TP | MCC |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| A1 | ADNI MPRAGE ( | ADNI MPRAGE ( | HC vs. MCI | 0.056 [−0.159;0.259] | 52.8% [42.1%; 62.9%] | 58.2% [46.5%; 70.1%] | 50.0% [35.0%; 64.6%] | 55.6% [41.5%; 69.6%] | 33.3% [21.0%; 48.5%] | 71.5% [59.0%; 81.6%] | 53.0% [37.4%; 68.0%] | 52.7% [39.0%; 66.3%] | 25 | 20 | 22 | 22 | 0.060 |
| HC vs. AD | 0.814 [0.679; 0.927] | 90.8% [84.1%; 96.2%] | 97.3% [93.6%; 99.6%] | 94.9% [87.2%; 100.0%] | 86.7% [76.2%; 95.5%] | 81.7% [69.6%; 93.3%] | 96.4% [90.5%; 100.0%] | 87.7% [78.6%; 95.7%] | 94.4% [85.6%; 100.0%] | 39 | 6 | 2 | 37 | 0.974 | |||
| MCI vs. AD | 0.588 [0.424; 0.751] | 79.0% [70.7%; 87.5%] | 87.9% [79.9%; 95.3%] | 89.7% [78.6%; 97.7%] | 68.2% [54.3%; 82.4%] | 80.0% [70.9%; 88.7%] | 82.4% [64.2%; 96.2%] | 73.8% [63.2%; 84.7%] | 86.9% [71.7%; 97.3%] | 30 | 14 | 4 | 35 | 0.627 | |||
| HC vs. MCI vs. AD | 0.350 [0.227; 0.480] | 57.7% [50.2%; 65.2%] | 76.6% [70.8%; 82.3%] | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | 0.674 | |||
| A2 | ADNI IR-SPGR ( | ADNI IR-SPGR ( | HC vs. MCI | 0.087 [−0.267; 0.457] | 53.8% [38.7%; 70.4%] | 69.2% [51.2%; 86.0%] | 28.6% [7.1%; 54.5%] | 78.9% [61.1%; 95.0%] | 37.5% [7.5%; 82.8%] | 71.4% [59.8%; 82.5%] | 57.5% [15.4%; 91.6%] | 52.5% [39.7%; 67.6%] | 15 | 4 | 10 | 4 | 0.860 |
| HC vs. AD | 0.607 [0.313; 0.867] | 79.4% [65.0%; 92.9%] | 93.5% [82.7%; 100.0%] | 69.2% [44.4%; 92.9%] | 89.5% [75.0%; 100.0%] | 80.5% [52.6%; 100.0%] | 82.3% [68.3%; 95.7%] | 86.8% [64.0%; 100.0%] | 74.4% [57.4%; 93.4%] | 17 | 2 | 4 | 9 | 0.384 | |||
| MCI vs. AD | 0.408 [−0.028; 0.742] | 70.1% [51.5%; 86.7%] | 77.5% [55.1%; 94.0%] | 61.5% [33.3%; 86.7%] | 78.6% [50.0%; 100.0%] | 80.3% [48.5%; 100.0%] | 59.1% [34.6%; 84.2%] | 74.2% [40.0%; 100.0%] | 67.1% [42.8%; 88.3%] | 11 | 3 | 5 | 8 | 0.737 | |||
| HC vs. MCI vs. AD | 0.263 [0.054; 0.491] | 49.8% [37.1%; 62.9%] | 77.1% [66.4%; 87.5%] | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | 0.996 | |||
| A3 | ADNI MPRAGE ( | ADNI IR-SPGR ( | HC vs. MCI | 0.268 [0.093; 0.423] | 61.9% [54.2%; 69.4%] | 73.5% [62.9%; 82.7%] | 88.4% [77.8%; 97.4%] | 35.5% [23.9%; 48.3%] | 37.8% [31.2%; 45.5%] | 87.4% [70.8%; 97.7%] | 57.8% [50.6%; 65.3%] | 75.4% [51.8%; 94.9%] | 22 | 40 | 5 | 38 | 0.816 |
| HC vs. AD | 0.745 [0.612; 0.867] | 87.6% [80.7%; 94.0%] | 96.3% [92.7%; 98.9%] | 88.1% [77.8%; 97.2%] | 87.1% [77.9%; 95.1%] | 81.0% [68.8%; 92.5%] | 92.1% [84.9%; 98.2%] | 87.2% [77.9%; 95.2%] | 88.0% [77.8%; 97.1%] | 54 | 8 | 5 | 37 | 0.367 | |||
| MCI vs. AD | 0.647 [0.480; 0.805] | 82.4% [74.1%; 90.1%] | 87.4% [79.8%; 90.1%] | 83.3% [71.4%; 93.8%] | 81.4% [68.9%; 92.9%] | 86.4% [76.5%; 94.9%] | 77.5% [63.0%; 91.3%] | 81.7% [69.7%; 92.9%] | 83.0% [70.7%; 93.7%] | 35 | 8 | 7 | 35 | 0.496 | |||
| HC vs. MCI vs. AD | 0.424 [0.312; 0.533] | 61.7% [54.6%; 68.8%] | 81.6% [75.6%; 87.3%] | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | 0.618 | |||
| A4 | ADNI IR-SPGR ( | ADNI MPRAGE ( | HC vs. MCI | 0.178 [0.074; 0.287] | 57.1% [53.0%; 61.7%] | 65.6% [59.5%; 72.0%] | 26.9% [20.4%; 34.1%] | 87.2% [81.9%;92.3%] | 48.2% [33.3%; 66.2%] | 72.9% [69.9%; 76.0%] | 67.8% [53.0%; 81.6%] | 54.4% [50.7%; 58.3%] | ## | 19 | ## | 39 | 0.462 |
| HC vs. AD | 0.711 [0.631; 0.792] | 85.4% [81.2%; 89.4%] | 93.5% [90.7%; 95.9%] | 82.2% [75.0%; 88.1%] | 88.6% [83.3%; 93.3%] | 81.9% [73.8%;89.2%] | 88.8% [84.2%; 92.6%] | 87.8% [81.8%; 92.9%] | 83.3% [76.9%; 88.7%] | ## | 17 | 23 | ## | 0.893 | |||
| MCI vs. AD | 0.454 [0.353; 0.551] | 71.7% [66.6%; 76.5%] | 81.4% [76.2%; 86.3%] | 87.6% [81.4%; 92.9%] | 68.3% [56.5%; 70.9%] | 73.7% [68.9%; 78.5%] | 76.16% [64.6%; 86.4%] | 66.5% [61.0%; 72.0%] | 81.8% [72.1%; 90.0%] | 81 | 64 | 16 | 113 | 0.114 | |||
| HC vs. MCI vs. AD | 0.372 [0.307; 0.431] | 57.0% [53.4%; 60.3%] | 75.5% [72.2%; 78.8%] | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a | 0.492 | |||
| A5 | ADNI IR-SPGR and ADNI MPRAGE ( | ADNI IR-SPGR and ADNI MPRAGE ( | HC vs. MCI | 0.265 [0.085; 0.441] | 62.6% [54.0%; 71.0%] | 64.4% [53.7%; 74.1%] | 47.2% [34.5%; 60.4%] | 78.0% [66.7%; 87.7%] | 48.7% [31.5%; 68.5%] | 76.9% [69.7%; 83.3%] | 68.2% [50.9%; 83.1%] | 59.6% [50.5%; 68.9%] | 46 | 13 | 28 | 25 | 0.404 |
| HC vs. AD | 0.789 [0.669; 0.892] | 89.5% [83.7%; 94.7%] | 96.0% [92.3%; 98.8%] | 94.2% [87.0%; 100.0%] | 84.7% [75.4%; 93.8%] | 79.4% [68.9%; 91.0%] | 95.9% [90.3%; 100.0%] | 86.0% [78.0%; 94.2%] | 93.6% [85.3%; 100.08%] | 50 | 9 | 3 | 49 | 0.883 | |||
| MCI vs. AD | 0.592 [0.433; 0.734] | 79.1% [71.0%; 86.2%] | 88.1% [80.6%; 93.6%] | 88.5% [79.2%; 96.2%] | 69.8% [55.9%; 81.1%] | 80.6% [71.8%; 87.8%] | 81.1% [65.5%; 93.8%] | 74.6% [64.2%; 83.6%] | 85.9% [72.9%; 95.5%] | 37 | 16 | 6 | 46 | 0.774 | |||
MCC p-value refers to the p-value for the MCC metric for the comparison with the equivalent classifier (i.e., same training and test sets) with morphometric and GT features used as input. PPV/NPV “prevalence” are calculated with an MCI prevalence of 30.7% for the “HC vs. MCI” classifiers; an AD prevalence of 38.5% for the “HC vs. AD” classifiers; and an AD prevalence of 58.6% for the MCI vs. AD classifier (these correspond to the relative prevalence of the positive class based on prevalence estimates from the first visit in the clinical setting of 42.0% for HC, 18.6% for MCI, and 26.3% for AD) [48]. PPV/NPV “standard” are calculated with a prevalence of 50% to allow comparison with other studies
Legend: CI confidence interval, MCC Matthew’s correlation coefficient, ROC AUC area under the receiver operating characteristic curve, BAC balanced accuracy, Sens sensitivity, Spec specificity, PPV positive predict value, NPV negative predictive value, TN true negatives, FP false positives, FN false negatives, TP true positives
Fig. 3Relative contribution of features for the “HC vs. AD” classifier from experiment A5. Relative contributions are grouped by anatomical region (A) and by morphometric feature type (B). CSF, cerebrospinal fluid; HPC, hippocampus; StDev Thickness, standard deviation of the cortical thickness
Classifier confusion matrix for the multi-diagnostic classifier from Experiment A5.
| Classifier prediction | |||
|---|---|---|---|
| Ground-truth (2-year) | HC | MCI | AD |
| HC stable | 36 | 12 | 4 |
| MCI transition to HC | 4 | 4 | 0 |
| MCI stable | 29 | 13 | 10 |
| HC transition to MCI | 2 | 1 | 3 |
| AD stable | 0 | 5 | 18 |
| HC transition to AD | 0 | 0 | 1 |
| MCI transition to AD | 4 | 7 | 19 |
Classifier performance for “HC vs. AD” using morphometric features only using ADNI and OASIS subjects
| Experiment | Training set | Testing set | MCC | BAC | ROC AUC | Sens | Spec | PPV (prevalence) | NPV | PPV | NPV | TN | FP | FN | TP | MCC |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| B1 | ADNI MPRAGE ( | ADNI MPRAGE ( | 0.814 [0.679;0.927] | 90.8% [84.1%;96.2%] | 97.3% [93.6%;99.6%] | 94.9% [87.2%;100.0%] | 86.7% [76.2%;95.5%] | 81.7% [69.6%;93.3%] | 96.4% [90.5%;100.0%] | 87.7% [78.6%;95.7%] | 94.4% [85.6%;100.0%] | 39 | 6 | 2 | 37 | 0.974 |
| B2 | OASIS ( | OASIS ( | 0.564 [0.344;0.771] | 75.6% [65.3%;87.2%] | 94.1% [89.7%;97.8%] | 55.0% [33.3%;77.3%] | 96.2% [92.5%;99.2%] | 90.1% [73.5%;98.4%] | 77.3% [68.9%;87.5%] | 93.5% [81.6%;99.0%] | 68.1% [58.1%;81.4%] | 127 | 5 | 9 | 11 | 0.998 |
| B3 | OASIS ( | ADNI MPRAGE ( | 0.722 [0.657;0.793] | 84.3% [80.3%;88.4%] | 95.5% [93.1%;97.5%] | 70.5% [63.0%;78.2%] | 98.0% [95.7%;100.0%] | 95.7% [90.2%;100.0%] | 84.1% [80.5%;88.0%] | 97.2% [93.6%;100.0%] | 76.9% [72.1%;82.1%] | 146 | 3 | 38 | 91 | 0.623 |
| B4 | ADNI MPRAGE ( | OASIS ( | 0.641 [0.551;0.733] | 87.2% [82.4%;97.0%] | 94.3% [91.3%;97.0%] | 83.6% [74.6%;91.7%] | 90.9% [88.3%;93.5%] | 85.2% [80.0%;89.8%] | 89.9% [84.7%;94.7%] | 90.2% [86.4%;93.4%] | 84.7% [77.7%;91.8%] | 409 | 41 | 11 | 56 | 0.265 |
MCC p-value refers to the p-value for the MCC metric for the comparison with the equivalent classifier (i.e., same training and test sets) with morphometric and GT features. PPV/NPV “prevalence” are calculated with an AD prevalence of 38.5% (this corresponds to the prevalence of AD relative to HC based on prevalence estimates from the first visit in the clinical setting of 42.0% for HC and 26.3% for AD) [48]. PPV/NPV “standard” are calculated with a prevalence of 50% to allow for comparison with other studies
CI confidence interval, MCC Matthew’s correlation coefficient, ROC AUC area under the receiver operating characteristic curve, BAC balanced accuracy, Sens sensitivity, Spec specificity, PPV positive predict value, NPV negative predictive value, TN true negatives, FP false positives, FN false negatives, TP true positives. The most global classifier is highlighted
Fig. 4Relative contribution of features for the “HC vs. AD” classifier from experiment B5. Relative contributions are grouped by anatomical region (A) and by morphometric feature type (B). CSF, cerebrospinal fluid; HPC, hippocampus; StDev Thickness, standard deviation of the cortical thickness