| Literature DB >> 30803110 |
Mark J R J Bouts1,2,3, Jeroen van der Grond2, Meike W Vernooij4,5, Marisa Koini6, Tijn M Schouten1,2,3, Frank de Vos1,2,3, Rogier A Feis2,3, Lotte G M Cremers4,5, Anita Lechner6, Reinhold Schmidt6, Mark de Rooij1,3, Wiro J Niessen5,7,8, M Arfan Ikram4,5,9, Serge A R B Rombouts1,2,3.
Abstract
Early and accurate mild cognitive impairment (MCI) detection within a heterogeneous, nonclinical population is needed to improve care for persons at risk of developing dementia. Magnetic resonance imaging (MRI)-based classification may aid early diagnosis of MCI, but has only been applied within clinical cohorts. We aimed to determine the generalizability of MRI-based classification probability scores to detect MCI on an individual basis within a general population. To determine classification probability scores, an AD, mild-AD, and moderate-AD detection model were created with anatomical and diffusion MRI measures calculated from a clinical Alzheimer's Disease (AD) cohort and subsequently applied to a population-based cohort with 48 MCI and 617 normal aging subjects. Each model's ability to detect MCI was quantified using area under the receiver operating characteristic curve (AUC) and compared with an MCI detection model trained and applied to the population-based cohort. The AD-model and mild-AD identified MCI from controls better than chance level (AUC = 0.600, p = 0.025; AUC = 0.619, p = 0.008). In contrast, the moderate-AD-model was not able to separate MCI from normal aging (AUC = 0.567, p = 0.147). The MCI-model was able to separate MCI from controls better than chance (p = 0.014) with mean AUC values comparable with the AD-model (AUC = 0.611, p = 1.0). Within our population-based cohort, classification models detected MCI better than chance. Nevertheless, classification performance rates were moderate and may be insufficient to facilitate robust MRI-based MCI detection on an individual basis. Our data indicate that multiparametric MRI-based classification algorithms, that are effective in clinical cohorts, may not straightforwardly translate to applications in a general population.Entities:
Keywords: Alzheimer's disease; MRI; classification; community-dwelling cohort; diffusion tensor imaging; machine learning; mild cognitive impairment
Mesh:
Year: 2019 PMID: 30803110 PMCID: PMC6563478 DOI: 10.1002/hbm.24554
Source DB: PubMed Journal: Hum Brain Mapp ISSN: 1065-9471 Impact factor: 5.038
Demographics of the AD and RS cohort
| AD cohort | RS cohort | |||
|---|---|---|---|---|
| Control | AD | Control | MCI | |
|
| 173 | 77 | 617 | 48 |
| Age (mean ± | 66.1 ± 8.7 | 68.6 ± 8.6 | 67.3 ± 5.2 | 68.8 ± 6.6 |
| Female gender (%) | 99 (57.2) | 46 (59.7) | 319 (51.7) | 23 (47.9) |
| Disease duration (months) | 26.4 ± 24.6 | |||
| MMSE (mean ± | 27.5 ± 1.8 | 20.4 ± 4.5 | 28.1 ± 2.0 | 26.9 ± 1.8 |
AD: Alzheimer's disease, MCI: mild cognitive impairment, MMSE: mini‐mental state examination, SD: standard deviation.
Versus control subjects, p < 0.01.
Versus AD cohort, p < 0.05.
Versus AD cohort, p < 0.001.
Demographics of amnestic MCI, nonamnestic MCI, and control subjects of the RS cohort
| RS cohort | ||||
|---|---|---|---|---|
| Amnestic MCI | Nonamnestic MCI | Control | ||
|
| 23 | 25 | 617 | |
| Age (mean ± | 69.9 ± 7.6 | 67.8 ± 5.4 | 67.3 ± 5.2 | |
| Female gender (%) | 8 (34.8) | 15 (60.0) | 298 (51.7) | |
| MMSE | 27 [25–28] | 28 [26–29]* | 28 [27–29] | |
| Memory | ||||
| (median [iqr]) | WLT im | 7 [6–8] | 12 [10–15] | 14 [11–17] |
| WLT delay | 3 [2–4] | 6 [5–9] | 7 [6–9] | |
| Information processing speed | ||||
| (median [iqr]) | Stroop I | 18.9 [16.9–20.7] | 23.1 [19.4–29.7] | 16.8 [15.0–18.3] |
| Stroop II | 24.8 [23.0–27.1] | 27.8 [25.5–31.0] | 22.4 [20.2–24.9] | |
| LDST | 28 [22–30] | 23 [19–27] | 30 [26–35] | |
| Executive functioning | ||||
| (median [iqr]) | VFT | 18 [16–21] | 16 [14–22] | 22 [19–26] |
| Stroop III | 62.6 [49.0–89.2] | 67.6 [55.7–97.0] | 46.2 [39.2–54.2] | |
delay: delayed recall; im: immediate recall; iqr: inter‐quartile range; LDST: letter digit substitution task; MCI: mild cognitive impairment; MMSE: mini‐mental state examination; SD: standard deviation; Stroop I: Stroop reading subtask; Stroop II: Stroop color‐naming subtask; Stroop III: Stroop interference subtask; VFT: verbal fluency test; WLT: 15‐word verbal learning test.
Versus control subjects p < 0.05.
Versus control subjects, p < 0.001.
Versus amnestic MCI subjects, p < 0.001.
Figure 1Receiver‐operating curves of MCI versus control classifications within the RS cohort. Classifications were obtained by training an AD versus control classification model using the AD cohort and subsequently applying it within the RS cohort (AD‐model). Mild‐AD‐model and moderate‐AD‐model classifications were calculated similarly to the AD‐model but respectively included mild‐AD patients (MMSE > 20) or moderate‐AD patients (MMSE ≤ 20) only. Finally, MCI versus control classifications were obtained through 10‐fold nested cross‐validation within the RS cohort (MCI‐model). Mean AUC values of classifications within the RS cohort were comparable (AD‐model: 0.600, mild‐AD‐model: 0.619; moderate‐AD‐model: 0.567; MCI‐model: 0.611 [Table 3]). Only classifications with the AD‐model (p = 0.025), mild‐AD‐model (p = 0.008), and the MCI‐model (p = 0.014) were significantly better than chance level. The diagonal line represents random classification performance [Color figure can be viewed at http://wileyonlinelibrary.com]
Classification performance values of the AD, mild‐AD, moderate‐AD, and MCI classification models within the RS cohort
| Model | Measure | AUC | Min–max | Sensitivity | Specificity | Accuracy |
|---|---|---|---|---|---|---|
| AD | Multiparametric | 0.600 | 0.572–0.631 | 0.556 | 0.647 | 0.641 |
| Mild‐AD | Multiparametric | 0.619 | 0.587–0.651 | 0.594 | 0.658 | 0.653 |
| Moderate‐AD | Multiparametric | 0.567 | 0.549–0.591 | 0.533 | 0.621 | 0.615 |
| MCI | Multiparametric | 0.611 | 0.577–0.644 | 0.628 | 0.615 | 0.616 |
Mean, minimum, and maximum area under the ROC curve (AUC) after 100 classification repetitions. Classifications with the AD‐, mild‐AD‐, and moderate‐AD‐models resulted from 100 times repeated training on the AD cohort and applying it to the RS cohort. The MCI‐model resulted from 100 times repeated, 10‐fold nested cross‐validations using RS cohort data. Mean sensitivity, specificity, and accuracy were calculated at the optimal operating point on the ROC curve. DGMV: deep gray matter volumes; FA: fractional anisotropy; GMD: gray matter density; MD: mean diffusivity; Multiparametric: classification model including GMD, DGMV, WMD, FA, and MD; WMD: white matter density.
Significantly higher than random classification, p < 0.05.
Figure 2Box‐ and scatter plots of AD probability scores—ranging from control (0.0) to AD patient (1.0)—of each RS cohort subject as calculated with the AD‐model (a), mild‐AD‐model (b), or moderate‐AD‐model (c). AD probability scores calculated with the AD‐model (a) resulted from training an AD versus control classification model with all AD cohort subjects and subsequently applying it to subjects of the RS cohort. AD probability scores obtained with the mild‐AD‐model (b) were similarly calculated, but were trained with MRI measures of mild‐AD patients (MMSE > 20) and control subjects only, whereas AD probability scores of the moderate‐AD‐model (c) were calculated with MRI measures of moderate‐AD patients (MMSE ≤ 20) and control subjects only. AD‐model‐based probability scores from each subject in the AD cohort were added for reference (d). Within the RS cohort, mean AD probability scores for MCI subjects were higher than control subjects for classifications with the mild‐AD‐model (b, p = 0.047), but not for classifications with the AD‐model (a, p = 0.140) or moderate‐AD‐model (c, p = 0.870). Compared with scores within the AD cohort (d), AD probability scores within the RS cohort were lower and overlapped more between MCI and control subjects for the AD‐model (a, p = 0.002), mild‐AD model (b, p = 0.002), and moderate‐AD model (c, p = 0.002). For visual purposes, AD probability scores were offset adjusted by for each model subtracting each model's calculated minimal score from each subject's individual score [Color figure can be viewed at http://wileyonlinelibrary.com]
Figure 3Box‐ and scatter plots of MCI probability score—ranging from control (0.0) to MCI (1.0) subject—of each RS cohort subject as calculated with the MCI‐model. Mean MCI probability scores for MCI subjects were slightly higher than control subjects (p = 0.060), but scores were lower and overlapped more than AD probability scores in the AD cohort (p = 0.002, Figure 2d). For visual purposes, MCI probability scores were offset adjusted by subtracting the MCI model's minimal score from each subject's individual score [Color figure can be viewed at http://wileyonlinelibrary.com]