| Literature DB >> 34136635 |
Jack Albright1, Miriam T Ashford2,3, Chengshi Jin4, John Neuhaus4, Gil D Rabinovici5,6, Diana Truran2,3, Paul Maruff7, R Scott Mackin3,8, Rachel L Nosheny3,8, Michael W Weiner2,3,5,6,8,9.
Abstract
INTRODUCTION: This study investigated the extent to which subjective and objective data from an online registry can be analyzed using machine learning methodologies to predict the current brain amyloid beta (Aβ) status of registry participants.Entities:
Year: 2021 PMID: 34136635 PMCID: PMC8190559 DOI: 10.1002/dad2.12207
Source DB: PubMed Journal: Alzheimers Dement (Amst) ISSN: 2352-8729
FIGURE 1Generation of samples and subsamples
Comparison of data subsets
| SP‐ECog+ / CBB+ | SP‐ECog+ / CBB‐ | SP‐ECog‐ / CBB+ | SP‐ECog‐ / CBB‐ | |
|---|---|---|---|---|
|
| 148 | 264 | 361 | 664 |
|
| 72.7 (5.3) | 73.4 (5.4) | 73.0 (5.4) | 73.5 (5.5) |
|
| 56 (37.8%) | 108 (40.9%) | 147 (40.7%) | 294 (44.3%) |
|
| 16.5 (2.7) | 16.2 (2.8) | 16.5 (2.6) | 16.2 (2.7) |
|
| 49 (33.1%) | 80 (30.3%) | 119 (33.0%) | 213 (32.1%) |
|
| 133 (89.9%) | 242 (91.7%) | 313 (86.7%) | 593 (89.3%) |
|
| 1.78 (0.57) | 1.84 (0.64) | 1.70 (0.54) | 1.78 (0.61) |
|
| 2.14 (2.34) | 2.46 (2.68) | 2.17 (2.39) | 2.46 (2.68) |
|
| 11 (7.4%) | 52 (19.7%) | 30 (8.3%) | 105 (15.8%) |
|
| 110 (74.3%) | 185 (70.1%) | 243 (67.3%) | 459 (69.1%) |
|
| 84 (56.8%) | 168 (63.6%) | 177 (49.0%) | 365 (55.0%) |
Abbreviations: AD, diagnosed with Alzheimer's disease; Aβ+, amyloid beta positive.; CBB‐, subset does not includes Cogstate Brief Battery scores; CBB+, subset includes Cogstate Brief Battery scores; FamHxAD, family history of Alzheimer's disease; GDS_Score, Geriatric Depression Scale (short form) score; MCI, diagnosed with mild cognitive Impairment; SD, standard deviation; Self_SMC, self‐reported subjective memory concern; Self‐ECog, self‐assessed Everyday Cognition metric; SP‐ECog‐, subset does not include study partner–assessed Everyday Cognition scores; SP‐ECog+, subset includes study partner–assessed Everyday Cognition scores.
Results of feature selection
| Model | SP‐ECog | CBB | N | AUC, mean (SD) | Sens. | Spec. | PPV | NPV | AUC (n = 148), mean (SD) |
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
| 64.82% | 49.16% | 60.97% | 53.53% |
|
|
|
|
| 49.84% | 53.34% | 50.25% | 52.57% |
| ||
|
|
|
|
| 75.59% | 42.22% | 70.00% | 48.84% |
| |
|
|
|
| 65.37% | 44.76% | 61.11% | 49.00% |
| ||
|
|
|
|
|
| 75.37% | 37.44% | 59.49% | 56.11% |
|
|
|
|
| 48.12% | 52.84% | 50.74% | 51.37% |
| ||
|
|
|
|
| 88.70% | 19.33% | 65.94% | 52.71% |
| |
|
|
|
| 79.77% | 22.22% | 57.36% | 48.37% |
|
Final column (labeled as n = 148) reports scores generated using a data set consisting only of the 148 subjects with data for all 12 features. Values marked with * are significantly different (P < .05) from the corresponding means in the fifth column. Abbreviations: RF, random forest; SVM, support vector machine; SP‐ECog, study partner–assessed Everyday Cognition score; CBB, Cogstate Brief Battery score; AUC, area under the receiver‐operating characteristic curve, measured by 10‐fold cross‐validation; SD, standard deviation; Sens., sensitivity; Spec., specificity; PPV, positive predictive value; NPV, negative predictive value.
FIGURE 2Performance of models after feature selection. Blue boxes represent baseline performance of models with hyperparameter optimization but no feature selection. Orange boxes represent performance of models with both hyperparameter optimization and feature selection. Abbreviations: RF, random forest; SVM, support vector machine; SP‐ECog, study partner–assessed Everyday Cognition score; CBB, Cogstate Brief Battery score; AUC, area under the receiver‐operating characteristic curve
FIGURE 4Frequency of feature selection. (A) Non‐imputed data. (B) Imputed data. Abbreviations: RF, random forest; SVM, support vector machine; SP‐ECog or SP_ECog_score, study partner–assessed Everyday Cognition score; CBB, Cogstate Brief Battery score; FamHxAD, family history of Alzheimer's disease; Self_SMC, self‐reported subjective memory concern; Self_ECog_score, self‐assessed Everyday Cognition metrics; GDS_Score, Geriatric Depression Scale (short form) score; Det_BS, Cogstate Detection test; IDN_BS, Cogstate Identification test; OCL_BS, Cogstate One Card Learning test; ONB_BS, Cogstate One Back test
FIGURE 3Effect of imputation of SP‐ECog and CBB data. Models having significantly different scores with and without imputation (P < .05) are denoted with an asterisk. No imputation was necessary for models without SP‐ECog and CBB scores, but these were built in parallel with rest as controls; minor differences between paired results for these models are the result of random variation inherent in cross‐validation process. Abbreviations: RF, random forest; SVM, support vector machine; SP‐ECog, study partner–assessed Everyday Cognition score; CBB, Cogstate Brief Battery score; AUC, area under the receiver‐operating characteristic curve