| Literature DB >> 30567554 |
Telma Pereira1,2, Francisco L Ferreira3, Sandra Cardoso4, Dina Silva5, Alexandre de Mendonça4, Manuela Guerreiro4, Sara C Madeira6.
Abstract
BACKGROUND: Predicting progression from Mild Cognitive Impairment (MCI) to Alzheimer's Disease (AD) is an utmost open issue in AD-related research. Neuropsychological assessment has proven to be useful in identifying MCI patients who are likely to convert to dementia. However, the large battery of neuropsychological tests (NPTs) performed in clinical practice and the limited number of training examples are challenge to machine learning when learning prognostic models. In this context, it is paramount to pursue approaches that effectively seek for reduced sets of relevant features. Subsets of NPTs from which prognostic models can be learnt should not only be good predictors, but also stable, promoting generalizable and explainable models.Entities:
Keywords: Alzheimer’s disease; Ensemble learning; Feature selection; Mild cognitive impairment; Neuropsychological data; Prognostic prediction; Time windows
Mesh:
Year: 2018 PMID: 30567554 PMCID: PMC6299964 DOI: 10.1186/s12911-018-0710-y
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Workflow of the proposed feature selection ensemble approach combining stability and predictability (FSE-StabPred)
Fig. 2Workflow of the ensemble-based approach of the proposed FS ensemble combining stability and predictability (FSE-StabPred) using different classifiers
Details on ADNI and CCC datasets for time windows of 2 to 4 years
| ADNI | CCC | |||
|---|---|---|---|---|
| sMCI | cMCI | sMCI | cMCI | |
| 2-Year window | 311 (78%) | 89 (22%) | 409 (81%) | 96 (19%) |
| 3-Year window | 235 (68%) | 111 (32%) | 310 (68%) | 143 (32%) |
| 4-Year window | 143 (54%) | 122 (46%) | 227 (56%) | 175 (44%) |
Note: sMCI stable MCI, cMCI converter MCI. Class imbalance (per time window) is shown as % within brackets
Baseline demographic characterization data
| Time window, years | ADNI | CCC | |||||
|---|---|---|---|---|---|---|---|
| sMCI | cMCI | sMCI | cMCI | ||||
| Age, years (M ± SD) | 2 | 73.1 ± 7.8 | 74.4 ± 7.7 | 0.182 | 67.2 ± 8.9 | 72.5 ± 7.9 | < 10–7* |
| 3 | 72.5 ± 7.6 | 74.9 ± 7.6 | <0.006* | 66.6 ± 8.8 | 72.3 ± 8.2 | < 10–10* | |
| 4 | 72.1 ± 7.3 | 74.8 ± 7.6 | <0.004* | 65.5 ± 9.1 | 71.9 ± 8.3 | < 10–12* | |
| Formal Education, years (M ± SD) | 2 | 15.9 ± 2.7 | 16.2 ± 2.7 | 0.483 | 10.0 ± 4.7 | 8.9 ± 5.0 | < 10–7* |
| 3 | 16.2 ± 2.7 | 16.0 ± 2.7 | 0.715 | 10.1 ± 4.8 | 8.6 ± 4.8 | <0.003* | |
| 4 | 16.1 ± 2.8 | 16.0 ± 2.6 | 0.895 | 10.4 ± 4.7 | 8.8 ± 4.8 | <0.001* | |
| Gender (male/female) | 2 | 183/128 | 48/41 | 0.408 | 151/258 | 39/57 | 0.499 |
| 3 | 136/99 | 64/47 | 0.969 | 119/191 | 50/93 | 0.483 | |
| 4 | 82/61 | 70/52 | 0.995 | 85/142 | 60/115 | 0.513 | |
Group comparison (converter MCI vs stable MCI) were performed with Independent samples t-tests (Age and Formal Education) and X2 Person Chi-Square test (Gender). Statistically significant (p < 0.05) are marked with an asterisk (*)
Mean (M) and standard deviation (SD) values are illustrated
Individual and pairwise stability of the base FS algorithms used in the ensemble. Results are averaged over the 10 × 5 stratified CV and m subsets (for each possible subset size) for the 2-years (upper values), 3-years (middle values) and 4-years (bottom values), using ADNI data
| ReliefF | MIM | CMIM | MRMR | Chi-Squared | SVM-RFE | LL21 | |
|---|---|---|---|---|---|---|---|
| ReliefF | 0.784 ± 0.127 | – | – | – | – | – | – |
| 0.755 ± 0.147 | |||||||
| 0.728 ± 0.158 | |||||||
| MIM | 0.601 ± 0.192 | 0.863 ± 0.112 | – | – | – | – | – |
| 0.589 ± 0.215 | 0.862 ± 0.108 | ||||||
| 0.570 ± 0.208 | 0.861 ± 0.114 | ||||||
| CMIM | 0.589 ± 0.243 | 0.704 ± 0.124 | 0.774 ± 0.152 | – | – | – | – |
| 0.533 ± 0.242 | 0.667 ± 0.139 | 0.758 ± 0.164 | |||||
| 0.479 ± 0.23 | 0.618 ± 0.168 | 0.739 ± 0.168 | |||||
| MRMR | 0.446 ± 0.211 | 0.301 ± 0.162 | 0.396 ± 0.174 | 0.858 ± 0.054 | – | – | – |
| 0.371 ± 0.176 | 0.252 ± 0.144 | 0.379 ± 0.171 | 0.852 ± 0.064 | ||||
| 0.367 ± 0.185 | 0.237 ± 0.161 | 0.390 ± 0.165 | 0.858 ± 0.054 | ||||
| Chi-Squared | 0.583 ± 0.1850 | 0.646 ± 0.118 | 0.529 ± 0.158 | 0.335 ± 0.212 | 0.871 ± 0.136 | – | – |
| 0.591 ± 0.195 | 0.668 ± 0.130 | 0.514 ± 0.159 | 0.289 ± 0.193 | 0.874 ± 0.149 | |||
| 0.574 ± 0.217 | 0.668 ± 0.135 | 0.497 ± 0.167 | 0.286 ± 0.194 | 0.875 ± 0.148 | |||
| SVM-RFE | 0.233 ± 0.097 | 0.184 ± 0.089 | 0.226 ± 0.107 | 0.323 ± 0.087 | 0.141 ± 0.068 | 0.302 ± 0.142 | – |
| 0.219 ± 0.100 | 0.233 ± 0.089 | 0.251 ± 0.086 | 0.272 ± 0.064 | 0.184 ± 0.07 | 0.269 ± 0.076 | ||
| 0.217 ± 0.089 | 0.201 ± 0.089 | 0.227 ± 0.098 | 0.277 ± 0.059 | 0.193 ± 0.084 | 0.273 ± 0.097 | ||
| LL21 | 0.618 ± 0.224 | 0.617 ± 0.215 | 0.552 ± 0.236 | 0.414 ± 0.215 | 0.584 ± 0.202 | 0.200 ± 0.088 | 0.908 ± 0.056 |
| 0.606 ± 0.22 | 0.613 ± 0.229 | 0.539 ± 0.241 | 0.353 ± 0.185 | 0.574 ± 0.203 | 0.206 ± 0.099 | 0.887 ± 0.064 | |
| 0.546 ± 0.253 | 0.565 ± 0.242 | 0.475 ± 0.235 | 0.324 ± 0.186 | 0.541 ± 0.209 | 0.184 ± 0.084 | 0.856 ± 0.087 |
Results obtained with the FS ensemble with subset size defined by RPT using β = 0.1 (upper values), β = 1 (middle values) and β = 10 (bottom values) for different classifiers. Results are averaged over 10 × 5 stratified cross validation, for each time window, using ADNI data
| AUC | Sensitivity | Specificity | Stability | # Features | ||
|---|---|---|---|---|---|---|
| 2Y | NB | 0.758 ± 0.00 | 0.599 ± 0.01 | 0.834 ± 0.00 | 1.0 ± 0.0 | 2 |
| 0.839 ± 0.00 | 0.744 ± 0.01 | 0.779 ± 0.01 | 0.982 ± 0.01 | 17 | ||
| 0.864 ± 0.00 | 0.791 ± 0.01 | 0.819 ± 0.01 | 0.912 ± 0.01 | 35 | ||
| SVM Poly | 0.460 ± 0.08 | 0.323 ± 0.03 | 0.933 ± 0.00 | 1.0 ± 0.0 | 2 | |
| 0.849 ± 0.00 | 0.758 ± 0.01 | 0.797 ± 0.01 | 0.982 ± 0.01 | 17 | ||
| 0.889 ± 0.01 | 0.789 ± 0.02 | 0.838 ± 0.01 | 0.913 ± 0.02 | 30 | ||
| SVM RBF | 0.770 ± 0.00 | 0.571 ± 0.02 | 0.829 ± 0.01 | 1.0 ± 0.0 | 2 | |
| 0.864 ± 0.01 | 0.758 ± 0.02 | 0.825 ± 0.01 | 0.978 ± 0.02 | 22 | ||
| 0.891 ± 0.01 | 0.777 ± 0.02 | 0.841 ± 0.01 | 0.913 ± 0.02 | 30 | ||
| DT | 0.588 ± 0.02 | 0.446 ± 0.03 | 0.732 ± 0.03 | 1.0 ± 0.0 | 2 | |
| 0.706 ± 0.02 | 0.574 ± 0.04 | 0.839 ± 0.01 | 0.934 ± 0.02 | 38 | ||
| 0.715 ± 0.03 | 0.568 ± 0.06 | 0.861 ± 0.01 | 0.919 ± 0.02 | 34 | ||
| LR | 0.769 ± 0.00 | 0.637 ± 0.01 | 0.784 ± 0.00 | 1.0 ± 0.0 | 2 | |
| 0.846 ± 0.01 | 0.732 ± 0.03 | 0.821 ± 0.01 | 0.978 ± 0.02 | 22 | ||
| 0.882 ± 0.01 | 0.727 ± 0.02 | 0.848 ± 0.01 | 0.913 ± 0.02 | 32 | ||
| 3Y | NB | 0.772 ± 0.01 | 0.521 ± 0.01 | 0.901 ± 0.00 | 1.0 ± 0.0 | 2 |
| 0.859 ± 0.00 | 0.761 ± 0.01 | 0.804 ± 0.01 | 0.985 ± 0.01 | 22 | ||
| 0.872 ± 0.00 | 0.775 ± 0.02 | 0.829 ± 0.01 | 0.889 ± 0.01 | 30 | ||
| SVM Poly | 0.734 ± 0.01 | 0.0 ± 0.0 | 1.0 ± 0.0 | 1.0 ± 0.0 | 2 | |
| 0.879 ± 0.00 | 0.584 ± 0.02 | 0.925 ± 0.01 | 0.927 ± 0.01 | 37 | ||
| 0.876 ± 0.01 | 0.626 ± 0.02 | 0.912 ± 0.01 | 0.780 ± 0.02 | 55 | ||
| SVM RBF | 0.777 ± 0.01 | 0.169 ± 0.02 | 0.982 ± 0.01 | 1.0 ± 0.0 | 2 | |
| 0.871 ± 0.01 | 0.614 ± 0.01 | 0.924 ± 0.01 | 0.985 ± 0.02 | 22 | ||
| 0.872 ± 0.01 | 0.619 ± 0.02 | 0.914 ± 0.01 | 0.942 ± 0.01 | 25 | ||
| DT | 0.602 ± 0.02 | 0.463 ± 0.02 | 0.742 ± 0.02 | 1.0 ± 0.0 | 2 | |
| 0.704 ± 0.02 | 0.603 ± 0.03 | 0.804 ± 0.02 | 0.959 ± 0.03 | 22 | ||
| 0.719 ± 0.02 | 0.622 ± 0.03 | 0.816 ± 0.01 | 0.890 ± 0.02 | 33 | ||
| LR | 0.777 ± 0.01 | 0.505 ± 0.01 | 0.920 ± 0.00 | 1.0 ± 0.0 | 2 | |
| 0.864 ± 0.01 | 0.658 ± 0.02 | 0.889 ± 0.01 | 0.985 ± 0.02 | 22 | ||
| 0.864 ± 0.01 | 0.658 ± 0.02 | 0.889 ± 0.01 | 0.985 ± 0.02 | 22 | ||
| 4Y | NB | 0.858 ± 0.03 | 0.779 ± 0.01 | 0.820 ± 0.01 | 0.937 ± 0.02 | 15 |
| 0.891 ± 0.01 | 0.775 ± 0.01 | 0.844 ± 0.02 | 0.925 ± 0.02 | 36 | ||
| 0.886 ± 0.00 | 0.789 ± 0.01 | 0.819 ± 0.01 | 0.895 ± 0.02 | 32 | ||
| SVM Poly | 0.849 ± 0.01 | 0.706 ± 0.02 | 0.824 ± 0.02 | 0.937 ± 0.02 | 15 | |
| 0.904 ± 0.00 | 0.757 ± 0.02 | 0.884 ± 0.01 | 0.846 ± 0.02 | 36 | ||
| 0.908 ± 0.01 | 0.757 ± 0.02 | 0.884 ± 0.01 | 0.847 ± 0.02 | 45 | ||
| SVM RBF | 0.871 ± 0.01 | 0.702 ± 0.01 | 0.887 ± 0.01 | 0.937 ± 0.02 | 15 | |
| 0.901 ± 0.00 | 0.754 ± 0.02 | 0.863 ± 0.02 | 0.925 ± 0.02 | 10 | ||
| 0.905 ± 0.01 | 0.758 ± 0.01 | 0.873 ± 0.03 | 0.829 ± 0.03 | 70 | ||
| DT | 0.735 ± 0.03 | 0.708 ± 0.05 | 0.761 ± 0.02 | 0.937 ± 0.02 | 15 | |
| 0.735 ± 0.03 | 0.708 ± 0.05 | 0.761 ± 0.02 | 0.937 ± 0.02 | 15 | ||
| 0.735 ± 0.03 | 0.708 ± 0.05 | 0.761 ± 0.02 | 0.937 ± 0.02 | 17 | ||
| LR | 0.870 ± 0.01 | 0.745 ± 0.02 | 0.862 ± 0.01 | 0.937 ± 0.02 | 15 | |
| 0.870 ± 0.01 | 0.745 ± 0.02 | 0.862 ± 0.01 | 0.937 ± 0.02 | 15 | ||
| 0.869 ± 0.01 | 0.751 ± 0.02 | 0.846 ± 0.01 | 0.905 ± 0.02 | 25 |
Results obtained with the FS ensemble with subset size defined by RPT using β = 0.1 (upper values), β = 1 (middle values) and β = 10 (bottom values) for different classifiers. Results are averaged over 10 × 5 stratified cross validation, for each time window, using CCC data
| AUC | Sensitivity | Specificity | Stability | # Features | ||
|---|---|---|---|---|---|---|
| 2Y | NB | 0.803 ± 0.01 | 0.746 ± 0.01 | 0.681 ± 0.00 | 1.0 ± 0.0 | 9 |
| 0.829 ± 0.00 | 0.771 ± 0.01 | 0.733 ± 0.01 | 0.971 ± 0.03 | 18 | ||
| 0.829 ± 0.00 | 0.765 ± 0.01 | 0.744 ± 0.01 | 0.936 ± 0.01 | 20 | ||
| SVM Poly | 0.815 ± 0.01 | 0.863 ± 0.01 | 0.767 ± 0.01 | 1.0 ± 0.0 | 9 | |
| 0.839 ± 0.00 | 0.789 ± 0.01 | 0.767 ± 0.01 | 0.965 ± 0.03 | 19 | ||
| 0.841 ± 0.00 | 0.788 ± 0.01 | 0.758 ± 0.01 | 0.936 ± 0.01 | 20 | ||
| SVM RBF | 0.820 ± 0.00 | 0.803 ± 0.02 | 0.673 ± 0.01 | 1.0 ± 0.0 | 9 | |
| 0.841 ± 0.00 | 0.771 ± 0.01 | 0.765 ± 0.01 | 0.965 ± 0.03 | 19 | ||
| 0.841 ± 0.00 | 0.771 ± 0.01 | 0.765 ± 0.01 | 0.965 ± 0.03 | 19 | ||
| DT | 0.616 ± 0.02 | 0.445 ± 0.03 | 0.786 ± 0.01 | 1.0 ± 0.0 | 9 | |
| 0.643 ± 0.02 | 0.578 ± 0.05 | 0.633 ± 0.03 | 0.918 ± 0.02 | 1 | ||
| 0.643 ± 0.02 | 0.578 ± 0.05 | 0.633 ± 0.03 | 0.918 ± 0.02 | 1 | ||
| LR | 0.811 ± 0.01 | 0.752 ± 0.02 | 0.726 ± 0.01 | 1.0 ± 0.0 | 9 | |
| 0.811 ± 0.01 | 0.752 ± 0.02 | 0.726 ± 0.01 | 1.0 ± 0.0 | 9 | ||
| 0.821 ± 0.01 | 0.765 ± 0.01 | 0.765 ± 0.01 | 0.936 ± 0.01 | 20 | ||
| 3Y | NB | 0.833 ± 0.00 | 0.749 ± 0.01 | 0.735 ± 0.01 | 1.0 ± 0.0 | 9 |
| 0.857 ± 0.00 | 0.779 ± 0.01 | 0.772 ± 0.01 | 0.966 ± 0.01 | 19 | ||
| 0.859 ± 0.00 | 0.778 ± 0.01 | 0.781 ± 0.01 | 0.950 ± 0.01 | 20 | ||
| SVM Poly | 0.844 ± 0.00 | 0.0 ± 0.0 | 1.0 ± 0.0 | 1.0 ± 0.0 | 9 | |
| 0.872 ± 0.00 | 0.633 ± 0.01 | 0.886 ± 0.00 | 0.966 ± 0.02 | 19 | ||
| 0.873 ± 0.01 | 0.643 ± 0.01 | 0.874 ± 0.00 | 0.909 ± 0.02 | 25 | ||
| SVM RBF | 0.842 ± 0.00 | 0.582 ± 0.01 | 0.870 ± 0.01 | 1.0 ± 0.0 | 9 | |
| 0.873 ± 0.00 | 0.608 ± 0.01 | 0.891 ± 0.00 | 0.966 ± 0.02 | 19 | ||
| 0.874 ± 0.00 | 0.612 ± 0.01 | 0.895 ± 0.01 | 0.950 ± 0.02 | 20 | ||
| DT | 0.664 ± 0.02 | 0.556 ± 0.03 | 0.773 ± 0.03 | 1.0 ± 0.0 | 9 | |
| 0.686 ± 0.02 | 0.587 ± 0.03 | 0.784 ± 0.02 | 0.986 ± 0.02 | 12 | ||
| 0.686 ± 0.02 | 0.587 ± 0.03 | 0.784 ± 0.02 | 0.986 ± 0.02 | 12 | ||
| LR | 0.838 ± 0.01 | 0.619 ± 0.02 | 0.859 ± 0.00 | 1.0 ± 0.0 | 9 | |
| 0.838 ± 0.01 | 0.619 ± 0.02 | 0.859 ± 0.01 | 1.0 ± 0.0 | 9 | ||
| 0.853 ± 0.01 | 0.635 ± 0.01 | 0.859 ± 0.01 | 0.950 ± 0.02 | 20 | ||
| 4Y | NB | 0.852 ± 0.00 | 0.796 ± 0.01 | 0.768 ± 0.01 | 1.0 ± 0.0 | 9 |
| 0.852 ± 0.00 | 0.796 ± 0.01 | 0.768 ± 0.01 | 1.0 ± 0.0 | 9 | ||
| 0.868 ± 0.00 | 0.793 ± 0.01 | 0.788 ± 0.01 | 0.955 ± 0.02 | 19 | ||
| SVM Poly | 0.853 ± 0.00 | 0.821 ± 0.01 | 0.720 ± 0.00 | 1.0 ± 0.0 | 9 | |
| 0.872 ± 0.00 | 0.775 ± 0.01 | 0.821 ± 0.01 | 0.959 ± 0.2 | 20 | ||
| 0.872 ± 0.00 | 0.775 ± 0.01 | 0.821 ± 0.01 | 0.959 ± 0.02 | 20 | ||
| SVM RBF | 0.858 ± 0.00 | 0.754 ± 0.01 | 0.798 ± 0.00 | 1.0 ± 0.0 | 9 | |
| 0.858 ± 0.00 | 0.754 ± 0.01 | 0.798 ± 0.00 | 1.0 ± 0.0 | 9 | ||
| 0.871 ± 0.00 | 0.763 ± 0.01 | 0.820 ± 0.01 | 0.949 ± 0.03 | 16 | ||
| DT | 0.675 ± 0.01 | 0.641 ± 0.02 | 0.713 ± 0.01 | 1.0 ± 0.0 | 2 | |
| 0.675 ± 0.01 | 0.641 ± 0.02 | 0.713 ± 0.01 | 1.0 ± 0.0 | 2 | ||
| 0.682 ± 0.02 | 0.655 ± 0.03 | 0.717 ± 0.01 | 0.937 ± 0.04 | 14 | ||
| LR | 0.852 ± 0.00 | 0.737 ± 0.01 | 0.801 ± 0.01 | 1.0 ± 0.0 | 9 | |
| 0.852 ± 0.00 | 0.737 ± 0.01 | 0.801 ± 0.01 | 1.0 ± 0.0 | 9 | ||
| 0.742 ± 0.01 | 0.742 ± 0.01 | 0.803 ± 0.01 | 0.929 ± 0.01 | 15 |
Fig. 3Stability and classification performance for subsets of features with different sizes (k) following 10 × 5 stratified CV and using time windows of 2-years (upper), 3-years (middle) and 4-years (bottom) obtained with ADNI (left panel) and CCC (right panel) data, using the NB and LR. RPT thresholds with β set as 0.1, 1 and 10 are illustrated
Results obtained with the entire set of features, the FS ensemble and the individual FS algorithms for time-windows of a) 2-years, b) 3-years and c) 4-years, using ADNI data. Results are averaged over the 10 × 5 stratified cross validation with subset size defined by the optimized RPT threshold (β = 10)
| Ensemble | ReliefF | MIM | CMIM | MRMR | Chi-Squared | LL21 | All features | ||
|---|---|---|---|---|---|---|---|---|---|
| 2-years windows | AUC | 0.882 ± 0.01 | 0.861 ± 0.01 | 0.865 ± 0.01 | 0.882 ± 0.01 | 0.859 ± 0.01 | 0.864 ± 0.0 | 0.851 ± 0.01 | 0.860 ± 0.01 |
| Sensitivity | 0.727 ± 0.02 | 0.758 ± 0.02 | 0.754 ± 0.03 | 0.736 ± 0.03 | 0.749 ± 0.02 | 0.771 ± 0.01 | 0.752 ± 0.01 | 0.594 ± 0.02 | |
| Specificity | 0.848 ± 0.01 | 0.826 ± 0.01 | 0.833 ± 0.02 | 0.847 ± 0.01 | 0.803 ± 0.00 | 0.815 ± 0.01 | 0.821 ± 0.01 | 0.903 ± 0.01 | |
| Stability | 0.913 ± 0.02 | 0.888 ± 0.02 | 0.907 ± 0.02 | 0.892 ± 0.02 | 1.0 ± 0.0 | 1.0 ± 0.0 | 0.837 ± 0.02 | – | |
| # Features | 32 | 8 | 25 | 32 | 7 | 4 | 11 | 79 | |
| 3-years windows | AUC | 0.872 ± 0.004 | 0.857 ± 0.00 | 0.871 ± 0.00 | 0.872 ± 0.00 | 0.863 ± 0.00 | 0.859 ± 0.01 | 0.868 ± 0.00 | 0.835 ± 0.01 |
| Sensitivity | 0.775 ± 0.018 | 0.784 ± 0.01 | 0.778 ± 0.01 | 0.782 ± 0.02 | 0.728 ± 0.01 | 0.776 ± 0.02 | 0.793 ± 0.02 | 0.714 ± 0.02 | |
| Specificity | 0.829 ± 0.011 | 0.817 ± 0.01 | 0.831 ± 0.01 | 0.827 ± 0.00 | 0.805 ± 0.01 | 0.841 ± 0.01 | 0.804 ± 0.01 | 0.782 ± 0.01 | |
| Stability | 0.889 ± 0.013 | 0.892 ± 0.02 | 0.913 ± 0.02 | 0.941 ± 0.01 | 0.789 ± 0.02 | 0.986 ± 0.02 | 0.944 ± 0.03 | – | |
| # Features | 30 | 15 | 23 | 29 | 70 | 10 | 45 | 79 | |
| 4-years windows | AUC | 0.886 ± 0.003 | 0.876 ± 0.00 | 0.881 ± 0.00 | 0.883 ± 0.01 | 0.867 ± 0.01 | 0.870 ± 0.01 | 0.872 ± 0.01 | 0.853 ± 0.006 |
| Sensitivity | 0.789 ± 0.007 | 0.788 ± 0.01 | 0.799 ± 0.01 | 0.798 ± 0.01 | 0.732 ± 0.01 | 0.761 ± 0.01 | 0.792 ± 0.02 | 0.705 ± 0.009 | |
| Specificity | 0.819 ± 0.013 | 0.817 ± 0.02 | 0.818 ± 0.01 | 0.831 ± 0.01 | 0.815 ± 0.01 | 0.837 ± 0.01 | 0.819 ± 0.01 | 0.831 ± 0.017 | |
| Stability | 0.895 ± 0.023 | 0.839 ± 0.02 | 0.881 ± 0.02 | 0.899 ± 0.02 | 0.906 ± 0.05 | 0.93 ± 0.02 | 0.923 ± 0.03 | – | |
| # Features | 32 | 22 | 23 | 33 | 75 | 23 | 50 | 79 |
Results obtained with the entire set of features, the FS ensemble and the individual FS algorithms for time-windows of a) 2-years, b) 3-years and c) 4-years, using CCC data. Results are averaged over the 10 × 5 stratified cross validation with subset size defined by the optimized RPT threshold (β = 10)
| Ensemble | ReliefF | MIM | CMIM | MRMR | Chi-Squared | LL21 | All features | ||
|---|---|---|---|---|---|---|---|---|---|
| 2-years windows | AUC | 0.821 ± 0.00 | 0.813 ± 0.01 | 0.806 ± 0.01 | 0.817 ± 0.01 | 0.827 ± 0.01 | 0.820 ± 0.01 | 0.809 ± 0.01 | 0.814 ± 0.01 |
| Sensitivity | 0.738 ± 0.02 | 0.744 ± 0.03 | 0.743 ± 0.02 | 0.735 ± 0.03 | 0.757 ± 0.02 | 0.742 ± 0.02 | 0.750 ± 0.02 | 0.385 ± 0.04 | |
| Specificity | 0.765 ± 0.01 | 0.758 ± 0.01 | 0.762 ± 0.01 | 0.767 ± 0.01 | 0.764 ± 0.01 | 0.763 ± 0.01 | 0.746 ± 0.01 | 0.920 ± 0.01 | |
| Stability | 0.936 ± 0.01 | 0.889 ± 0.03 | 0.928 ± 0.03 | 0.872 ± 0.03 | 0.975 ± 0.02 | 0.894 ± 0.02 | 0.986 ± 0.02 | – | |
| # Features | 20 | 14 | 16 | 25 | 16 | 20 | 12 | 40 | |
| 3-years windows | AUC | 0.859 ± 0.00 | 0.860 ± 0.00 | 0.861 ± 0.00 | 0.853 ± 0.04 | 0.855 ± 0.00 | 0.861 ± 0.00 | 0.863 ± 0.00 | 0.853 ± 0.00 |
| Sensitivity | 0.778 ± 0.01 | 0.779 ± 0.01 | 0.778 ± 0.01 | 0.778 ± 0.01 | 0.762 ± 0.01 | 0.776 ± 0.01 | 0.775 ± 0.01 | 0.734 ± 0.01 | |
| Specificity | 0.781 ± 0.01 | 0.778 ± 0.01 | 0.784 ± 0.01 | 0.779 ± 0.01 | 0.785 ± 0.01 | 0.786 ± 0.00 | 0.792 ± 0.00 | 0.819 ± 0.00 | |
| Stability | 0.950 ± 0.02 | 0.922 ± 0.03 | 0.939 ± 0.01 | 0.885 ± 0.03 | 0.767 ± 0.03 | 0.992 ± 0.02 | 0.996 ± 0.02 | – | |
| # Features | 20 | 18 | 18 | 19 | 35 | 20 | 18 | 40 | |
| 4-years windows | AUC | 0.868 ± 0.00 | 0.868 ± 0.00 | 0.868 ± 0.00 | 0.865 ± 0.00 | 0.850 ± 0.00 | 0.869 ± 0.00 | 0.865 ± 0.00 | 0.859 ± 0.00 |
| Sensitivity | 0.793 ± 0.01 | 0.773 ± 0.01 | 0.774 ± 0.01 | 0.795 ± 0.01 | 0.785 ± 0.01 | 0.793 ± 0.01 | 0.789 ± 0.01 | 0.729 ± 0.01 | |
| Specificity | 0.788 ± 0.00 | 0.792 ± 0.01 | 0.791 ± 0.00 | 0.789 ± 0.01 | 0.782 ± 0.01 | 0.789 ± 0.00 | 0.789 ± 0.00 | 0.841 ± 0.01 | |
| Stability | 0.955 ± 0.02 | 0.908 ± 0.02 | 0.951 ± 0.01 | 0.862 ± 0.3 | 0.802 ± 0.03 | 0.923 ± 0.02 | 0.909 ± 0.02 | – | |
| # Features | 19 | 18 | 16 | 18 | 12 | 16 | 15 | 40 |
Top selected features using the ensemble approach with ADNI data (RPT threshold with β set as 10). Ranking positions of each feature are reported within brackets for the 2,3, and 4 years time windows, respectively
| Common features across all time windows | |
| Trail Making Test (Part B) - time (1,1,1) | AVDELTOT: AVLT Recognition (14,15,20) |
| Forgetting Index (2,2,2) | Boston Test Naming (15,12,11) |
| AVTOT15: RAVLT 15 (3,3,4) | ADAS-Cog Q4: Delayed word recall (16,17,16) |
| ADAS-Cog Total 13 (4,4,5) | ADAS-Cog Q8: Word recognition (17,16,15) |
| Trail Making Test (Part A) - time (5,5,3) | MMSE (total) (18,20,17) |
| ADAS-Cog Total 11 (6,7,8) | ADAS-Cog Q1: Word recall (19,21,19) |
| AVTOT6: RAVLT 6 (7,6,7) | Letter Fluency (20,22,21) |
| Logical Memory Immediate (8,8,9) | Age (21,18,18) |
| Category Fluency (9,10,6) | Years of symptoms (22,19,22) |
| AVDEL30: RAVLT delay (10,9,10) | CDR: Orientation (25,26,26) |
| FAQ: Activities of Daily Living (11,13,13) | CDR: Home (26,23,23) |
| Logical Memory Delayed (12,11,12) | AVTOTB: AVLT Interference (27,25,25) |
| MOCADMDL (13,14,15) | |
| Common features across one or two time windows | |
| ADAS-Cog Q7: Orientation (23,24,-) | MMORIENTOT (29,-,-) |
| GDS (24,27,-) | ADAS-Cog Q13: Number cancelation (−,29,-) |
| Years of formal education (28,-,27) | CDR: Judgment and problem solving (−,-,29) |
| MMDLRECALL (−,28,24) | CDR: Community Affair (−,-,30) |
Top selected features using the ensemble approach with CCC data (RPT threshold with β set as 10). Ranking positions of each feature are reported within brackets for the 2,3, and 4 years time windows, respectively
| Common features across all time windows | |
| Forgetting Index (1,1,1) | Verbal Paired-Associate Learning – Difficult (10,11,11) |
| Verbal Paired-Associate Learning – Total (2,2,2) | Verbal Paired-Associate Learning – Easy (11,10,10) |
| Cancelation Task – A’s time (3,4,6) | Word Recall (Total) (12,12,12) |
| Logical Memory Immediate A free recal (4,3,5) | Orientation (Total) (13,14,14) |
| Age (first symptoms) (5,8,7) | Raven Progressive Matrices (15,14,14) |
| Category Fluency (6,5,4) | Years of formal education (16,17,16) |
| Age (7,6,3) | Word Recall – Free recall (18,19,19) |
| Logical Memory A with Interference- free recall (8,7,9) | Cancelation Task – A’s total (19,18,18) |
| Logical Memory A Immediate Cued (9,9,8) | – |
| Common features across one or two time windows | |
| Interpretation of proverbs - (Verbal Abstraction) (17,-,20) | Calculation (19,-,-) |
| Information (−,16,17) | Orientation – Temporal (20,20,-) |