| Literature DB >> 28724366 |
Telma Pereira1,2, Luís Lemos3,4, Sandra Cardoso5, Dina Silva6, Ana Rodrigues7, Isabel Santana7,8, Alexandre de Mendonça5, Manuela Guerreiro5, Sara C Madeira9,10.
Abstract
BACKGROUND: Predicting progression from a stage of Mild Cognitive Impairment to dementia is a major pursuit in current research. It is broadly accepted that cognition declines with a continuum between MCI and dementia. As such, cohorts of MCI patients are usually heterogeneous, containing patients at different stages of the neurodegenerative process. This hampers the prognostic task. Nevertheless, when learning prognostic models, most studies use the entire cohort of MCI patients regardless of their disease stages. In this paper, we propose a Time Windows approach to predict conversion to dementia, learning with patients stratified using time windows, thus fine-tuning the prognosis regarding the time to conversion.Entities:
Keywords: Mild cognitive impairment; Neurodegenerative diseases; Neuropsychological data; Prognostic prediction; Supervised learning; Time windows
Mesh:
Year: 2017 PMID: 28724366 PMCID: PMC5517828 DOI: 10.1186/s12911-017-0497-2
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Creation of learning examples following either the First Last approach or the Time Windows approach. A new class is created to define the type of patient’s progression (converting (cMCI) or non-converting (sMCI)) in the interval of k years from the baseline assessment (Time Windows approach) or with no time restrictions (FL approach)
Fig. 2Workflow of the proposed supervised learning approach to predict MCI-to-dementia conversion, based on time windows. It comprises four steps: 1) Data Preprocessing (construction of learning examples based on time windows), ) Model Learning (tune the model for each time window and FL datasets), ) Model Validation (validate the model (tuned to the CV set) with an independent validation set) and ) Using the model (Prognostic prediction of new MCI patients)
Fig. 3Flow chart of the final number of Cognitive Complaints Cohort (CCC) participants: a complete cohort; b cohort of patients recruited in Lisbon; c cohort of patients recruited in Coimbra
Baseline demographic and clinical characterization data
|
|
|
| |
|---|---|---|---|
| Age, years (M ± SD) | 71.7 ± 7.8 | 68.1 ± 8.6 | < 10−8 |
| Gender (male/female) | 93/164 | 196/266 | 0.102 |
| Formal education, years (M ± SD) | 8.9 ± 4.9 | 8.8 ± 4.7 | 0.612 |
| Follow-up time, years (M ± SD) | 2.9 ± 2.3 | 3.5 ± 3.0 | 0.007 |
Group comparisons (converter MCI vs. stable MCI) were performed with independent samples t-tests (or χ^2 Pearson Chi-Square test when appropriate#). Statistically significant (p < 0.05)
Set of parameters and corresponding ranges tested for each classifier within the grid search scheme
|
|
|
|---|---|
| NB |
|
| DT |
|
| SVM RBF |
|
| SVM Poly |
|
| kNN | # |
| RF | # |
| LR |
|
Note: DT: Decision Tree classifier, kNN: k-nearest neighbor classifier, SVM Poly: polynomial-kernel Support Vector Machines, SVM RB: Gaussian-kernel Support Vector Machines, NB: Naïve Bayes classifier, LR: Logistic Regression and RF: Random Forest
Details on CV and validation sets for time windows of 2 to 5 years and the First Last approach
| CV set | Validation set | |||
|---|---|---|---|---|
|
|
|
|
| |
| FL approach | 377 (62%) | 227 (38%) | 85 (74%) | 30 (26%) |
| 2-Year window | 280 (75%) | 94 (25%) | 53 (80%) | 13 (20%) |
| 3-Year window | 206 (60%) | 137 (40%) | 34 (61%) | 22 (39%) |
| 4-Year window | 146 (47%) | 166 (53%) | 22 (47%) | 25 (53%) |
| 5-Year window | 106 (36%) | 190 (64%) | 10 (28%) | 26 (72%) |
Note: sMCI- stable MCI; cMCI – converter MCI
Subset of selected features for each time window and FL dataset
| Features | FL approach | 2-Year window | 3-Year window | 4-Year window | 5-Year window |
|---|---|---|---|---|---|
| Age | X | X | X | X | X |
| Age of first symptons | X | X | X | X | |
|
| X | X | X | ||
|
| X | X | |||
|
| X | ||||
|
| X | X | X | ||
|
| X | X | X | ||
|
| X | X | X | X | X |
|
| X | X | X | X | X |
|
| X | X | X | X | X |
|
| X | X | X | ||
|
| X | X | X | X | X |
|
| X | X | |||
|
| X | X | X | X | |
|
| X | X | X | X | X |
|
| X | X | |||
|
| X | ||||
|
| X | X | X | X | X |
|
| X | ||||
|
| X | X | X | ||
| Token Orders (total) | X | ||||
|
| X | ||||
|
| X | ||||
|
| X | X | X | ||
| Raven Progressive Matrices | X | X | X | X | X |
| Trail Making Test (Part B) - time | X | ||||
| CVLT A list (1sttrial) | X | X | |||
| CVLT A list (3thtrial) | X | ||||
| CVLT A list (4thtrial) | X | X | |||
| CVLT A list (five learning trails total) | X | X | X | X | |
| CVLT A list (Total intrusions in 5 recalls) | X | ||||
| Blessed Dementia Scale (Total of Part 1 - Daily living activities) | X | ||||
| Fi_LM_a | X | X | |||
| Fi_LM_a_m100 | X | X | |||
|
| X | X | |||
|
| X | X | |||
|
| X | X | |||
|
| X | X | X | ||
|
| X | X | X | X | X |
|
| X | X | X | ||
|
| X | X | X | X | X |
|
| X | X | X | X | X |
|
| X | X | |||
|
| X | X | X | X | X |
|
| X | X | X | X | |
|
| X | X | X | X | X |
|
| X | ||||
|
| X | X | |||
| CVLT A list (5sttrial) (Z-score) | X | X | |||
|
| X | X | X | X | X |
|
| X | X | X | X |
The neuropsychological assessment was standardized according to the age and education norms for the Portuguese population and z-scores were calculated
Results of stratified 10 × 5-fold cross validation with the CV set (patients recruited in Lisbon, Table 3), under the Time Windows and the First Last approaches
| AUC | Sensitivity | Specificity | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| DT |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| kNN |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| SVM Poly |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| SVM RBF |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| NB |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| LR |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| RF |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Note: DT: Decision Tree classifier, kNN: k-Nearest Neighbor classifier, SVM Poly: polynomial-kernel Support Vector Machines, SVM RB: Gaussian-kernel Support Vector Machines, NB: Naïve Bayes classifier, LR: Logistic Regression and RF: Random Forest
The results were highlighted in bold whenever Time Windows approach outperformed the FL approach. cMCI represents the positive class
Fig. 4Results obtained with Naïve Bayes, the best classifier for the Time Windows and the First Last approaches, as assessed by the AUC values within a grid search scheme, under 10 × 5-fold cross validation (using the CV set)
Results of the best prognostic model using the independent validation set (patient recruited in Coimbra, Table 3), for the Time Windows and the First Last approaches
| AUC | Sensitivity | Specificity | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
The model was fine-tuned to the CV set (patient recruited in Lisbon, Table 3). cMCI represents the positive class