| Literature DB >> 31841681 |
M Belen Bachli1, Lucas Sedeño2, Jeremi K Ochab3, Olivier Piguet4, Fiona Kumfor4, Pablo Reyes5, Teresa Torralva6, María Roca6, Juan Felipe Cardona7, Cecilia Gonzalez Campo8, Eduar Herrera9, Andrea Slachevsky10, Diana Matallana11, Facundo Manes12, Adolfo M García13, Agustín Ibáñez14, Dante R Chialvo15.
Abstract
Accurate early diagnosis of neurodegenerative diseases represents a growing challenge for current clinical practice. Promisingly, current tools can be complemented by computational decision-support methods to objectively analyze multidimensional measures and increase diagnostic confidence. Yet, widespread application of these tools cannot be recommended unless they are proven to perform consistently and reproducibly across samples from different countries. We implemented machine-learning algorithms to evaluate the prediction power of neurocognitive biomarkers (behavioral and imaging measures) for classifying two neurodegenerative conditions -Alzheimer Disease (AD) and behavioral variant frontotemporal dementia (bvFTD)- across three different countries (>200 participants). We use machine-learning tools integrating multimodal measures such as cognitive scores (executive functions and cognitive screening) and brain atrophy volume (voxel based morphometry from fronto-temporo-insular regions in bvFTD, and temporo-parietal regions in AD) to identify the most relevant features in predicting the incidence of the diseases. In the Country-1 cohort, predictions of AD and bvFTD became maximally improved upon inclusion of cognitive screenings outcomes combined with atrophy levels. Multimodal training data from this cohort allowed predicting both AD and bvFTD in the other two novel datasets from other countries with high accuracy (>90%), demonstrating the robustness of the approach as well as the differential specificity and reliability of behavioral and neural markers for each condition. In sum, this is the first study, across centers and countries, to validate the predictive power of cognitive signatures combined with atrophy levels for contrastive neurodegenerative conditions, validating a benchmark for future assessments of reliability and reproducibility.Entities:
Keywords: Alzheimer’s disease; Classification; Executive functions; Frontotemporal dementia; Machine-learning; Voxel-based morphometry
Mesh:
Substances:
Year: 2019 PMID: 31841681 PMCID: PMC7008715 DOI: 10.1016/j.neuroimage.2019.116456
Source DB: PubMed Journal: Neuroimage ISSN: 1053-8119 Impact factor: 6.556
Summary of demographic data for each group.
| Country-1 | ||||
| C_1-FTD | HC | |||
| Age [years] | 66.72±9.56 | 68.73±8.48 | 0.57 | .452 |
| Education [years] | 15.05±2.97 | 15.86±2.92 | 0.85 | .360 |
| IFS | 16.30±7.01 | 25.37±1.84 | 46.46 | <.001 |
| ACE | 76.72±15.39 | 93.76±4.25 | 33.02 | <.001 |
| Gender [M/F] | F = 11 (16) | F = 21 (24) | 0.40 | .527 |
| M = 7 (13) | M = 9 (10) | |||
| Age [years] | 75.37±8.72 | 71.54±6.00 | 2.57 | .117 |
| Education [years] | 12.94±4.97 | 15.13±3.10 | 2.81 | .102 |
| IFS | 16.59±4.46 | 25.07±1.87 | 64.34 | <.001 |
| ACE | 69.50±14.11 | 93.90±4.28 | 58.89 | <.001 |
| Sex [M/F] | F = 13 (13) | F = 17 (18) | 0.08 | .766 |
| M = 3 (3) | M = 5 (6) | |||
| Age [years] | 66.55±9.37 | 61.18±7.74 | 2.72 | .109 |
| Education [years] | 15.89±2.31 | 14.73±5.41 | 0.38 | 0.542 |
| IFS | 12.78±6.21 | 22.87±2.97 | 38.13 | <.001 |
| ACE | – | – | – | – |
| Sex [M/F] | F = 7 (12) | F = 11 (16) | 2.03 | .155 |
| M = 2 (4) | M = 11 (12) | |||
| Age [years] | 64.90±9.44 | 69.50±6.43 | 1.88 | .184 |
| Education [years] | 12.20±3.47 | 14.27±2.77 | 2.50 | .128 |
| IFS | – | – | – | – |
| ACE | 75.36±14.99 | 96.25±2.42 | 22.73 | <.001 |
| Sex [M/F] | F = 4 (4) | F = 5 (7) | 0.06 | .794 |
| M = 7 (8) | M = 7 (8) | |||
| Age [years] | 64.00±5.83 | 69.50±6.43 | 4.34 | .051 |
| Education [years] | 12.80±2.89 | 14.27±2.77 | 1.47 | .238 |
| IFS | – | – | – | – |
| ACE | 62.10±11.58 | 96.25±2.41 | 100.09 | <.001 |
| Sex [M/F] | F = 3 (5) | F = 7 (8) | 1.76 | .183 |
| M = 7 (8) | M = 5 (7) | |||
Table 1. Subject groups for the two diseases and the three countries. Because of occasional missing data, we indicate for each gender and country, first the number of subjects actually used in the current study and then in parentheses the total of recruited subjects. Note also that different cognitive tests were used depending on the country. Age, education, IFS and ACE scores are given with the mean± SD. C_1 = Country-1; C_2 = Country-2; C_3 = Country-3.
Fig. 1.Standardize (grey box): data were standardized by converting them to z-scores, so that each feature in the control group had a zero mean and standard deviation of one. Data exploration (light orange boxes): these procedures were used only to explore and obtain knowledge about the behavior of the data. Clustering: we used a k-means algorithm with k = 2 to separate groups in two clusters to explore data distribution, and evaluate the presence of potential sub-groups of participants (details in section 2.4.1). Visualization and inspection: the hypothetically most informative features (cognitive screenings and atrophy) were inspected by graphing pairs of dimensions for the reference dataset (Country-1) (details in section 2.4). Principal component analysis (PCA): We used MATLAB’s default implementation of PCA to explore the most informative combination of features as measured by the explained variance of the data. Classification (light green boxes): Within-country classification: we implemented a logistic regression classifier with cognitive and brain atrophy features within the Country-1 dataset, given that it was the largest one with full completion of cognitive screenings. To evaluate the performance of this model, we used a leave-two-out cross-validation scheme. Cross-country classification: this was performed to further validate the generalization and prediction power of our findings. The logistic regression classifier was trained with Country-1 subjects and tested on participants from Country-2 or Country-3. Finally, to evaluate the relevance of each feature, after performing the classification with the whole feature set, the procedure was repeated but one-by-one each of the features was omitted in the classification (details in sections 2.4.2 and 2.4.3).
Fig. 3.Within-country classification.
The two top panels depict the histograms of the probability of belonging to the patient group, as revealed by logistic regression. The bottom panels correspond to ROC curves obtained for the groups’ data from the first row. Different curves show the ROC calculation omitting the feature denoted in the legend on a one-by-one basis.
Fig. 4.Cross-country classification.
The three top panels depict the real data distribution in z-score values. The three middle panels show the histograms of the probability of belonging to the diseased group, as revealed by logistic regression. The graphs in the three bottom panels correspond to the ROC curves obtained for each condition, by considering all the features (“All”) or by omitting the single features indicated in the legends. The fourth row illustrate the effect of removing a single feature from classification. Removing IFS or ACE affects results the most, which indicates their high informativeness in distinguishing FTD and AD from controls.
Within-country and cross-country classification results (for all features and excluding one feature at a time from the overall model).
| Group | All | IFS | ACE | Atrophy | Age | Gender | Scholarity | |
|---|---|---|---|---|---|---|---|---|
| C_1-AD | AUC | 0.956 | 0.918 | 0.924 | 0.967 | 0.964 | 0.958 | 0.959 |
| Acc | 0.938 | 0.865 | 0.906 | 0.938 | 0.967 | 0.938 | 0.938 | |
| 1-Spec | 1 | 0.824 | 1 | 1 | 1 | 1 | 1 | |
| Sens | 0.875 | 0.906 | 0.813 | 0.875 | 0.935 | 0.875 | 0.875 | |
| C_1-FTD | AUC | 0.967 | 0.921 | 0.958 | 0.957 | 0.969 | 0.969 | 0.965 |
| Acc | 0.912 | 0.885 | 0.895 | 0.885 | 0.920 | 0.912 | 0.893 | |
| 1-Spec | 0.937 | 0.937 | 0.794 | 0.935 | 0.902 | 0.935 | 0.898 | |
| Sens | 0.877 | 0.833 | 0.996 | 0.835 | 0.939 | 0.889 | 0.887 | |
| C_2-FTD | AUC | 0.935±0.022 | 0.800±0.035 | – | 0.919±0.026 | 0.949±0.020 | 0.935±0.019 | 0.925±0.022 |
| Acc | 0.913±0.000 | 0.784±0.008 | – | 0.913±0.011 | 0.914±0.007 | 0.913±0.000 | 0.913±0.000 | |
| 1-Spec | 1±0.000 | 0.999±0.007 | – | 0.997±0.016 | 1±0.000 | 1±0.000 | 1±0.000 | |
| Sens | 0.818±0.000 | 0.548±0.017 | – | 0.821±0.019 | 0.821±0.014 | 0.818±0.000 | 0.818±0.000 | |
| C_3-FTD | AUC | 0.906±0.021 | – | 0.795±0.040 | 0.938±0.021 | 0.919±0.0151 | 0.908±0.018 | 0.882±0.028 |
| Acc | 0.913±0.000 | – | 0.784±0.007 | 0.913±0.011 | 0.914±0.007 | 0.913±0.000 | 0.913±0.000 | |
| 1-Spec | 1±0.000 | – | 0.999±0.007 | 0.997±0.016 | 1±0.000 | 1±0.000 | 1±0.000 | |
| Sens | 0.818±0.000 | – | 0.548±0.017 | 0.821±0.019 | 0.821±0.014 | 0.818±0.000 | 0.818±0.000 | |
| C_3-AD | AUC | 1.0±0.000 | – | 0.936±0.060 | 1.0±0.000 | 1.0±0.000 | 1.0±0.000 | 1.0±0.000 |
| Acc | 1.0±0.000 | – | 0.906±0.029 | 1.0±0.000 | 1.0±0.000 | 1.0±0.000 | 1.0±0.000 | |
| 1-Spec | 1.0±0.000 | – | 0.847±0.031 | 1.0±0.000 | 1.0±0.000 | 1.0±0.000 | 1.0±0.000 | |
| Sens | 1.0±0.000 | – | 0.978±0.076 | 1.0±0.000 | 1.0±0.000 | 1.0±0.000 | 1.0±0.000 |
Table 2: Classification outcomes for each group and conditions (rows) computed for models using different features (columns). The column denoted “All” corresponds to results gathered from a model trained with all the features, while the others correspond to results obtained excluding the denoted feature. In all cases the model was trained with the Country-1 cohort, and prediction computed for the other countries. For cross-country classification the areas are provided with error estimates as described in Sec. III B 2. Auc = area under the ROC; Acc = maximal accuracy; 1-Spec = 1-specificity (for maximal accuracy); Sens = sensitivity (maximal accuracy).
Fig. 2.Atrophy measures and cognitive data distribution.
A. Real distribution of IFS and atrophy data. The degree of brain atrophy and the IFS score are standardized in z-scores. B. Real distribution of ACE and atrophy data. The degree of brain atrophy and the IFS score are standardized in z-scores. C. Clustering of IFS and atrophy results from panel A. Red-white circles represent patients wrongly identified as controls, and red diamonds represent controls who were mistaken with patients. D. Clustering of ACE and atrophy results from panel B. Red-white circles represent patients wrongly identified as controls, and red diamonds represent controls who were mistaken with patients.