| Literature DB >> 27378905 |
Jingjing Wang1, Stephen J Redmond1, Maxime Bertoux2, John R Hodges3, Michael Hornberger2.
Abstract
The clinical distinction between Alzheimer's disease (AD) and behavioral variant frontotemporal dementia (bvFTD) remains challenging and largely dependent on the experience of the clinician. This study investigates whether objective machine learning algorithms using supportive neuroimaging and neuropsychological clinical features can aid the distinction between both diseases. Retrospective neuroimaging and neuropsychological data of 166 participants (54 AD; 55 bvFTD; 57 healthy controls) was analyzed via a Naïve Bayes classification model. A subgroup of patients (n = 22) had pathologically-confirmed diagnoses. Results show that a combination of gray matter atrophy and neuropsychological features allowed a correct classification of 61.47% of cases at clinical presentation. More importantly, there was a clear dissociation between imaging and neuropsychological features, with the latter having the greater diagnostic accuracy (respectively 51.38 vs. 62.39%). These findings indicate that, at presentation, machine learning classification of bvFTD and AD is mostly based on cognitive and not imaging features. This clearly highlights the urgent need to develop better biomarkers for both diseases, but also emphasizes the value of machine learning in determining the predictive diagnostic features in neurodegeneration.Entities:
Keywords: AD; Bayesian; MRI; bvFTD; classification; machine learning
Year: 2016 PMID: 27378905 PMCID: PMC4909756 DOI: 10.3389/fnagi.2016.00119
Source DB: PubMed Journal: Front Aging Neurosci ISSN: 1663-4365 Impact factor: 5.750
Three classes of data, which include two disease classes, Alzheimer's disease (AD) and behavioral variant frontotemporal dementia (bvFTD), and a control group.
| Age (years) | 63.7 (8.1) | 61.2 (9.4) | 67.3 (6.8) | 0.001 |
| Gender (M/F) | 31/23 | 37/18 | 25/32 | 0.043 |
| Education (years) | 12.3 (3.7) | 12.3 (3.3) | 13.1 (2.8) | 0.138 |
| Disease duration (years) | 3.3 (2.1) | 4.7 (3.3) | – | 0.041 |
Age, years of education, and disease duration are tested for group differences using Kruskal-Wallis tests. Gender is tested for group differences using Chi-squared test. Only education is shown not to be different between groups at 5% level of significance.
Figure 1Block diagram of training and testing of Naïve Bayes classification model. One outer loop performs the testing, using 10 different groups with approximately 16 or 17 subjects in each group when n = 166 for three-way classification of AD, bvFTD, and control. The nine groups used for training in each run are subject to further feature selection to remove redundant or noisy features; each candidate feature subset is evaluated using an inner 10-fold cross-validation procedure.
Results for classification of AD vs. bvFTD (.
| Performance metric | Confusion matrix (22 confirmed cases) | |||
| Confusion matrix mean ± SD | 3.6 ± 1.17 3.5 ± 1.27 | 3.4 ± 1.08 2.1 ± 1.10 | 3.2 ± 0.92 2.0 ± 1.15 | |
| Cohen's kappa (Cohen's kappa for 22 confirmed cases) | 0.03 (0.10) | 0.25 (0.03) | 0.23 (−0.02) | |
| Accuracy, 95% CI | 51.38%, CI = [42.00%, 60.76%] | 62.39%, CI = [53.30%, 71.48%] | 61.47%, CI = [52.33%, 70.61%] | |
| (Accuracy, 95% CI for 22 confirmed cases) | (50.00%, CI = [29.11%, 70.89%]) | (54.55%, CI = [33.74%, 75.36%]) | (50.00%, CI = [29.11%, 70.89%]) | |
Each column of a confusion matrix represents the true class label, while each row represents the estimated class label. Within confusion matrices, the first columns/rows represent AD, while the second columns/rows represent bvFTD. The mean and standard deviation (SD) of each confusion matrix entry across the 10 cross-validation runs are also presented. Cohen's kappa coefficient and accuracy are calculated for the confusion matrix. The corresponding confirmed diagnoses are shown in parentheses. Approximate 95% confidence intervals (CI) are provided for classification accuracies.
Figure 2Accumulated feature selection results of 10-fold cross validation in discriminating AD and bvFTD using three different feature sets: MRI volumes (*Scan), neuropsychological (Cognitive), and both combined. Y-axis shows the name of selected features and X-axis shows the accumulated count of a corresponding feature being selected over the 10-folds. Three sets of features are displayed in different colors.
Results for classification of AD, bvFTD, and control (.
| Performance metric | Confusion matrix (confirmed cases) | |||
| Confusion matrix mean ± SD | 2.2 ± 1.23 2.6 ± 1.26 0.8 ± 0.63 | 2.9 ± 1.37 1.5 ± 1.08 0.0 ± 0.00 | 2.9 ± 0.99 1.7 ± 1.16 0.0 ± 0.00 | |
| Cohen's kappa (Cohen's kappa for confirmed cases) | 0.31 (−0.14) | 0.52 (−0.03) | 0.51 (0.13) | |
| Accuracy, 95% CI | 54.22%, CI = [46.64%, 61.80%] | 68.07%, CI = [60.98%, 75.16%] | 67.47%, CI = [60.34%, 74.60%] | |
| (Accuracy, 95% CI for 22 confirmed cases) | (18.18%, CI = [2.06%, 34.30%]) | (40.91%, CI = [20.36%, 61.46%]) | (45.45%, CI = [24.64%, 66.26%]) | |
Each column of a confusion matrix contains the actual disease diagnosis, while the rows contain the disease class estimated by the classifier. The first, second, and third columns/rows represent AD, bvFTD, and control, respectively. Corresponding results for confirmed diagnoses are shown in parentheses. Approximate 95% confidence intervals (CI) are provided for classification accuracies.
Figure 3Accumulated feature selection results of 10-fold cross validation in discriminating AD, bvFTD, and control classes using three different feature sets: MRI volumes (*Scan), neuropsychological (Cognitive), and both combined. Y-axis shows the name of selected features and X-axis shows the accumulated count of a corresponding feature being selected over the 10-folds. Three sets of features are displayed in different colors.