| Literature DB >> 29614658 |
Stefan Klöppel1,2,3,4, Maria Kotschi1,2,3, Jessica Peter4, Karl Egger2,5, Lucrezia Hausner6, Lutz Frölich6, Alex Förster7, Bernhard Heimbach1, Claus Normann3, Werner Vach8, Horst Urbach5, Ahmed Abdulkadir2,4.
Abstract
Older patients with depression or Alzheimer's disease (AD) at the stage of early dementia or mild cognitive impairment may present with objective cognitive impairment, although the pathology and thus therapy and prognosis differ substantially. In this study, we assessed the potential of an automated algorithm to categorize a test set of 65 T1-weighted structural magnetic resonance images (MRI). A convenience sample of elderly individuals fulfilling clinical criteria of either AD (n = 28) or moderate and severe depression (n = 37) was recruited from different settings to assess the potential of the pattern recognition method to assist in the differential diagnosis of AD versus depression. We found that our algorithm learned discriminative patterns in the subject's grey matter distribution reflected by an area under the receiver operator characteristics curve of up to 0.83 (confidence interval ranged from 0.67 to 0.92) and a balanced accuracy of 0.79 for the separation of depression from AD, evaluated by leave-one-out cross validation. The algorithm also identified consistent structural differences in a clinically more relevant scenario where the data used during training were independent from the data used for evaluation and, critically, which included five possible diagnoses (specifically AD, frontotemporal dementia, Lewy body dementia, depression, and healthy aging). While the output was insufficiently accurate to use it directly as a means for classification when multiple classes are possible, the continuous output computed by the machine learning algorithm differed between the two groups that were investigated. The automated analysis thus could complement, but not replace clinical assessments.Entities:
Keywords: Alzheimer’s disease; depression; magnetic resonance imaging; supervised machine learning; support vector machine
Mesh:
Year: 2018 PMID: 29614658 PMCID: PMC5900555 DOI: 10.3233/JAD-170964
Source DB: PubMed Journal: J Alzheimers Dis ISSN: 1387-2877 Impact factor: 4.472
Fig.1Use of the three data sets to estimate hyperparameters for the regression model (A), producing binary cross-validation predictions (B), and multi-class predictions based on a model that was independent from the test data (C). Each leave-one-out cross-validation fold and the training of the multi-class SVM involved a grid search with a series of inner cross-validation loops to find the optimal SVM parameter C (see the Methods for more details).
Demographics and basic clinical information on training and test data set including healthy controls (HC), patients with mild cognitive impairment or dementia due to Alzheimer’s disease (AD), patients with depression (DEP), frontotemporal dementia (FTD), or Lewy body dementia (LBD)
| Diagnostic group | Group size (female) | Mean age [y]±1 SD | Mean MMSE±1 SD | Relapsing depression | |
| TEST | AD | 28 (16) | 69.3±10.4 | 24.3±2.5 | n.a. |
| DEP | 37 (21) | 72.3±6.3 | 27.3±3.4 ( | 18 | |
| TRAIN | AD | 360 (178) | 75.2±7.8 | 23.1±2.0 | n.a. |
| DEP | 24 (16) | 70.4±5.0 | n.a. | 15 | |
| FTD | 39 (19) | 58.6±6.4 | 24.5±3.8 | n.a. | |
| LBD | 23 (7) | 73.4±4.6 | 22.7±3.5 | n.a. | |
| HC | 586 (299) | 74.7±5.8 | 29.0±1.2 | n.a. |
Note. The mini-mental state examination score (MMSE) from patients with depression was available only from 26 subjects of the test set. Healthy controls were used to correct for confounding effects of sex, age, and total intracranial volume.
Fig.2Data pre-processing for extraction of the raw features for each individual T1 weighted image. The pipeline extracts four different features sets from the native T1 image, including smoothed and unsmoothed voxel-wise local grey matter (GM) volumes, average GM volumes of seven ROIs identified in an independent meta-analysis [8], as well as weighted averages of local GM volumes weighted by the LONI probabilistic brain atlas (LPBA) [47]. The estimation of local grey matter was computed using the VBM8 toolbox (http://www.neuro.uni-jena.de/vbm/) with default parameters and modulation by the Jacobian determinant of the local non-linear deformation field. The number of features per feature set varies between 7 (spherical ROI features) and 254255 (unsmoothed voxel features).
Performance was evaluated in groups of patients with mild cognitive impairment dementia due to Alzheimer’s disease (AD) and patients with depression (DEP)
| Classification | Output | AUC [CI] | SE | SP | BAC |
| AD > DEP | cv_pAD | 0.83 [0.67 0.92] | 0.89 | 0.71 | 0.79 |
| AD > DEP | mc_pAD | 0.74 [0.58 0.84] | 0.70 | 0.61 | 0.67 |
| AD > DEP | 1-mc_pHC | 0.84 [0.69 0.93] | 0.43 | 0.86 | 0.63 |
| AD > DEP | 1-mc_pDEP | 0.51 [0.36 0.66] | 0.03 | 0.93 | 0.50 |
Evaluation of the classification performance of the test set obtained either by cross-validation (cv_pAD) or with multiple outputs of an independent multi-class classifier, specifically, multiclass probability for AD (mc_pAD), healthy (mc_pHC), and depression (mc_pDEP). The reported performance is based on the receiver operation characteristic curves (Fig. 3, top row). We report sensitivity and specificity at the 0.5 probability threshold of the classifier output. AD is defined as the positive class. AUC, area under the curve; 95% CI, confidence interval; SE, sensitivity or true positive rate; SP, specificity or 1-false positive rate; BAC, balanced accuracy.
Fig.3Class-probabilities for individuals and corresponding box plots (bottom row) grouped by diagnosis (AD/DEP) and prediction output (background color) with according receiver operator characteristics plots (top row). The bottom panel shows the class probabilities estimated by the cross validation (light blue, cv_pAD) and three class probabilities estimated by the multi-class classification, specifically, probability for AD (mc_pAD, orange), probability for HC (mc_pHC, yellow), and probability for depression (mc_pDEP, violet). The graphs in the top row use the same color-codes as the background in the bottom row and plot true versus false positive rate and highlight the positions at different probability thresholds (0.25/0.75 as dots and 0.5 as asterisk). AD, Alzheimer’s disease; DEP, depression.
Confusion matrix of reference test and multi-class classifier decision
| Index test (multi-class prediction based on structural MRI) | |||||||
| AD | DEP | HC | FTD | LBD | |||
| Reference test | AD | 3 | 5 | 2 | 0 | 28 | |
| (clinical | DEP | 13 | 20 | 2 | 0 | 37 | |
| examination) | 31 | 5 | 25 | 4 | 0 | 65 | |