| Literature DB >> 32671340 |
Marta M Correia1, Timothy Rittman2, Christopher L Barnes3, Ian T Coyle-Gilchrist2, Boyd Ghosh4, Laura E Hughes1,2, James B Rowe1,2,5.
Abstract
The early and accurate differential diagnosis of parkinsonian disorders is still a significant challenge for clinicians. In recent years, a number of studies have used magnetic resonance imaging data combined with machine learning and statistical classifiers to successfully differentiate between different forms of Parkinsonism. However, several questions and methodological issues remain, to minimize bias and artefact-driven classification. In this study, we compared different approaches for feature selection, as well as different magnetic resonance imaging modalities, with well-matched patient groups and tightly controlling for data quality issues related to patient motion. Our sample was drawn from a cohort of 69 healthy controls, and patients with idiopathic Parkinson's disease (n = 35), progressive supranuclear palsy Richardson's syndrome (n = 52) and corticobasal syndrome (n = 36). Participants underwent standardized T1-weighted and diffusion-weighted magnetic resonance imaging. Strict data quality control and group matching reduced the control and patient numbers to 43, 32, 33 and 26, respectively. We compared two different methods for feature selection and dimensionality reduction: whole-brain principal components analysis, and an anatomical region-of-interest based approach. In both cases, support vector machines were used to construct a statistical model for pairwise classification of healthy controls and patients. The accuracy of each model was estimated using a leave-two-out cross-validation approach, as well as an independent validation using a different set of subjects. Our cross-validation results suggest that using principal components analysis for feature extraction provides higher classification accuracies when compared to a region-of-interest based approach. However, the differences between the two feature extraction methods were significantly reduced when an independent sample was used for validation, suggesting that the principal components analysis approach may be more vulnerable to overfitting with cross-validation. Both T1-weighted and diffusion magnetic resonance imaging data could be used to successfully differentiate between subject groups, with neither modality outperforming the other across all pairwise comparisons in the cross-validation analysis. However, features obtained from diffusion magnetic resonance imaging data resulted in significantly higher classification accuracies when an independent validation cohort was used. Overall, our results support the use of statistical classification approaches for differential diagnosis of parkinsonian disorders. However, classification accuracy can be affected by group size, age, sex and movement artefacts. With appropriate controls and out-of-sample cross validation, diagnostic biomarker evaluation including magnetic resonance imaging based classifiers may be an important adjunct to clinical evaluation.Entities:
Keywords: Parkinson’s disease; corticobasal degeneration syndrome; magnetic resonance imaging; progressive supranuclear palsy; support vector machine
Year: 2020 PMID: 32671340 PMCID: PMC7325838 DOI: 10.1093/braincomms/fcaa051
Source DB: PubMed Journal: Brain Commun ISSN: 2632-1297
Figure 1Data analysis pipelines. Analysis pipelines for each combination of data type and feature extraction method. (A) T1-weighted MRI and ROIs. (B) T1-weighted MRI and PCA. (C) Diffusion MRI and ROIs. (D) Diffusion MRI and PCA.
Demographics, clinical severity scores and quality control information.
| Controls | PD | PSP-RS | CBS | Group difference | |
|---|---|---|---|---|---|
| Cross-validation group | |||||
| Sample size | 19 | 19 | 19 | 19 | |
| Sex F/M (%) | 36.8/63.2 | 47.4/52.6 | 42.1/57.9 | 52.6/47.4 |
|
| Age (years) | 66.2 ± 1.6 (54.9–81.1) | 65.0 ± 1.9 (46.9–76.9) | 69.1 ± 1.3 (60.8–83.2) | 68.2 ± 2.3 (39.1–88.2) |
|
| UPDRS-III score | − | 20.5 ± 2.1 (5–32) | 27.2 ± 3.4 (8–44) | 28.9 ± 3.8 (2–51) |
|
| MMSE score | 29.1 ± 0.2 (27–30) | 29.1 ± 0.3 (26–30) | 25.9 ± 0.8 (19–30) | 26.7 ± 0.9 (16–30) |
|
| TIV (×105) | 7.6 ± 0.17 (5.8–9.0) | 7.5 ± 0.16 (6.4–8.8) | 7.3 ± 0.16 (6.2–8.4) | 7.7 ± 0.17 (6.2–8.8) |
|
| MPRAGE smoothness FWHM | 2005.53 ± 34.7 (1796.92–2275.20) | 1993.3 ± 33.0 (1749.62–2296.70) | 2088.2 ± 30.3 (1801.92–2364.93) | 2047.7 ± 38.8 (1809.02–2401.53) |
|
| Absolute head displacement (DWI) | 1.62 ± 0.09 (1.30–2.77) | 1.85 ± 0.17 (1.25–3.82) | 1.71 ± 0.11 (1.27–3.38) | 1.61 ± 0.08 (1.25–2.43) |
|
| Relative head displacement (DWI) | 0.51 ± 0.02 (0.26–0.75) | 0.48 ± 0.03 (0.13–0.71) | 0.42 ± 0.02 (0.28–0.59) | 0.44 ± 0.03 (0.18–0.60) |
|
|
Independent validation group | |||||
| Sample size | 24 | 13 | 14 | 7 | |
| Sex F/M (%) | 58.3/41.7 | 38.5/61.5 | 50.0/50.0 | 42.9/57.1 |
|
| Age (years) | 69.7 ± 1.4 (51.6–81.6) | 69.1 ± 1.9 (54.1–75.8) | 70.9 ± 1.9 (57.9–84.1) | 62.2 ± 3.0 (53.1–75.0) |
|
| TIV (×105) | 7.4 ± 0.16 (6.0–9.2) | 7.5 ± 0.19 (6.1–8.2) | 7.7 ± 0.29 (6.7–8.6) | 7.3 ± 0.03 (6.0–9.0) |
|
| MPRAGE smoothness FWHM | 1989.6 ± 24.7 (1781.9–2238.2) | 2048.9 ± 46.2 (1824.9–2380.2) | 2092.1 ± 28.5 (1937.1–2266.6) | 2061.2 ± 78.9 (1863.2–2364.5) |
|
| Absolute head displacement (DWI) | 1.46 ± 0.04 (1.19–1.98) | 1.47 ± 0.08 (1.27–2.44) | 1.59 ± 0.06 (1.31–2.01) | 1.62 ± 0.11 (1.25–1.96) |
|
| Relative head displacement (DWI) | 0.48 ± 0.02 (0.23–0.70) | 0.48 ± 0.02 (0.25–0.59) | 0.44 ± 0.03 (0.32–0.61) | 0.48 ± 0.05 (0.30–0.65) |
|
Data are shown as mean ± standard error (range).
Chi-squared test,
ANOVA,
ANOVA,
Kruskal–Wallis ANOVA followed by non-parametric Mann–Whitney post hoc tests (Control > CBS P < 0.01, Control > PSP-RS P < 0.05, Parkinson’s disease > CBS P < 0.01 and Parkinson’s disease > PSP-RS P < 0.05),
ANOVA,
ANOVA,
Kruskal–Wallis ANOVA,
ANOVA,
Chi-squared test,
ANOVA,
ANOVA,
Welch’s ANOVA,
Kruskal–Wallis ANOVA,
ANOVA.
More details about the statistical results presented here, including justification for choice of statistical test, can be found in the Supplementary material, Section B.
Figure 2GM volume versus cortical thickness. Comparison between GM volume (blue) and cortical thickness (orange) as feature types. The range of classification accuracies presented for each pairwise comparison and combination of methodological variables corresponds to the results obtained as different numbers of features are included in the statistical model. (A) Cross-validation results when PCA is used for feature extraction. (B) Independent validation results when PCA is used for feature extraction. (C) Cross-validation results when ROIs are used for feature extraction. (D) Independent validation results when ROIs are used for feature extraction.
Classification accuracies achieved for pairwise comparisons using a leave-two-out cross-validation approach
| Mean accuracy (%) | IQR (%) | Max accuracy (%) | |
|---|---|---|---|
| T1-weighted data (GM volume maps) | |||
| GM maps + ROIs | |||
| C vs PD | 71.96 | 2.18 | 85.46 |
| C vs CBS | 83.36 | 0.59 | 91.69 |
| C vs PSP-RS | 73.74 | 5.68 | 81.02 |
| PD vs CBS | 77.93 | 3.88 | 85.87 |
| PD vs PSP-RS | 67.44 | 2.35 | 70.91 |
| PSP vs CBS | 62.16 | 0.55 | 65.65 |
| GM maps + PCA | |||
| C vs PD | 82.54 | 15.17 | 97.78 |
| C vs CBS | 90.33 | 8.00 | 99.31 |
| C vs PSP-RS | 84.60 | 10.77 | 95.84 |
| PD vs CBS | 87.31 | 9.28 | 97.09 |
| PD vs PSP-RS | 87.38 | 13.50 | 96.95 |
| PSP vs CBS | 88.23 | 16.17 | 100.0 |
| Diffusion MRI data (FA and MD maps) | |||
| FA and MD maps + ROIs | |||
| C vs PD | 61.26 | 12.88 | 75.21 |
| C vs CBS | 70.13 | 5.54 | 77.28 |
| C vs PSP-RS | 82.44 | 6.44 | 87.53 |
| PD vs CBS | 72.89 | 10.66 | 81.99 |
| PD vs PSP-RS | 74.93 | 6.58 | 85.04 |
| PSP vs CBS | 79.84 | 7.89 | 90.72 |
| FA and MD maps + PCA | |||
| C vs PD | 85.43 | 19.43 | 99.72 |
| C vs CBS | 90.06 | 9.90 | 97.37 |
| C vs PSP-RS | 92.51 | 10.15 | 99.86 |
| PD vs CBS | 84.40 | 7.34 | 91.55 |
| PD vs PSP-RS | 89.49 | 3.98 | 96.40 |
| PSP vs CBS | 94.95 | 12.67 | 100.0 |
For each pairwise comparison, two patients, one from each group, were left out of the training phase for each cross-validation fold and used to estimate model accuracy. The classification accuracies presented correspond to the mean and maximum accuracies obtained when different numbers of features (ROIs or PCA components) are included in the statistical model. Inter-quartile range (IQR) is also shown.
Classification accuracies achieved using the independent validation group
| Mean accuracy (%) | IQR (%) | Max accuracy (%) | |
|---|---|---|---|
| T1-weighted data (GM volume maps) | |||
| GM maps + ROIs | |||
| C vs PD | 47.75 | 8.11 | 64.86 |
| C vs CBS | 63.95 | 6.45 | 74.19 |
| C vs PSP-RS | 62.78 | 4.61 | 76.32 |
| PD vs CBS | 47.14 | 8.75 | 60.00 |
| PD vs PSP-RS | 57.67 | 3.70 | 62.96 |
| PSP vs CBS | 44.37 | 4.76 | 61.90 |
| GM maps + PCA | |||
| C vs PD | 55.66 | 5.41 | 67.57 |
| C vs CBS | 63.12 | 3.23 | 67.74 |
| C vs PSP-RS | 68.99 | 3.29 | 76.32 |
| PD vs CBS | 50.95 | 10.00 | 60.00 |
| PD vs PSP-RS | 71.87 | 4.63 | 81.48 |
| PSP vs CBS | 48.52 | 4.76 | 57.14 |
| Diffusion MRI data (FA and MD maps) | |||
| FA and MD maps + ROIs | |||
| C vs PD | 59.74 | 5.41 | 75.68 |
| C vs CBS | 78.74 | 5.37 | 87.09 |
| C vs PSP-RS | 90.49 | 2.63 | 94.74 |
| PD vs CBS | 77.90 | 15.00 | 85.00 |
| PD vs PSP-RS | 86.78 | 3.70 | 92.59 |
| PSP vs CBS | 76.33 | 4.76 | 80.95 |
| FA and MD maps + PCA | |||
| C vs PD | 57.63 | 10.81 | 72.97 |
| C vs CBS | 73.41 | 6.45 | 80.64 |
| C vs PSP-RS | 80.87 | 5.92 | 89.44 |
| PD vs CBS | 80.81 | 5.00 | 85.00 |
| PD vs PSP-RS | 81.48 | 3.70 | 88.89 |
| PSP vs CBS | 80.82 | 4.95 | 90.63 |
Seventy subjects (19 from each group) were used to train the model, and validation was performed on 58 unseen patients and controls. The classification accuracies presented correspond to the mean and maximum accuracies obtained when different numbers of features are included in the statistical model (ROIs or PCA components). Inter-quartile range (IQR) is also shown.
Figure 3PCA versus ROIs. Comparison between feature extraction methods: PCA (green) and ROIs (yellow). The range of classification accuracies presented for each pairwise comparison and combination of methodological variables corresponds to the results obtained as different numbers of features are included in the statistical model. (A) Cross-validation results when GM volume maps are used as feature type. (B) Independent validation results when GM volume maps are used as feature type. (C) Cross-validation results when FA and MD maps are used as feature type. (D) Independent validation results when FA and MD maps are used as feature type.
Figure 4Cross-validation versus independent validation. Comparison between cross-validation (magenta) and independent validation (blue) results. The range of classification accuracies presented for each pairwise comparison and combination of methodological variables corresponds to the results obtained as different numbers of features are included in the statistical model. (A) Results obtained when PCA is used for feature extraction, with GM volumes maps as feature type. (B) Results obtained when ROIs are used for feature extraction, with GM volumes maps as feature type. (C) Results obtained when PCA is used for feature extraction, with FA and MD maps as feature type. (B) Results obtained when ROIs are used for feature extraction, with FA and MD maps as feature type.
Figure 5Spatial localization of top classification features. Colour-coded images showing the relative importance of each ROI for pairwise classification using SVM. The top ranked feature for each pairwise group comparison has a normalized weight of 1 and is shown in yellow, while the least important features are shown in red. (A) Cortical and sub-cortical ROIs used for classification with GM volume features. (B) White matter ROIs used for classification with FA and MD features. The top row shows the spatial distribution of the most relevant FA features, while the bottom row shows the localization of the most relevant MD features.