| Literature DB >> 31734524 |
Viktor Wottschel1, Declan T Chard2, Christian Enzinger3, Massimo Filippi4, Jette L Frederiksen5, Claudio Gasperini6, Antonio Giorgio7, Maria A Rocca4, Alex Rovira8, Nicola De Stefano7, Mar Tintoré8, Daniel C Alexander9, Frederik Barkhof10, Olga Ciccarelli2.
Abstract
Machine learning classification is an attractive approach to automatically differentiate patients from healthy subjects, and to predict future disease outcomes. A clinically isolated syndrome (CIS) is often the first presentation of multiple sclerosis (MS), but it is difficult at onset to predict who will have a second relapse and hence convert to clinically definite MS. In this study, we thus aimed to distinguish CIS converters from non-converters at onset of a CIS, using recursive feature elimination and weight averaging with support vector machines. We also sought to assess the influence of cohort size and cross-validation methods on the accuracy estimate of the classification. We retrospectively collected 400 patients with CIS from six European MAGNIMS MS centres. Patients underwent brain MRI at onset of a CIS according to local standard-of-care protocols. The diagnosis of clinically definite MS at one-year follow-up was the standard against which the accuracy of the model was tested. For each patient, we derived MRI-based features, such as grey matter probability, white matter lesion load, cortical thickness, and volume of specific cortical and white matter regions. Features with little contribution to the classification model were removed iteratively through an interleaved sample bootstrapping and feature averaging approach. Classification of CIS outcome at one-year follow-up was performed with 2-fold, 5-fold, 10-fold and leave-one-out cross-validation for each centre cohort independently and in all patients together. The estimated classification accuracy across centres ranged from 64.9% to 88.1% using 2-fold cross-validation and from 73% to 92.9% using leave-one-out cross-validation. The classification accuracy estimate was higher in single-centre, smaller data sets than in combinations of data sets, being the lowest when all patients were merged together. Regional MRI features such as WM lesions, grey matter probability in the thalamus and the precuneus or cortical thickness in the cuneus and inferior temporal gyrus predicted the occurrence of a second relapse in patients at onset of a CIS using support vector machines. The increased accuracy estimate of the classification achieved with smaller and single-centre samples may indicate a model bias (overfitting) when data points were limited, but also more homogeneous. We provide an overview of classifier performance from a range of cross-validation schemes to give insight into the variability across schemes. The proposed recursive feature elimination approach with weight averaging can be used both in single- and multi-centre data sets in order to bridge the gap between group-level comparisons and making predictions for individual patients.Entities:
Keywords: Feature selection; Machine learning classification; Multiple sclerosis
Mesh:
Year: 2019 PMID: 31734524 PMCID: PMC6861587 DOI: 10.1016/j.nicl.2019.102011
Source DB: PubMed Journal: Neuroimage Clin ISSN: 2213-1582 Impact factor: 4.881
Patient demographic and clinical characteristics per each MAGNIMS centre.
| Barcelona | Copenhagen | Graz | London | Milan | Siena | All | |
|---|---|---|---|---|---|---|---|
| No. of patients | 175 | 24 | 47 | 72 | 35 | 47 | 400 |
| Age [y] | 31.9(16–50) | 36.6(24–54) | 33.8(21–50) | 34.2(19–49) | 29.5(20–43) | 32.3(20–54) | 32.7 |
| Sex | 124F/51M | 14F/10M | 34F/13M | 44F/28M | 24F/11M | 25F/22M | 265F/135M |
| Median EDSS (range) | 2 (0–6) | 3.75 (1–5) | 1 (0–3.5) | 1 (0–8) | 2 (0–6) | 1.5 (0–2) | 2 (0-8) |
| Median global lesion load (range) [mm3] | 1299 (14–25220) | 461 (23–3270) | 82875 (6630–779726) | 849 (29–25581) | 1511 (38–19383) | 1868 (77–22796) | 1597 (14-779726) |
| Converters to CDMS at 1y follow-up | 19.4% | 25% | 23.4% | 30.6% | 22.9% | 21.3% | 22.8% |
| Type of CIS onset (brainstem / optic nerve / spinal cord / other) | 52/45/48/30 | 0/24/0/0 | 10/15/10/12 | 6/62/4/0 | 10/10/3/12 | 10/8/17/12 | 88/164/82/66 |
Fig. 1Illustration of the Neuromorphometrics atlas used for brain parcellation in this study.
Results for single centres using 2-fold cross-validation.
| Accuracy (95% CI) [%] | Sensitivity [%] | Specificity [%] | |
|---|---|---|---|
| Individual data set | |||
| Barcelona (B) | 64.9 (64.5–65.3) | 63.7 | 66.1 |
| Copenhagen (C) | 88.1 (87.4–88.8) | 79.4 | 96.8 |
| Graz (G) | 74.3 (73.7–75.0) | 63.8 | 84.9 |
| London (L) | 75.8 (75.3–76.3) | 74.3 | 77.3 |
| Milan (M) | 88.1 (87.5–88.7) | 79.4 | 96.8 |
| Siena (S) | 82.9 (82.3–83.4) | 68.9 | 96.8 |
| Combinations of data sets (first letter of sites) | |||
| BCGLMS | 64.8 (64.6–65.1) | 64.1 | 65.6 |
| BLMS | 65.2 (64.9–65.4) | 64.0 | 66.4 |
| BLM | 66.9 (66.6–67.2) | 66.4 | 67.4 |
Results for single centres using leave-one-out cross-validation.
| Accuracy (95% CI) [%] | Sensitivity [%] | Specificity [%] | |
|---|---|---|---|
| Individual data set | |||
| Barcelona (B) | 73.0 (72.6–73.3) | 72.6 | 73.3 |
| Copenhagen (C) | 92.9 (92.4–93.3) | 86.0 | 99.7 |
| Graz (G) | 82.8 (82.2–83.3) | 71.2 | 94.2 |
| London (L) | 91.3 (91.0–91.6) | 91.6 | 91.0 |
| Milan (M) | 92.0 (91.6–92.4) | 84.4 | 99.6 |
| Siena (S) | 87.2 (86.7–87.7) | 75.8 | 98.6 |
| Combinations of data sets (first letter of sites) | |||
| BCGLMS | 70.8 (70.6–71.0) | 70.3 | 71.3 |
| BLMS | 73.3 (73.0–73.5) | 73.2 | 73.3 |
| BLM | 73.2 (72.9–73.4) | 73.5 | 72.8 |
Fig. 2Accuracy estimates achieved at different iterations of the recursive feature selection when using all centres’ data sets combined together (BCGLMS)). The accuracy estimates increase with the first steps of the RFE, and the accuracy estimates generally increase with the number of folds. The shaded areas indicate 95% confidence intervals over 1000 bootstraps.
Fig. 3Accuracy estimates per centre or combination of centres for each cross-validation method. Corresponding values for confidence intervals, sensitivity and specificity can be found in Tables 2–5.
Fig. 4Top: bar chart of the proportion of converters in the cohort. Bottom: estimated classification accuracy relative to the size of the minority class. There is a general increase of estimated accuracy with a decrease in sample size. The subscript M and S indicate multi-centre and single-centre data sets respectively.
Fig. 5Location of features relevant to the prediction of CIS conversion at 1-year follow-up. The highlighted areas represent A: GM probability, B: regional volume sizes and C: cortical thickness respectively. Please note that white matter lesion load across the whole brain was also selected but is not shown here for clarity. Type of CIS onset was selected as the only non-imaging feature. A full list of features can be found in the supplementary material.
Results for single centres using 5-fold cross-validation.
| Accuracy (95% CI) [%] | Sensitivity [%] | Specificity [%] | |
|---|---|---|---|
| Individual data set | |||
| Barcelona (B) | 68.9 (68.5–69.3) | 66.9 | 70.9 |
| Copenhagen (C) | 91.3 (90.8–91.8) | 83.3 | 99.4 |
| Graz (G) | 77.8 (77.3–78.4) | 63.1 | 92.6 |
| London (L) | 88.0 (87.7–88.3) | 87.4 | 88.6 |
| Milan (M) | 91.5 (91.1–91.9) | 84.0 | 99.1 |
| Siena (S) | 83.7 (83.2–84.2) | 70.2 | 97.2 |
| Combinations of data sets (first letter of sites) | |||
| BCGLMS | 68.5 (68.3–68.7) | 67.8 | 69.3 |
| BLMS | 70.4 (70.1–70.6) | 69.9 | 70.85 |
| BLM | 69.8 (69.5–70.1) | 69.7 | 69.9 |
Results for single centres using 10-fold cross-validation (note that centres C and M had less than 10 converters and thus could not be used for 10-fold CV).
| Accuracy (95% CI) [%] | Sensitivity [%] | Specificity [%] | |
|---|---|---|---|
| Individual data set | |||
| Barcelona (B) | 70.6 (70.2–71.0) | 70.0 | 71.2 |
| Copenhagen (C) | NA | NA | NA |
| Graz (G) | 79.1 (78.5–79.7) | 67.9 | 90.3 |
| London (L) | 90.6 (90.3–90.9) | 88.9 | 92.2 |
| Milan (M) | NA | NA | NA |
| Siena (S) | 85.5 (85.0–86.0) | 73.0 | 98.1 |
| Combinations of data sets (first letter of sites) | |||
| BCGLMS | NA | NA | NA |
| BLMS | NA | NA | NA |
| BLM | NA | NA | NA |