| Literature DB >> 35284989 |
Giuseppe Pontillo1,2, Simone Penna3, Sirio Cocozza4, Mario Quarantelli5, Michela Gravina3, Roberta Lanzillo6, Stefano Marrone3, Teresa Costabile7, Matilde Inglese8,9, Vincenzo Brescia Morra6, Daniele Riccio3, Andrea Elefante4, Maria Petracca6, Carlo Sansone3, Arturo Brunetti4.
Abstract
OBJECTIVES: To stratify patients with multiple sclerosis (pwMS) based on brain MRI-derived volumetric features using unsupervised machine learning.Entities:
Keywords: Brain; Machine learning; Magnetic resonance imaging; Multiple sclerosis; Prognosis
Mesh:
Year: 2022 PMID: 35284989 PMCID: PMC9279232 DOI: 10.1007/s00330-022-08610-z
Source DB: PubMed Journal: Eur Radiol ISSN: 0938-7994 Impact factor: 7.034
Fig. 1Flowchart showing inclusion and exclusion criteria. Overall, 861 MS patients were considered for this study. After application of the inclusion and exclusion criteria, a total of 425 patients were selected, corresponding to 1129 MRI scans
Fig. 2Workflow illustrating the main data processing and analysis steps. Volumes of demyelinating lesions and 116 atlas-defined gray matter regions were automatically segmented based on FLAIR-T2w and T1-w images, respectively. Then, the corresponding volumes were expressed as z-scores with reference to external populations of patients and healthy controls that were also used to select the most altered MRI-derived volumes. Following feature selection, baseline MRI biomarkers entered the Subtype and Stage Inference (SuStaIn) algorithm, using 10-fold cross-validation to determine the optimal number of subtypes and the consistency of progression patterns. Models of up to a maximum of 4 subtypes with z-scores of 1, 2, or 3 for each biomarker were tested (excluding z-score events reached by fewer than 5% of the subjects), corresponding to interpretable levels of mild, moderate, and severe abnormality (color coded from blue to red). The trained model was then fit on all training data and applied to longitudinal MRIs. Finally, the biological reliability and clinical relevance of the SuStaIn classification were assessed in the light of longitudinal MRI scans and clinical outcomes
Demographic, clinical, and MRI characteristics of the studied population
| MS | HC | MS (external site) | |
|---|---|---|---|
| Number of subjects | 425 | 148 | 80 |
| Number of MRI scans | 1129 | 148 | 80 |
| Age (y) | 35.9 ± 9.9 | 35.9 ± 13.0 | 40.4 ± 11.9 |
| Female Sex* | 301 (70.8) | 77 (52.0) | 56 (70.0) |
| DD (y) | 12.7 ± 8.3 | - | 10.3 ± 7.4 |
| EDSS** | 2.5 (2.0 - 3.5) | - | 2.0 (1.5 - 3.0) |
| TLV (mL) | 10.1 ± 13.4 | - | 3.4 ± 5.3 |
| WBV (mL) | 1328.8 ± 127.9 | 1385.1 ± 147.4 | 1370.4 ± 153.3 |
*Data are the number of subjects, with percentages in parentheses.
**Data are medians, with interquartile ranges in parentheses.
MS, multiple sclerosis; HC, healthy controls; DD, disease duration; EDSS, Expanded Disability Status Scale; TLV, total lesion volume; WBV, whole brain volume
Fig. 3Results of the feature selection procedure. Gray matter regions whose volume survived the feature selection procedure (i.e., associated with a moderate to large effect size at the comparison with healthy controls) are presented, along with a lesion probability map (obtained by summing all the binary lesion masks and dividing by the number of patients, thresholded at 10% probability), all superimposed on axial slices of the average T1w volume in the standard space. Images are in radiological orientation
Fig. 4Positional variance diagrams for the two MRI-driven subtypes. Each entry describes the probability for each biomarker of reaching the color-coded z-score at each SuStaIn stage. The colors represent the degree of abnormality based on the z-score level (blue = mild, z-score of 1; violet = moderate, z-score of 2; red = severe, z-score of 3), while the color shade reflects the uncertainty associated with the corresponding biomarker event. CVS, cross-validation similarity; TLV, total lesion volume
Demographic, clinical and MRI characteristics of the MRI-driven subtypes
| DGM-first | Cortex-first | ||
|---|---|---|---|
| Age (y) | 35.9 ± 10.1 | 35.9 ± 9.5 | 0.98 |
| Female Sex* | 160 (67.2) | 141 (75.4) | 0.36 |
| DD (y) | 9.4 ± 7.8 | 6.5 ± 6.1 | |
| EDSS** | 2.5 (2.0-3.5) | 2.5 (2.0-3.0) | |
| SuStaIn stage | 4 (1-12) | 4 (1-8) | |
| TLV (mL) | 14.0 ± 15.1 | 5.5 ± 8.9 | |
| WBV (mL) | 1325.3 ± 126.8 | 1333.0 ± 129.5 | 0.65 |
Unless otherwise indicated, data are expressed as mean ± standard deviation. Between-group differences were tested with either Student t (age and DD), Pearson Chi-square (sex), Kruskal-Wallis (EDSS and SuStaIn stage), or age-, sex-, and TIV-corrected ANCOVA (TLV and WBV) tests.
*Data are the number of subjects, with percentages in parentheses.
**Data are medians, with interquartile ranges in parentheses.
***Significant between-group differences are reported in bold.
DGM, deep gray matter; DD, disease duration; EDSS, Expanded Disability Status Scale; TLV, total lesion volume; WBV, whole brain volume.
Results of the regression analyses for the prediction of clinical outcomes. For both ordinal (baseline and long-term EDSS and long-term BICAMS) and logistic (transition to SP course) regression analyses, the overall fit (R2 and F-statistic for ordinal, Nagelkerke R2 and -2LL for logistic regression) and associated significance level of the model are presented, along with the estimated parameters (and corresponding 5000-resamples bootstrap 95% CI and SE) and associated test statistic (t for ordinal, z for logistic regression) and significance level of both the intercept and individual predictors
| Model | Predictor | ||||||
|---|---|---|---|---|---|---|---|
| SE | |||||||
| Baseline EDSS | 0.256 | 27.245 | < 0.001 | ||||
| Constant | 1.597 (1.243, 1.952) | 0.180 | 57.333 | ||||
| SuStaIn subtype | −0.280 (−0.460, −0.100) | 0.092 | −3.056 | ||||
| SuStaIn stage | 0.042 (0.027, 0.058) | 0.008 | 5.368 | ||||
| SuStaIn subtype x stage | −0.012 (−0.042, 0.018) | 0.015 | −0.703 | 0.43 | |||
| Age | 0.033 (0.023, 0.042) | 0.005 | 6.639 | ||||
| Sex | −0.120 (−0.305, 0.066) | 0.095 | −1.263 | 0.21 | |||
| Long-term EDSS | 0.291 | 6.253 | < 0.001 | ||||
| Constant | 0.654 (−0.330, 1.637) | 0.498 | 1.311 | 0.19 | |||
| SuStain subtype | −0.059 (−0.287, 0.170) | 0.116 | −0.507 | 0.61 | |||
| SuStaIn stage | 0.030 (0.008, 0.052) | 0.011 | 2.726 | ||||
| SuStaIn subtype x stage | −0.016 (−0.059, 0.028) | 0.022 | −0.707 | 0.48 | |||
| Age | 0.015 (0.002, 0.028) | 0.006 | 2.353 | ||||
| Sex | 0.004 (−0.248, 0.256) | 0.128 | 0.029 | 0.98 | |||
| FU time | −0.112 (−0.207, −0.016) | 0.049 | −2.296 | ||||
| DMT** | 0.280 (0.102, 0.457) | 0.090 | 3.113 | ||||
| Long-term BICAMS | 0.287 | 13.492 | < 0.001 | ||||
| Constant | −0.269 (−1.798, 1.259) | 0.774 | −0.248 | 0.73 | |||
| SuStain subtype | −0.442 (−0.751, −0.133) | 0.157 | −2.824 | ||||
| SuStaIn stage | 0.048 (0.024, 0.072) | 0.012 | 3.946 | ||||
| SuStaIn subtype x stage | −0.080 (−0.130, −0.030) | 0.025 | −3.160 | ||||
| Age | 0.001 (−0.017, 0.020) | 0.009 | 0.120 | 0.90 | |||
| Sex | 0.196 (−0.140, 0.532) | 0.170 | 1.151 | 0.25 | |||
| FU time | 0.110 (−0.040, 0.261) | 0.076 | 1.450 | 0.15 | |||
| DMT** | 0.093 (−0.160, 0.346) | 0.128 | 0.726 | 0.47 | |||
| Long-term SP course* | 0.299 | 121.230 | < 0.001 | ||||
| Constant | −2.973 (−7.442, −1.496) | 2.280 | −1.304 | 0.19 | |||
| SuStain subtype | 0.422 (−0.556, 1.399) | 0.499 | 0.846 | 0.40 | |||
| SuStaIn stage | 0.079 (0.009, 0.149) | 0.036 | 2.204 | ||||
| SuStaIn subtype x stage | 0.044 (−0.103, 0.191) | 0.075 | 0.586 | 0.56 | |||
| Age | 0.095 (0.035, 0.155) | 0.031 | 3.091 | ||||
| Sex | −0.926 (−2.116, 0.263) | 0.607 | −1.526 | 0.13 | |||
| FU time | −0.301 (−0.694, 0.092) | 0.200 | −1.503 | 0.13 | |||
| DMT** | 0.571 (0.006, 1.135) | 0.288 | 1.980 | ||||
For all analyses, the DGM-first subtype was coded as 0 and the Cortex-first as 1.
Significant values are reported in bold
*Long-term course was coded as follows: SP course = 1, RR course = 0.
**DMT was coded as follows: no therapy = 0 (13 patients, 7.3%), interferon = 1 (140 patients, 78.7%), glatiramer acetate = 2 (5 patients, 2.8%), natalizumab = 3 (20 patients, 11.2%).
SP, secondary progressive; RR, relapsing remitting; LL, log-likelihood; CI, confidence interval; SE, standard error; FU, follow-up; DMT, disease-modifying therapy.