| Literature DB >> 33547592 |
Jawed Nawabi1,2, Helge Kniep3, Sarah Elsayed3, Constanze Friedrich3, Peter Sporns3,4, Thilo Rusche4, Maik Böhmer5, Andrea Morotti6, Frieder Schlunk7, Lasse Dührsen8, Gabriel Broocks3, Gerhard Schön9, Fanny Quandt10, Götz Thomalla9, Jens Fiehler3, Uta Hanning3.
Abstract
We hypothesized that imaging-only-based machine learning algorithms can analyze non-enhanced CT scans of patients with acute intracerebral hemorrhage (ICH). This retrospective multicenter cohort study analyzed 520 non-enhanced CT scans and clinical data of patients with acute spontaneous ICH. Clinical outcome at hospital discharge was dichotomized into good outcome and poor outcome using different modified Rankin Scale (mRS) cut-off values. Predictive performance of a random forest machine learning approach based on filter- and texture-derived high-end image features was evaluated for differentiation of functional outcome at mRS 2, 3, and 4. Prediction of survival (mRS ≤ 5) was compared to results of the ICH Score. All models were tuned, validated, and tested in a nested 5-fold cross-validation approach. Receiver-operating-characteristic area under the curve (ROC AUC) of the machine learning classifier using image features only was 0.80 (95% CI [0.77; 0.82]) for predicting mRS ≤ 2, 0.80 (95% CI [0.78; 0.81]) for mRS ≤ 3, and 0.79 (95% CI [0.77; 0.80]) for mRS ≤ 4. Trained on survival prediction (mRS ≤ 5), the classifier reached an AUC of 0.80 (95% CI [0.78; 0.82]) which was equivalent to results of the ICH Score. If combined, the integrated model showed a significantly higher AUC of 0.84 (95% CI [0.83; 0.86], P value <0.05). Accordingly, sensitivities were significantly higher at Youden Index maximum cut-offs (77% vs. 74% sensitivity at 76% specificity, P value <0.05). Machine learning-based evaluation of quantitative high-end image features provided the same discriminatory power in predicting functional outcome as multidimensional clinical scoring systems. The integration of conventional scores and image features had synergistic effects with a statistically significant increase in AUC.Entities:
Keywords: Intracerebral hemorrhage; Machine Learning; Outcome prediction; Radiomics
Mesh:
Year: 2021 PMID: 33547592 PMCID: PMC8557152 DOI: 10.1007/s12975-021-00891-8
Source DB: PubMed Journal: Transl Stroke Res ISSN: 1868-4483 Impact factor: 6.829
Baseline demographic, clinical, and radiological characteristics of study cohort
| Baseline characteristics | All | mRS 0–3 | mRS 4–6 | |
|---|---|---|---|---|
| Clinical parameters | ||||
| Age [years], median (IQR) | 73 (59; 79) | 70 (57; 78) | 73 (60; 80) | 0.85 |
| Female, | 234 (45.0) | 67 (44.4) | 167 (45.3) | 0.85 |
| Hypertension, | 359 (69.2) | 99 (65.6) | 260 (70.7) | 0.25 |
| Diabetes mellitus, | 73 (14.0) | 23 (15.2) | 50 (13.7) | 0.62 |
| Antiplatelet medication, | 104 (20) | 33 (21.9) | 71 (19.2) | 0.50 |
| Anticoagulant medication, | 113 (21.7) | 34 (22.5) | 79 (21.4) | 0.78 |
| Systolic blood pressure [mm Hg], median (IQR) | 162 (138; 193.75) | 162 (145; 185) | 160 (135; 197.5) | 0.75 |
| Time from symptom onset to CT [days], median (IQR) | 0.19 (0.76; 0.52) | 0.21 (0.09; 0.59) | 0.19 (0.07; 0.52) | 0.92 |
| Time from CT to discharge [days], median (IQR) | 14 (7; 22) | 14 (6.5; 18.5) | 15.5 (6.75; 23.5) | 0.13 |
| Clinical scores | ||||
| GCS Score, median (IQR) | 11 (5; 14) | 14 (12; 15) | 9 (3; 13) | <0.001 |
| ICH Score, median (IQR) | 2 (1; 3) | 1 (0; 2) | 3 (2; 4) | <0.001 |
| CT parameters | ||||
| Bleeding location, | ||||
- Lobar - Basal ganglia - Thalamus - Brainstem and pons - Cerebellar | 238(45.8) 198 (38.1) 18 (3.5) 23 (4.4) 43 (8.3) | 76 (50.3) 54 (35.8) 3 (2.0) 5 (3.3) 13 (8.6) | 162 (43.9) 144 (39.0) 15 (4.1) 18 (4.9) 30 (8.1) | 0.18 0.49 0.24 0.43 0.86 |
| Intraventricular hemorrhage, | 267 (51.3) | 50 (33.1) | 217 (59) | <0.001 |
| ICH volume [mL], median (IQR) | 25.1 (9.7; 60.3) | 11.5 (3.6; 24.7) | 35.5 (14.9; 73.2) | <0.001 |
| Surgical procedures | ||||
| Craniectomy, median (IQR) | 117 (22.5) | 16 (10.6) | 101 (27.4) | <0.001 |
Comparison of baseline demographic, clinical, and radiological characteristics between ICH patients with good clinical outcome (modified Rankin Scale (mRS) 0–3) versus poor clinical outcome (mRS 4–6). ICH Score, Intracerebral Hemorrhage Score; GCS, Glasgow Coma Scale; IQR, interquartile range
Fig. 1Conceptual overview of the proposed machine learning approach for intracerebral hemorrhage outcome prediction showing the major processing steps: CT based image acquisition and segmentation, feature extraction (n = 1218), and statistical learning (random forest algorithm). NECT, non-contrast-enhanced computed tomography; ICH, intracerebral hemorrhage; CT, computed tomography; mRS, modified Rankin Scale; CV, cross-validation set with i: inner loop and o: outer loop
Fig. 2Receiver-operating characteristics (ROC) curves for (a) functional outcome prediction of the proposed machine learning classifier based on quantitative image features and (b) prediction of survival using the ICH Score, the proposed machine learning classifier based on quantitative image features, and a classifier integrating ICH Score metrics and quantitative image features. AUC, area under the curve; CI, confidence interval; mRS, modified Rankin Scale
Classification performance of imaging-based outcome prediction
| Prediction mRS | Patients | AUC | MCC max | Sensitivity | Specificity | Accuracy | PPV | NPV |
|---|---|---|---|---|---|---|---|---|
| A) Imaging-based ML prediction of functional outcome (mRS) | ||||||||
| mRS ≤ 2 | 75/520(14%) | 0.8[0.77; 0.82] | 0.36[0.33; 0.39] | 73%[68%; 77%] | 73%[72%; 75%] | 73%[72%; 75%] | 31%[28%; 35%] | 94%[93%; 95%] |
| mRS ≤ 3 | 151/520(29%) | 0.8[0.78; 0.81] | 0.42[0.39; 0.45] | 71%[67%; 74%] | 73%[71%; 75%] | 72%[71%; 74%] | 52%[49%; 55%] | 86%[84%; 88%] |
| mRS ≤ 4 | 259/520(50%) | 0.79[0.77; 0.8] | 0.47[0.44; 0.5] | 71%[68%; 73%] | 71%[68%; 73%] | 71%[69%; 72%] | 70%[68%; 73%] | 71%[68%; 73%] |
| B) Imaging-based ML, ICH Score, and integrated model-based prediction of survival (mRS ≤ 5) | ||||||||
| Imaging-based ML classifier | 375/520(72%) | 0.8[0.78; 0.82] | 0.46[0.43; 0.49] | 73%[70%; 75%] | 71%[67%; 74%] | 72%[70%; 74%] | 87%[85%; 88%] | 50%[47%; 53%] |
| ICH Score | 375/520(72%) | 0.80*[0.78; 0.82] | 0.46[0.43; 0.49] | 74%*[72%; 75%] | 76%[73%; 79%] | 74%[72%; 76%] | 89%[87%; 90%] | 53%[50%; 56%] |
| Image features + ICH Score | 375/520(72%) | 0.84*[0.86; 0.83] | 0.49[0.46; 0.52] | 77%*[75%; 79%] | 76%[72%; 79%] | 77%[75%; 78%] | 89%[87%; 91%] | 56%[53%; 59%] |
*P value <0.05
Prediction of outcome in patients with acute ICH: Number of patients with respective outcome (positive class) and performance metrics of A) imaging-based random forest machine learning classification at different mRS levels and B) ICH Score–based prediction compared to an integrated model using knowledge derived from both the machine learning algorithm and the ICH Score. Metrics are shown at Youden Index maximum cut-off points. Bonferroni corrections have been applied to account for alpha spending error. Results are based on nested 5-fold cross-validation of 520 patients from three different centers. AUC, receiver-operating-characteristic area under the curve; PPV, positive predictive value; NPV, negative predictive value; MCC, Matthews correlation coefficient; CI, confidence interval; mRS, modified Rankin Scale
Fig. 3Predictive value of quantitative image features. Bar charts show mean Gini impurity feature importance of all cross-validation training sets of the top- 15 high-end image features. Pie charts show distribution of feature classes and applied filters in utilized top-100 predictors. First-order metrics: Basic statistical metrics of the voxel grey level distribution; glcm: gray level co-occurrence matrix; gldm: gray level dependence matrix; glrlm: gray level run length matrix; glszm: gray level size zone; H: high-pass wavelet decomposition; L: low-pass wavelet decomposition