| Literature DB >> 35522333 |
Gianluca Trevisi1,2, Valerio Maria Caccavella3, Alba Scerrati4,5, Francesco Signorelli3, Giuseppe Giovanni Salamone1, Klizia Orsini1, Christian Fasciani1, Sonia D'Arrigo6, Anna Maria Auricchio3, Ginevra D'Onofrio3, Francesco Salomi4, Alessio Albanese3, Pasquale De Bonis4,5, Annunziato Mangiola1,2, Carmelo Lucio Sturiale7,8.
Abstract
Spontaneous intracerebral hemorrhage (ICH) has an increasing incidence and a worse outcome in elderly patients. The ability to predict the functional outcome in these patients can be helpful in supporting treatment decisions and establishing prognostic expectations. We evaluated the performance of a machine learning (ML) model to predict the 6-month functional status in elderly patients with ICH leveraging the predictive value of the clinical characteristics at hospital admission. Data were extracted by a retrospective multicentric database of patients ≥ 70 years of age consecutively admitted for the management of spontaneous ICH between January 1, 2014 and December 31, 2019. Relevant demographic, clinical, and radiological variables were selected by a feature selection algorithm (Boruta) and used to build a ML model. Outcome was determined according to the Glasgow Outcome Scale (GOS) at 6 months from ICH: dead (GOS 1), poor outcome (GOS 2-3: vegetative status/severe disability), and good outcome (GOS 4-5: moderate disability/good recovery). Ten features were selected by Boruta with the following relative importance order in the ML model: Glasgow Coma Scale, Charlson Comorbidity Index, ICH score, ICH volume, pupillary status, brainstem location, age, anticoagulant/antiplatelet agents, intraventricular hemorrhage, and cerebellar location. Random forest prediction model, evaluated on the hold-out test set, achieved an AUC of 0.96 (0.94-0.98), 0.89 (0.86-0.93), and 0.93 (0.90-0.95) for dead, poor, and good outcome classes, respectively, demonstrating high discriminative ability. A random forest classifier was successfully trained and internally validated to stratify elderly patients with spontaneous ICH into prognostic subclasses. The predictive value is enhanced by the ability of ML model to identify synergy among variables.Entities:
Keywords: Conventional statistics; Hemorrhagic stroke; Intracerebral hemorrhage; Intracranial hemorrhage; Machine learning; Outcome
Mesh:
Year: 2022 PMID: 35522333 PMCID: PMC9349060 DOI: 10.1007/s10143-022-01802-7
Source DB: PubMed Journal: Neurosurg Rev ISSN: 0344-5607 Impact factor: 2.800
Fig. 1Machine learning workflow. [1] Data extraction and patient selection from a multicentric database. [2] Features selection, hyperparameter tuning, and random forest classifier training. [3] Evaluation of average performance metrics and confidence interval bootstrapping. ICH, intracerebral hemorrhage
Univariate analysis: significant variables
| Parameter | Total ( | Dead ( | Poor outcome ( | Good outcome ( | Corrected |
|---|---|---|---|---|---|
| Age | 79.85 (± 6.35) | 80.89 (± 6.21) | 80.85 (± 6.66) | 77.72 (± 5.67) | < 0.001* |
| IVH | 215 (27.0%) | 129 (42.86%) | 56 (22.67%) | 30 (11.49%) | < 0.001* |
| SAH | 112 (14.0%) | 59 (19.6%) | 34 (13.77%) | 19 (7.28%) | 0.001* |
| Hematoma volume | 35.68 (± 42.14) | 61.15 (± 52.38) | 26.39 (± 28.21) | 15.1 (± 18.47) | < 0.001* |
| GCS at admission | 10.43 (± 4.12) | 6.92 (± 3.66) | 11.51 (± 3.13) | 13.46 (± 1.88) | < 0.001* |
| ICH score | 1.96 (± 1.5) | 3.14 (± 1.35) | 1.66 (± 1.16) | 0.88 (± 0.86) | < 0.001* |
| Pupillary status at admission | |||||
| Isochoric | 603 (74.54%) | 132 (43.85%) | 219 (88.66%) | 252 (96.55%) | < 0.001* |
| Anisocoric | 116 (14.34%) | 96 (31.89%) | 14 (5.67%) | 6 (2.3%) | |
| Mydriatic | 45 (5.56%) | 39 (12.96%) | 5 (2.02%) | 1 (0.38%) | |
| Miotic | 45 (5.56%) | 34 (11.3%) | 9 (3.64%) | 2 (0.77%) | |
| Comorbidities | |||||
| Renal insufficiency | 51 (6.0%) | 17 (5.65%) | 9 (3.45%) | 25 (10.12%) | 0.040* |
| Neurological | 195 (24.0%) | 63 (20.93%) | 54 (20.69%) | 78 (31.58%) | 0.026* |
| Charlson Comorbidity Index | 3.36 (± 2.56) | 3.34 (± 2.65) | 2.79 (± 2.17) | 3.99 (± 2.69) | < 0.001* |
| No. of comorbidities | 2.6 (± 1.61) | 2.44 (± 1.41) | 2.4 (± 1.52) | 3.02 (± 1.85) | < 0.001* |
| Pharmacotherapy | |||||
| Antiplatelet | 324 (40.0%) | 141 (46.84%) | 92 (37.25%) | 91 (34.87%) | 0.046* |
| Anticoagulant/antiplatelet | 494 (61.0%) | 216 (71.76%) | 143 (57.89%) | 135 (51.72%) | < 0.001* |
| Antacids | 193 (24.0%) | 98 (32.56%) | 53 (21.46%) | 42 (16.09%) | < 0.001* |
| Number of anticoagulants or antiplatelets | 0.64 (± 0.56) | 0.74 (± 0.55) | 0.6 (± 0.54) | 0.55 (± 0.57) | 0.001* |
| No. of drugs | 3.47 (± 1.98) | 3.78 (± 2.01) | 3.35 (± 2.03) | 3.23 (± 1.85) | 0.014* |
| Topography | |||||
| Frontal | 226 (28.0%) | 121 (40.2%) | 62 (25.1%) | 43 (16.48%) | < 0.001* |
| Temporal | 223 (28.0%) | 113 (37.54%) | 67 (27.13%) | 43 (16.48%) | < 0.001* |
| Brainstem | 29 (4.0%) | 17 (5.65%) | 11 (4.45%) | 1 (0.38%) | 0.016* |
| Cerebellum | 70 (9.0%) | 14 (4.65%) | 18 (7.29%) | 38 (14.56%) | 0.001* |
Data reported as the number of patients (%) and mean (± SD)
IVH intraventricular hemorrhage, SAH subarachnoid hemorrhage, GCS Glasgow Coma Scale
*Significant at p ≤ 0.05 after Holm-Bonferroni correction
Fig. 2A–C AUC-ROC curves (on both training and hold-out test set) for each diagnostic outcome class and global confusion matrix. D Importance of permutated features for the random forest classifier
Random forest prediction model performance metrics
| Performance metrics | Dead | Poor outcome | Good outcome |
|---|---|---|---|
| AUC | 0.96 (0.94–0.98) | 0.89 (0.86–0.93) | 0.93 (0.90–0.95) |
| Accuracy | 89.71% (86.42–93.01%) | 82.27% (78.60–86.43%) | 83.55% (79.82–87.66%) |
| Precision (PPV) | 87.58% (82.69–92.86%) | 70.46% (62.69–80.01%) | 72.14% (65.59–79.45%) |
| Recall (sensitivity) | 86.92% (80.61–92.86%) | 65.04% (55.07–75.36%) | 77.52% (68.42–86.84%) |
| Specificity | 91.62% (87.59–95.19%) | 89.10% (85.06–93.11%) | 86.29% (82.04–91.02%) |
| F1 score | 0.87 (0.83–0.92) | 0.68 (0.60–0.76) | 0.75 (0.68–0.81) |
| FPR | 8.29% (5.35–14.42%) | 11.82% (7.06–16.34%) | 14.21% (10.70–19.87%) |
| NPV | 91.75% (88.12–95.24%) | 87.05% (83.36–90.21%) | 89.52% (86.31–94.26%) |
Performance metrics of the random forest prediction model on the hold-out test set were computed adopting a One-vs-Rest (OVR) multiclass strategy. Average value and 95% bootstrap confidence interval are reported
AUC area under the curve, PPV positive predictive value, FPR false positive rate, NPV negative predictive value
Multivariate logistic regression: significant variables
| Outcome | Parameter | Odds ratio | 95% CI | |
|---|---|---|---|---|
| Death | Cerebellum | 0.380 | 0.143–0.999 | 0.049 |
| Age | 1.05 | 1.004–1.097 | 0.033 | |
| Hematoma volume | 1.020 | 1.008–1.033 | 0.001 | |
| GCS at admission | 0.651 | 0.574–0.737 | < 0.001 | |
| ICH score | 1.673 | 1.187–2.358 | 0.003 | |
| Poor outcome | Brainstem | 10.834 | 1.278–91.851 | 0.029 |
| Age | 1.057 | 1.017–1.098 | 0.005 | |
| Charlson Comorbidity Index | 1.211 | 1.096–1.337 | < 0.001 | |
| Hematoma volume | 1.012 | 1.001–1.024 | 0.036 | |
| GCS at admission | 0.760 | 0.677–0.855 | < 0.001 |
GCS Glasgow Coma Scale, ICH intracerebral hematoma
Fig. 3Output of local interpretable model–agnostic explanation (LIME) with random forest classifiers applied to one correctly predicted patient that died within 6 months. The figure reveals the role of various features in influencing the outcome prediction for each patient. A Patient’s characteristics. B Features contributions on predicted probabilities (red, risk factor; blue, protective factor). C Predicted probability of death at 6 months. IVH, intraventricular hemorrhage; GCS, Glasgow Coma Scale; ICH, intracerebral hemorrhage