| Literature DB >> 35055424 |
Rui Guo1, Renjie Zhang1,2, Ran Liu3, Yi Liu1, Hao Li1, Lu Ma1, Min He1, Chao You1,4, Rui Tian1.
Abstract
Spontaneous intracerebral hemorrhage (SICH) has been common in China with high morbidity and mortality rates. This study aims to develop a machine learning (ML)-based predictive model for the 90-day evaluation after SICH. We retrospectively reviewed 751 patients with SICH diagnosis and analyzed clinical, radiographic, and laboratory data. A modified Rankin scale (mRS) of 0-2 was defined as a favorable functional outcome, while an mRS of 3-6 was defined as an unfavorable functional outcome. We evaluated 90-day functional outcome and mortality to develop six ML-based predictive models and compared their efficacy with a traditional risk stratification scale, the intracerebral hemorrhage (ICH) score. The predictive performance was evaluated by the areas under the receiver operating characteristic curves (AUC). A total of 553 patients (73.6%) reached the functional outcome at the 3rd month, with the 90-day mortality rate of 10.2%. Logistic regression (LR) and logistic regression CV (LRCV) showed the best predictive performance for functional outcome (AUC = 0.890 and 0.887, respectively), and category boosting presented the best predictive performance for the mortality (AUC = 0.841). Therefore, ML might be of potential assistance in the prediction of the prognosis of SICH.Entities:
Keywords: 90-day function outcome; machine learning; mortality; spontaneous intracerebral hemorrhage (SICH)
Year: 2022 PMID: 35055424 PMCID: PMC8778760 DOI: 10.3390/jpm12010112
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Figure 1Flowchart of SICH patient inclusion and exclusion.
Clinical characteristics of the patients with spontaneous intracerebral hemorrhage (SICH).
| Variables | Functional Outcome | Mortality | ||||
|---|---|---|---|---|---|---|
| Favorable | Unfavorable | Survival | Death | |||
| Demographics | ||||||
| Age, years | 54.0 (46.0–66.0) | 58.9 (43.7–74.0) | 0.004 ** | 54.0 (46.0–66.0) | 65.5 (52.5–77.0) | <0.001 *** |
| Gender, | 0.70 | 0.20 | ||||
| Female | 189 (74.70%) | 64 (25.30%) | 232 (92.06%) | 20 (7.94%) | ||
| Male | 364 (73.09%) | 134 (26.91%) | 443 (88.78%) | 56 (11.22%) | ||
| Clinical features | ||||||
| Location, | <0.001 *** | <0.001 *** | ||||
| Supratentorial | 475 (78.51%) | 130 (21.49%) | 556 (91.90%) | 49 (8.10%) | ||
| Infratentorial | 72 (58.06%) | 52 (41.94%) | 106 (85.48%) | 18 (14.52%) | ||
| Supra and Infra | 6 (27.27%) | 16 (72.73%) | 13 (59.09%) | 9 (40.91%) | ||
| Initial volume, mL | 25.0 (15.0–35.0) | 34.9 (19.3–50.5) | <0.001 *** | 25.0 (15.0–35.0) | 35.0 (20.0–46.2) | <0.001 *** |
| IVH, | <0.001 *** | 0.001 ** | ||||
| Yes | 253 (63.73%) | 144 (36.27%) | 342 (86.15%) | 55 (13.85%) | ||
| No | 300 (84.75%) | 54 (15.25%) | 333 (94.07%) | 21 (5.93%) | ||
| GCS | 13 (9–15) | 8 (6–8) | <0.001 *** | 13 (8–15) | 7 (4–10) | <0.001 *** |
| Length of time in ER, h | 1.08 (0.57–2.35) | 1.13 (0.65–2.35) | 0.48 | 1.03 (0.57–2.35) | 1.47 (0.85–2.35) | 0.02 * |
| BT, °C | 36.6 (36.5–36.8) | 36.8 (36.5–37.0) | <0.001 *** | 36.6 (36.5–36.9) | 36.8 (36.5–37.0) | 0.02 * |
| HR, bpm | 82 (72–92) | 86 (75 -102) | 0.001 ** | 82 (72–93) | 94 (80–112) | <0.001 *** |
| Systolic BP, mmHg | 165 (144–183) | 164 (130–199) | 0.48 | 165 (144–182) | 168 (128 -208) | 0.18 |
| Diastolic BP, mmHg | 96 (82–107) | 92 (81–109) | 0.10 | 96 (82–108) | 93.5 (78–107) | 0.12 |
| Medical history | ||||||
| Hypertension, | 0.48 | 0.24 | ||||
| Yes | 429 (72.96%) | 159 (27.04%) | 524 (77.63%) | 64 (82.89%) | ||
| No | 124 (76.07%) | 39 (23.93%) | 151 (22.37%) | 12 (15.79%) | ||
| DM, | 0.09 | 0.007 ** | ||||
| Yes | 50 (64.94%) | 27 (35.06%) | 62 (80.52%) | 15 (19.48%) | ||
| No | 503 (74.63%) | 171 (25.37%) | 613 (90.95%) | 61 (9.05%) | ||
| Coronary heart disease, | 0.21 | 0.29 | ||||
| Yes | 34 (65.38%) | 18 (34.62%) | 44 (84.62%) | 8 (15.38%) | ||
| No | 519 (74.25%) | 180 (25.75%) | 631 (90.27%) | 68 (9.73%) | ||
| Kidney diseases, | 0.16 | 0.15 | ||||
| Yes | 30 (63.83%) | 167 (36.17%) | 38 (82.61%) | 8 (17.39%) | ||
| No | 523 (74.29%) | 181 (25.71%) | 637 (90.35%) | 68 (9.65%) | ||
| Pulmonary diseases, | 0.07 | 0.15 | ||||
| Yes | 68 (66.02%) | 35 (33.98%) | 88 (85.44%) | 15 (14.56%) | ||
| No | 485 (74.85%) | 163 (25.15%) | 587 (90.59%) | 61 (9.41%) | ||
| Cigarette smoking, | 0.43 | 0.31 | ||||
| Yes | 175 (75.76%) | 56 (24.24%) | 212 (91.77%) | 19 (8.23%) | ||
| No | 378 (75.76%) | 142 (27.31%) | 463 (89.04%) | 57 (10.96%) | ||
| Alcohol consumption, | 0.41 | 0.76 | ||||
| Yes | 170 (75.89%) | 54 (24.11%) | 203 (90.62%) | 21 (9.38%) | ||
| No | 383 (72.68%) | 144 (27.32%) | 472 (89.56%) | 54 (10.44%) | ||
| Family history of stroke, | 0.19 | 0.74 | ||||
| Yes | 11 (57.89%) | 8 (42.11%) | 18 (94.74%) | 1 (5.26%) | ||
| No | 542 (74.04%) | 190 (25.96%) | 657 (89.75%) | 75 (10.25%) | ||
| Coagulative disorders, | 0.05 | 0.86 | ||||
| Yes | 6 (46.15%) | 7 (53.85%) | 11 (84.62%) | 2 (15.38%) | ||
| No | 547 (74.12%) | 191 (25.88%) | 664 (89.97%) | 74 (10.03%) | ||
| Anticoagulation therapy, | 0.19 | 0.66 | ||||
| Yes | 11 (57.89%) | 8 (42.11%) | 16 (84.21%) | 3 (15.79%) | ||
| No | 542 (74.04%) | 190 (25.96%) | 659 (90.03%) | 73 (9.97%) | ||
| Antiplatelet therapy, | 0.61 | 0.07 | ||||
| Yes | 2 (50.00%) | 2 (50.00%) | 2 (50.00%) | 2 (50.00%) | ||
| No | 551 (73.76%) | 196 (26.24%) | 673 (90.09%) | 74 (9.91%) | ||
| Laboratory studies | ||||||
| BG, mmol/L | 7.16 (6.07–8.85) | 9.25 (7.35–11.64) | <0.001 *** | 7.38 (6.24–9.41) | 9.37 (7.35–12.45) | <0.001 *** |
| Creatinine, µmol/L | 69 (56–84) | 72 (60–96) | 0.004 ** | 69 (56–85) | 79 (64–116) | <0.001 *** |
| Uric acid, µmol/L | 324 (250–407) | 338 (257–419) | 0.26 | 321 (250–407) | 348 (288–439) | 0.03 * |
| TG, mmol/L | 1.14 (0.80–1.72) | 1.21 (0.87–1.73) | 0.09 | 1.13 (0.81–1.69) | 1.38 (0.88–1.99) | 0.03 * |
| Cholesterol, mmol/L | 4.42 (3.78–5.06) | 4.34 (3.68–5.06) | 0.36 | 4.40 (3.76–5.06) | 4.36 (3.66–5.12) | 0.50 |
| HDLC, mmol/L | 1.29 (1.03–1.61) | 1.33 (1.03–1.66) | 0.19 | 1.30 (1.04–1.63) | 1.31 (1.02–1.61) | 0.33 |
| LDLC, mmol/L | 2.60 (2.08–3.21) | 2.51 (1.91–3.24) | 0.11 | 2.60 (2.05–3.21) | 2.40 (1.83–3.3) | 0.07 |
| Sodium, mmol/L | 138.4 (136.1–140.3) | 138.3 (134.0–142.6) | 0.29 | 138.4 (136.1–140.4) | 137.9 (133.6–142.3) | 0.45 |
| Chlorine, mmol/L | 101.4 (98.8–104.3) | 100.5 (95.5–105.4) | 0.002 ** | 101.3 (98.6–104.3) | 99.6 (94.8–104.4) | 0.001 ** |
| eGFR, mL/min | 91.0 (87.7–103.5) | 91.0 (77.2–100.9) | <0.001 *** | 91.0 (87.0–103.6) | 86.0 (63.6–91.0) | <0.001 *** |
| Platelet, 109 cells/L | 170 (129–217) | 184 (136–222) | 0.12 | 175 (131–218) | 175 (98–252) | 0.35 |
| WBC, 109 cells/L | 10.11 (7.58–12.99) | 11.91 (9.22–15.43) | <0.001 *** | 10.54 (7.76–13.22) | 11.62 (8.24–16.45) | 0.006 ** |
| ANC, 109 cells/L | 8.43 (5.67–11.25) | 10.33 (7.09–13.26) | <0.001 *** | 8.81 (5.91–11.51) | 9.64 (6.17–13.57) | 0.03 * |
| ALC, 109 cells/L | 1.09 (0.76–1.48) | 1.17 (0.72–1.82) | 0.08 | 1.09 (0.75–1.51) | 1.19 (0.73–1.99) | 0.06 |
| AMC, 109 cells/L | 0.39 (0.26–0.53) | 0.42 (0.28–0.62) | 0.006 ** | 0.4 (0.26–0.54) | 0.47 (0.30–0.62) | 0.02 * |
| Hematocrit | 0.41 (0.38–0.44) | 0.41 (0.37–0.44) | 0.29 | 0.41 (0.38–0.44) | 0.42 (0.37–0.44) | 0.23 |
| Fibrinogen, g/L | 2.77 (2.26–3.41) | 2.74 (2.16–3.57) | 0.44 | 2.75 (2.24–3.42) | 2.77 (2.28–3.61) | 0.27 |
| D-dimer, mg/L FEU | 0.64 (0.31–1.94) | 1.43 (0.63–2.84) | <0.001 *** | 0.72 (0.32–2.16) | 2.37 (0.80–5.24) | <0.001 *** |
| Treatment, | 0.92 | 0.74 | ||||
| Surgery | 172 (73.19%) | 63 (26.81%) | 213 (90.64%) | 22 (9.36%) | ||
| Conservative | 381 (73.84%) | 135 (26.16%) | 462 (89.53%) | 54 (10.47%) | ||
* p < 0.05; ** p < 0.01; *** p < 0.001. ANC, absolute neutrophil count; ALC, absolute lymphocyte count; AMC, absolute monocyte count; BG, blood glucose; BT, body temperature; DM, diabetes mellitus; eGFR, estimated glomerular filtration rate; ER, emergency room; GCS, Glasgow Coma Scale; HR, heart rate; IVH, intraventricular hemorrhage; TG, triglyceride; WBC, white blood cell; HDLC, high-density lipoprotein cholesterol; LDLC, low-density lipoprotein cholesterol.
Predictive performance for the 90-day functional outcome and mortality after spontaneous intracerebral hemorrhage.
| Algorithm | Functional Outcome | Mortality | ||
|---|---|---|---|---|
| AUC, Mean | AUC, 95%CI | AUC, Mean | AUC, 95% CI | |
| ICH score | 0.856 | 0.827–0.884 | 0.790 | 0.712–0.867 |
| LR | 0.890 | 0.858–0.922 | 0.837 | 0.780–0.894 |
| LRCV | 0.887 | 0.855–0.920 | 0.844 | 0.807–0.881 |
| SVM | 0.849 | 0.804–0.894 | 0.777 | 0.720–0.833 |
| RF | 0.862 | 0.813–0.912 | 0.818 | 0.718–0.917 |
| XGBoost | 0.863 | 0.815–0.911 | 0.820 | 0.741–0.899 |
| CatBoost | 0.871 | 0.829–0.913 | 0.841 | 0.774–0.907 |
AUC, area under the receiver operator characteristic curve; CatBoost, Category Boosting; CI, confidence interval; ICH, intracerebral hemorrhage; LR, logistic regression; SD, standard deviation; SVM; support vector machine; RF, random forest; XGBoost, extreme gradient boosting.
Figure 2The receiver operating characteristic (ROC) curve of all the six machine learning (ML)-based models compared with the traditional ICH Score, with respect to predictive performance for the functional outcome at the third month.
List of variables used in the final model.
| Algorithm | Variables for Functional Outcome a | Variables for Mortality a |
|---|---|---|
| LR | Coagulation disorders, Location of the hematoma, GCS, IVH, AMC, BG, BT, D-dimer, Age, ANC, Chlorine | Location of the hematoma, AMC, GCS, DM, WBC, D-Dimer, ANC, BG, Age, Chlorine, IVH, HR, Time in ER, BT |
| LRCV | Coagulation disorders, Location of the hematoma, AMC, GCS, IVH, BG, ANC, WBC, D-dimer, Age, BT | AMC, Location of the hematoma, DM, GCS, WBC, ANC, IVH, D-Dimer, Age, Chlorine, BG, TG, HR, Hematoma volume, BT |
| SVM b | - | - |
| RF | GCS, BG, Hematoma volume, Location of the hematoma, D-Dimer, IVH | GCS, D-dimer, Age, BG, HR, eGFR, Time in ER, Hematoma volume, Chlorine, ANC, WBC, Location of the hematoma, Creatine, Uric acid, TG, BT, IVH, DM |
| XGBoost | GCS, BG, D-dimer, Location of the hematoma, eGFR, Hematoma volume, Age, WBC, Creatine, Chlorine | GCS, D-dimer, Age, WBC, Location of the hematoma, Hematoma volume, eGFR, HR, Chlorine, Time in ER, Creatine, ANC, TG |
| CatBoost | GCS, BG, D-dimer | GCS, Age, D-dimer, HR, Time in ER, Chlorine, eGFR, Location of the hematoma, Hematoma volume |
a Variables are listed according to the importance. b Because of the mechanism of SVM, the importance of variables cannot be accessed. AMC, absolute monocyte count; ANC, absolute neutrophil count; BG, blood glucose; BT, body temperature; CatBoost, Category Boosting; DM, diabetes mellitus; eGFR, estimated glomerular filtration rate; ER, emergency room; GCS, Glasgow Coma Scale; HR, heart rate; IVH, intraventricular hemorrhage; LR, logistic regression; RF, random forest; SVM; support vector machine; TG, triglyceride; WBC, white blood cell; XGBoost, extreme gradient boosting.
Figure 3The ROC curve of all the six ML-based models compared with the traditional ICH Score, with respect to predictive performance for the 90-day mortality at the third month.