| Literature DB >> 33980906 |
Carlos Fernandez-Lozano1,2, Pablo Hervella3, Virginia Mato-Abad4, Manuel Rodríguez-Yáñez5, Sonia Suárez-Garaboa4, Iria López-Dequidt5, Ana Estany-Gestal6, Tomás Sobrino3, Francisco Campos3, José Castillo3, Santiago Rodríguez-Yáñez7, Ramón Iglesias-Rey8.
Abstract
We research into the clinical, biochemical and neuroimaging factors associated with the outcome of stroke patients to generate a predictive model using machine learning techniques for prediction of mortality and morbidity 3-months after admission. The dataset consisted of patients with ischemic stroke (IS) and non-traumatic intracerebral hemorrhage (ICH) admitted to Stroke Unit of a European Tertiary Hospital prospectively registered. We identified the main variables for machine learning Random Forest (RF), generating a predictive model that can estimate patient mortality/morbidity according to the following groups: (1) IS + ICH, (2) IS, and (3) ICH. A total of 6022 patients were included: 4922 (mean age 71.9 ± 13.8 years) with IS and 1100 (mean age 73.3 ± 13.1 years) with ICH. NIHSS at 24, 48 h and axillary temperature at admission were the most important variables to consider for evolution of patients at 3-months. IS + ICH group was the most stable for mortality prediction [0.904 ± 0.025 of area under the receiver operating characteristics curve (AUC)]. IS group presented similar results, although variability between experiments was slightly higher (0.909 ± 0.032 of AUC). ICH group was the one in which RF had more problems to make adequate predictions (0.9837 vs. 0.7104 of AUC). There were no major differences between IS and IS + ICH groups according to morbidity prediction (0.738 and 0.755 of AUC) but, after checking normality with a Shapiro Wilk test with the null hypothesis that the data follow a normal distribution, it was rejected with W = 0.93546 (p-value < 2.2e-16). Conditions required for a parametric test do not hold, and we performed a paired Wilcoxon Test assuming the null hypothesis that all the groups have the same performance. The null hypothesis was rejected with a value < 2.2e-16, so there are statistical differences between IS and ICH groups. In conclusion, machine learning algorithms RF can be effectively used in stroke patients for long-term outcome prediction of mortality and morbidity.Entities:
Year: 2021 PMID: 33980906 PMCID: PMC8115135 DOI: 10.1038/s41598-021-89434-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographic variables of the experimented dataset of patients summarized by group.
| IS + ICH (n = 6022) | IS (n = 4922) | ICH (n = 1100) | |
|---|---|---|---|
| Age (years) | 72.1 ± 13.7 | 71.9 ± 13.8 | 73.3 ± 13.1 |
| Female gender (%) | 44.5 | 44.8 | 41.5 |
| Arterial hypertension (%) | 66.7 | 63.7 | 60.7 |
| Diabetes mellitus (%) | 23.4 | 24.1 | 20.4 |
| Alcohol consumption (%) | 12.2 | 11.5 | 15.4 |
| Smoking (%) | 15.4 | 16.4 | 10.7 |
| Dyslipidemia (%) | 35.4 | 35.1 | 36.7 |
| Peripheral arterial disease (%) | 5.7 | 5.9 | 4.6 |
| Ischemic heart disease (%) | 10.8 | 11.3 | 8.6 |
| Atrial fibrillation (%) | 20.8 | 24.1 | 18.1 |
| Previous transient ischemic attack (%) | 5.4 | 6.1 | 2.5 |
| Previous ischemic stroke (%) | 13.1 | 13.6 | 9.8 |
| Previous intracerebral hemorrhage,% | 2.4 | 0.9 | 9.8 |
| Previous anticoagulants (%) | 9.5 | 8.5 | 14.1 |
| Previous platelet antiaggregants (%) | 22.9 | 24.4 | 16.5 |
Clinical and neuroimaging variables of the experimented dataset of patients summarized by group.
| IS + ICH (n = 6022) | IS (n = 4922) | ICH (n = 1100) | |
|---|---|---|---|
| Stroke on awakening (%) | 8.3 | 9.1 | 4.6 |
| Previous mRS | 0 [0, 1] | 0 [0, 1] | 1 [0, 1] |
| Time from stroke onset, minutes | 239.1 ± 175.2 | 240.8 ± 167.4 | 231.3 ± 206.1 |
| NIHSS score at admission | 13 [7, 19] | 13 [8, 19] | 13 [7, 18] |
| NIHSS score at 24 h | 8 [13, 16] | 7 [3, 15] | 12 [6, 19] |
| NIHSS score at 48 h | 7 [2, 15] | 6 [2, 14] | 12 [4, 20] |
| Early neurological deterioration (%) | 7.7 | 5.8 | 16.5 |
| Atherothrombotic (%) | – | 22.9 | – |
| Cardioembolic (%) | – | 36.3 | – |
| Lacunar (%) | – | 8.7 | – |
| Undetermined (%) | – | 30.9 | – |
| Others (%) | – | 1.2 | – |
| Intravenous fibrinolysis (%) | – | 22.7 | – |
| Thrombectomy (%) | – | 5.2 | – |
| DWI at admission (ml) | – | 33.3 ± 76.9 | – |
| TC volume 4th–7th day (ml) | – | 51.1 ± 82.3 | – |
| IH1 (%) | – | 7.0 | – |
| IH2 (%) | – | 3.1 | – |
| PH1 (%) | – | 1.7 | – |
| PH2 (%) | – | 1.2 | – |
| Hypertensive (%) | – | – | 46.0 |
| Amyloid (%) | – | – | 10.4 |
| Anticoagulants (%) | – | – | 14.2 |
| Others/Undetermined (%) | – | – | 29.4 |
| Hematoma volume at admission (ml) | – | – | 40.3 ± 46.2 |
| Hematoma volume 4th–7th day (ml) | – | – | 51.9 ± 48.1 |
| Total hematoma volume (ml) | – | – | 68.3 ± 53.1 |
| Volume of hypodensity (ml) | – | – | 15.2 ± 17.9 |
| Hematoma growth (ml) | – | – | 11.9 ± 27.6 |
| Deep hemispherics (%) | – | – | 50.0 |
| Lobar (%) | – | – | 39.6 |
| Cerebellar (%) | – | – | 4.7 |
| Breinstem (%) | – | – | 3.8 |
| Primary intraventricular (%) | – | – | 1.9 |
| Axillary temperature at admission (ºC) | 36.4 ± 0.7 | 36.4 ± 0.7 | 36.6 ± 0.8 |
| Blood glucose at admission (mg/dl) | 137.6 ± 56.3 | 137.3 ± 57.9 | 138.9 ± 48.1 |
| Sedimentation rate (mm) | 26.4 ± 23.1 | 26.5 ± 23.1 | 26.2 ± 23.1 |
| Glycosylated hemoglobin (%) | 6.1 ± 2.1 | 6.1 ± 2.3 | 5.8 ± 0.9 |
| LDL cholesterol (mg/dl) | 101.9 ± 42.9 | 112.5 ± 44.4 | 109.6 ± 35.2 |
| HDL cholesterol (mg/dl) | 41.2 ± 18.5 | 41.8 ± 18.5 | 38.8 ± 18.3 |
| Triglycerides (mg/dl) | 118.3 ± 63.1 | 121.2 ± 65.1 | 109.4 ± 50.7 |
| Platelets (× 103/ml) | 215.4 ± 82.9 | 217.7 ± 83.7 | 203.3 ± 77.9 |
| Hemoglobin (g/dl) | 13.7 ± 1.9 | 13.8 ± 1.9 | 13.5 ± 2.1 |
| DBP at admission (mmHg) | 81.9 ± 16.1 | 81.5 ± 15.8 | 84.3 ± 17.2 |
| SBP at admission (mmHg) | 152.9 ± 27.3 | 152.5 ± 27.3 | 155.5 ± 27.4 |
Molecular markers and outcome at 3 months of the experimented dataset of patients summarized by group.
| IS + ICH (n = 6022) | IS (n = 4922) | ICH (n = 1100) | |
|---|---|---|---|
| Leukocytes at admission (× 103/ml) | 8.9 ± 3.1 | 9.1 ± 3.2 | 8.8 ± 3.3 |
| Fibrinogen at admission (mg/dl) | 443.9 ± 101.7 | 444.5 ± 101.8 | 444.1 ± 101.5 |
| C-reactive protein admission (mg/dl) | 2.7 ± 3.8 | 3.6 ± 4.2 | 5.2 ± 5.2 |
| Microalbuminuria (mg/24 h) | 7.9 ± 26.2 | 5.9 ± 25.9 | 16.7 ± 30.0 |
| NT-pro-BNP levels (pg/ml) | 915.9 ± 1563.7 | 1581.2 ± 1886.1 | 1013.8 ± 3620.2 |
| mRS | 2 [1, 4] | 2 [0, 4] | 3 [1, 6] |
| Poor outcome (%) | 49.6 | 47.5 | 58.6 |
| Morbidity (%) | 35.0 | 33.4 | 27.6 |
| Mortality (%) | 16.3 | 13.2 | 30.2 |
Figure 1Flowchart of patient groups and functional outcome.
Figure 2Mortality prediction for IS + ICH, IS and ICH groups. (A) Main variables for the machine learning model: NIHSS score at admission [NIHSS (0)]; NIHSS score at 24 h [NIHSS (24)]; NIHSS score at 48 h [NIHSS (48)]; Axillary temperature at admission [T(0)]; Early neurological deterioration [ED]; Leukocytes at admission [LEU (0)]; and Blood glucose at admission [GLU (0)]. (B) AUROC values obtained. (C) ROC curves for the Random Forest classifier.
Figure 32D-heatmap of mortality (EXT) predictions against NIHSS(48) and NIHSS(24). Model results are shown for the IS + ICH group, as it was the most stable for mortality prediction (0.904 ± 0.025 of AUC). Red areas correspond to patients who do not die (0), blue areas correspond to patients who die (1), and misclassified items are highlighted.
Figure 4Morbidity prediction for IS + ICH, IS and ICH groups. (A) Main variables for the machine learning model: NIHSS score at admission [NIHSS (0)]; NIHSS score at 24 h [NIHSS (24)]; NIHSS score at 48 h [NIHSS (48)]; Axillary temperature at admission [T(0)]; Early neurological deterioration [ED]; Leukocytes at admission [LEU (0)]; and Blood glucose at admission [GLU (0)]. (B) AUROC values obtained. (C) ROC curves for the Random Forest classifier.
Figure 5Comparison of ROC curves of 7 variables selected for machine learning experiments for mortality and morbidity prediction at 3 months of the different patient groups evaluated. (A,B) Morbidity and mortality of IS + ICH group. (C,D) Morbidity and mortality of IS group. (E–F) Morbidity and mortality of ICH group.