| Literature DB >> 36253488 |
Lee Hwangbo1,2, Yoon Jung Kang3,2, Hoon Kwon1,2, Jae Il Lee4,2, Han-Jin Cho3,2, Jun-Kyeung Ko4,2, Sang Min Sung3,2,5, Tae Hong Lee6,7,8.
Abstract
Patients with acute ischemic stroke can benefit from reperfusion therapy. Nevertheless, there are gray areas where initiation of reperfusion therapy is neither supported nor contraindicated by the current practice guidelines. In these situations, a prediction model for mortality can be beneficial in decision-making. This study aimed to develop a mortality prediction model for acute ischemic stroke patients not receiving reperfusion therapies using a stacking ensemble learning model. The model used an artificial neural network as an ensemble classifier. Seven base classifiers were K-nearest neighbors, support vector machine, extreme gradient boosting, random forest, naive Bayes, artificial neural network, and logistic regression algorithms. From the clinical data in the International Stroke Trial database, we selected a concise set of variables assessable at the presentation. The primary study outcome was all-cause mortality at 6 months. Our stacking ensemble model predicted 6-month mortality with acceptable performance in ischemic stroke patients not receiving reperfusion therapy. The area under the curve of receiver-operating characteristics, accuracy, sensitivity, and specificity of the stacking ensemble classifier on a put-aside validation set were 0.783 (95% confidence interval 0.758-0.808), 71.6% (69.3-74.2), 72.3% (69.2-76.4%), and 70.9% (68.9-74.3%), respectively.Entities:
Mesh:
Year: 2022 PMID: 36253488 PMCID: PMC9576722 DOI: 10.1038/s41598-022-22323-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Design of stacking ensemble learner. Seven base classifiers are hyperparameter-tuned individually, and each produces a prediction value. The seven outcome prediction values are input variables for the stacking ensemble learner. KNN: k-nearest neighbours; XGB: extreme gradient boosting; SVM: support vector machine; NB: Naïve Bayes; RF: random forests; ANN: artificial neural networks; LR: logistic regression.
Figure 2Flow of study patients. The confusion matrix of the final stacking ensemble learning produced the predicted number. The put-aside validation set evaluated the models after complete training.
Summary of clinical variables of train and validation sets.
| Train set | Validation set | P value | |
|---|---|---|---|
| Age (SD), years | 70.4 (11.5) | 70.1 (11.9) | 0.2896 |
| Sex (female) | 2618 (42.6%) | 1177 (44.6%) | 0.07716 |
| Altered consciousness (drowsy or sunconscious) | 706 (11.1%) | 294 (11.4%) | 0.6813 |
| Wake-up stroke | 1888 (30.7%) | 818 (31.0%) | 0.7845 |
| Atrial fibrillation | 894 (14.5%) | 343 (13.0%) | 0.0635 |
| Visible infarction on computed tomography | 1981 (32.2%) | 874 (33.3%) | 0.4063 |
| Heparin within 24 h of visit | 150 (2.4%) | 67 (2.5%) | 0.8363 |
| Aspirin within 3 days of visit | 1304 (21.2%) | 523 (19.8%) | 0.1551 |
| Systolic blood pressure (SD), mmHg | 160.4 (27.4) | 160.7 (28.2) | 0.6780 |
| Facial | 4318 (70.2%) | 1848 (70.1%) | 0.9217 |
| Upper extremity | 5188 (84.4%) | 2243 (85.1%) | 0.4228 |
| Lower extremity | 4478 (72.8%) | 1956 (74.1%) | 0.1952 |
| Dysphasia | 2225 (84.4%) | 893 (33.9%) | 0.0399 |
| Hemianopsia | 1034 (16.8%) | 431 (16.3%) | 0.6108 |
| Visuospatial disorder | 945 (15.4%) | 395 (15.0%) | 0.6674 |
| Brainstem-cerebellar | 746 (12.1%) | 333 (12.6%) | 0.5377 |
| Other | 376 (6.1%) | 137 (5.2%) | 0.1024 |
| Heparin administered | 2058 (33.5%) | 887 (33.6%) | 0.8941 |
| Aspirin administered | 3098 (50.4%) | 1294 (49.1%) | 0.2729 |
| Death at 6 months | 910 (14.8%) | 375 (14.2%) | 0.5045 |
Tabulated data are number of patients for binary variables and mean for continuous variables. The p values are calculated with either Pearson’s χ2 test or Student’s t-test.
Figure 3Receiver operating characteristics curve for seven base and ensemble learners. KNN: k-nearest neighbors; XGB: extreme gradient boost; SVM: support vector machine; NB: Naïve Bayes; RF: random forests; ANN: artificial neural networks, LR = logistic regression.
Model performance on train and validation set of stacking ensemble machine learning.
| Train set | Validation set | |
|---|---|---|
| AUROC | 0.797 (0.782–0.813) | 0.783 (0.758–0.808) |
| Accuracy | 0.728 (0.707–0.742) | 0.716 (0.693–0.742) |
| Sensitivity | 0.719 (0.703–0.745) | 0.723 (0.692–0.764) |
| Specificity | 0.732 (0.705–0.744) | 0.709 (0.689–0.743) |
| Positive predictive value | 0.316 (0.291–0.338) | 0.296 (0.266–0.331) |
| Negative predictive value | 0.937 (0.931–0.944) | 0.940 (0.930–0.950) |
| Positive likelihood ratio | 2.69 (2.42–2.86) | 2.48 (2.29–2.87) |
| Negative likelihood ratio | 0.384 (0.348–0.414) | 0.391 (0.330–0.437) |
| F1 score | 0.439 (0.413–0.463) | 0.420 (0.387–0.457) |
Proportion or ratio (bootstrapped 95% CI).
Figure 4Violin plots of bootstrapped metrics of AUROC, accuracy, sensitivity, specificity, LR+, and LR−. AUROC: area under the receiver operating characteristics curve; LR+: positive likelihood ratio; LR−: negative likelihood ratio.
Comparison of selected mortality prediction models for AIS patients.
| Algorithms used | Validations | Hyperacute applicability | Predicting outcomes | Reported AUROC | |
|---|---|---|---|---|---|
| Current study | SEL | Internal validation | Yes | 6-month mortality | 0.783 |
| Saposnik et al. | Integer scoring system | Internal and incomplete external validations (half of the external set used for calibration) | Yes | 30-day and 1-year mortality | 0.790 (30-day) |
| 0.782 (1-year) | |||||
| Eaton et al. | NB | Internal validation | No | 7- and 93-day mortality | 0.858 (7-day) |
| 0.807 (93-day) | |||||
| Fernandez-Lozano et al. | RF | Internal validation | No | 3-month morbidity and mortality | 0.703 (3-month morbidity) |
| 0.899 (3-month mortality) | |||||
| Abedi et al. | LR, RF, XGB | Internal validation | No | 1-, 3-, 6-, 12-, 18-, 24-month mortality | 0.82 (1-month; RF) |
| 0.80 (6-month; RF) | |||||
| 0.77 (12-month; XGB) | |||||
| Lin et al. | RF, SVM, ANN, hybrid ANN | Internal validation | No | 90-day morbidity | 0.972 (RF) |
| 0.971 (SVM) | |||||
| 0.969 (ANN) | |||||
| 0.974 (hybrid ANN) |
Hyperacute applicability means all selected features are assessable at the time of initial presentation.