| Literature DB >> 34285708 |
Zhixun Bai1,2,3, Jing Lu4, Ting Li3, Yi Ma3, Zhijiang Liu3, Ranzun Zhao1,3, Zhenglong Wang3, Bei Shi1,3.
Abstract
Accurate risk assessment of high-risk patients is essential in clinical practice. However, there is no practical method to predict or monitor the prognosis of patients with ST-segment elevation myocardial infarction (STEMI) complicated by hyperuricemia. We aimed to evaluate the performance of different machine learning models for the prediction of 1-year mortality in STEMI patients with hyperuricemia. We compared five machine learning models (logistic regression, k-nearest neighbor, CatBoost, random forest, and XGBoost) with the traditional global (GRACE) risk score for acute coronary event registrations. We registered patients aged >18 years diagnosed with STEMI and hyperuricemia at the Affiliated Hospital of Zunyi Medical University between January 2016 and January 2020. Overall, 656 patients were enrolled (average age, 62.5 ± 13.6 years; 83.6%, male). All patients underwent emergency percutaneous coronary intervention. We evaluated the performance of five machine learning classifiers and the GRACE risk model in predicting 1-year mortality. The area under the curve (AUC) of the six models, including the GRACE risk model, ranged from 0.75 to 0.88. Among all the models, CatBoost had the highest predictive accuracy (0.89), AUC (0.87), precision (0.84), and F1 value (0.44). After hybrid sampling technique optimization, CatBoost had the highest accuracy (0.96), AUC (0.99), precision (0.95), and F1 value (0.97). Machine learning algorithms, especially the CatBoost model, can accurately predict the mortality associated with STEMI complicated by hyperuricemia after a 1-year follow-up.Entities:
Year: 2021 PMID: 34285708 PMCID: PMC8275420 DOI: 10.1155/2021/7252280
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1A flow diagram showing the study process.
Comparison of characteristics of patients with and without mortality in the cohort.
| Variables | Total ( | Survival ( | Death ( |
|
|---|---|---|---|---|
| Demographic characteristics | ||||
| Sex, | 0.008 | |||
| Female | 107 (16) | 83 (15) | 24 (26) | |
| Male | 549 (84) | 482 (85) | 67 (74) | |
| Age, y | 64.00 (52, 74) | 63.00 (51, 73) | 70.00 (59, 78) | <0.001 |
| Smoking, | 453 (69) | 396 (70) | 57 (63) | 0.192 |
| Weekend on admission, | 248 (38) | 205 (36) | 43 (47) | 0.059 |
| Delay, | 167 (25) | 133 (24) | 34 (37) | 0.007 |
| Vascular risk factors | ||||
| Hypertension, | 380 (58) | 326 (58) | 54 (59) | 0.857 |
| Diabetes mellitus, | 121 (18) | 98 (17) | 23 (25) | 0.096 |
| Prior-stroke, | 35 (5) | 30 (5) | 5 (5) | 1 |
| CKD, | 152 (23) | 122 (22) | 30 (33) | 0.024 |
| Clinical data | ||||
| HR, beats/min | 80 (72, 92) | 80.00 (72, 91) | 85 (73, 106) | 0.003 |
| SBP, mmHg | 124 (108, 140) | 127 (110, 143) | 111 (92, 129) | <0.001 |
| DBP, mmHg | 80 (68, 91) | 80 (70, 92) | 74 (58, 85) | <0.001 |
| Shock_index | 0.65 (0.55, 0.77) | 0.64 (0.54, 0.75) | 0.75 (0.61, 1.04) | <0.001 |
| Electrocardiographic data | ||||
| Inferior, | 300 (46) | 263 (47) | 37 (41) | 0.351 |
| Anterior, | 322 (49) | 276 (49) | 46 (51) | 0.851 |
| Other, | 21 (3) | 16 (3) | 5 (5) | 0.194 |
| Right ventricular, | 7 (1) | 6 (1) | 1 (1) | 1 |
| Laboratory examinations on admission | ||||
| WBC, ∗109/L | 11.27 (8.60, 14.19) | 10.97 (8.34, 13.57) | 13.92 (10.56, 19.51) | <0.001 |
| Neutrophil count, ∗109/L | 8.85 (6.34, 11.83) | 8.46 (6.11, 11.19) | 11.40 (8.43, 16.26) | <0.001 |
| NLR | 6.65 (3.89, 10.77) | 6.25 (3.78, 9.85) | 9.74 (5.89, 14.93) | <0.001 |
| PLR | 149.03 (104.31, 224.60) | 148.96 (107.43, 220.27) | 151.40 (81.96, 250.07) | 0.518 |
| MLR | 0.54 (0.37, 0.82) | 0.51 (0.36, 0.76) | 0.75 (0.41, 1.12) | <0.001 |
| SIRI | 4.51 (2.63, 8.44) | 4.19 (2.47, 7.39) | 8.41 (4.38, 15.27) | <0.001 |
| SII | 1285.43 (746.84, 2247.28) | 1233.05 (735.30, 2139.17) | 1923.99 (894.50, 2898.80) | 0.003 |
| HB, g/L | 139.00 (123.00, 154.00) | 140.00 (124.00, 155.00) | 128.00 (115.00, 147.00) | 0.001 |
| RBC, ∗1012/L | 4.54 (3.98, 5.01) | 4.58 (4.05, 5.02) | 4.21 (3.71, 4.88) | 0.006 |
| PLT, ∗109/L | 205.00 (161.00, 249.25) | 207.00 (164.00, 250.00) | 196.00 (138.00, 246.50) | 0.113 |
| ALT, U/L | 33.00 (23.00, 56.00) | 32.00 (22.25, 51.75) | 56.00 (30.00, 193.00) | <0.001 |
| AST, U/L | 72.00 (36.50, 169.5) | 67.00 (35.00, 143.00) | 225.00 (73.50, 456.00) | <0.001 |
| GGT, U/L | 44.00 (27.00, 75.00) | 43.00 (27.00, 72.75) | 61.00 (29.00, 104.00) | 0.007 |
| BUN, mmol/L | 6.72 (5.25, 9.37) | 6.38 (5.09, 8.50) | 10.33 (7.45, 13.15) | <0.001 |
| Creatinine, umol/L | 101.00 (82.00, 128.00) | 98.00 (81.00, 119.00) | 134.00 (109.00, 174.50) | <0.001 |
| Uric acid, umol/L | 484.00 (449.00, 542.00) | 481.00 (447.00, 535.00) | 523.00 (461.00, 637.00) | <0.001 |
| Cystatin C, mg/L | 1.22 (0.97, 1.58) | 1.17 (0.95, 1.49) | 1.65 (1.32, 2.18) | <0.001 |
| CK, U/L | 507.00 (186.00, 1368.75) | 463.50 (172.00, 1322.50) | 745.00 (303.25, 2012.50) | 0.002 |
| CKMB, U/L | 52.00 (25.00, 127.00) | 48.00 (24.00, 117.25) | 86.00 (33.00, 190.00) | <0.001 |
| LDH, U/L | 375.00 (266.25, 639.75) | 350.50 (255.25, 556.75) | 695.50 (407.75, 1229.75) | <0.001 |
| | 259.00 (173.00, 475.00) | 240.00 (165.00, 427.50) | 490.50 (273.75, 773.00) | <0.001 |
| CTnT, ng/L | 1014.00 (213.50, 3480.00) | 786.95 (185.97, 3069.00) | 3077.00 (1133.00, 6711.00) | <0.001 |
| BNP, pg/mL | 1022.50 (255.15, 3860.75) | 884.90 (204.85, 2713.00) | 5349.00 (2058.00, 15267.00) | <0.001 |
| Glucose, mmol/L | 6.66 (5.56, 8.66) | 6.52 (5.44, 8.19) | 8.47 (6.41, 11.60) | <0.001 |
| Myoglobin, ng/mL | 341.10 (104.50, 910.40) | 308.95 (95.96, 820.28) | 615.00 (203.50, 2251.00) | <0.001 |
| Diseased vessel identified during procedure | ||||
| LM, | 13 (2) | 13 (2) | 0 (0) | 0.233 |
| LAD, | 213 (33) | 185 (33) | 28 (31) | 0.836 |
| LCX, | 70 (11) | 62 (11) | 8 (9) | 0.674 |
| RCA, | 157 (24) | 134 (24) | 23 (26) | 0.819 |
| Risk assessment | ||||
| GRACE, score | 125.00 (102.00, 154.00) | 121.00 (101.00, 146.00) | 178.00 (140.00, 206.50) | <0.001 |
Values are expressed as medians with interquartile ranges for continuous data. Other values are presented as numbers and percentages. Shock index: ratio of HR to SBP; SIRI: systemic inflammatory response index; SII: systemic inflammatory reaction index; PLR: ratio of platelets to lymphocytes; NLR: the ratio of neutrophils to lymphocytes; MLR: ratio of monocytes to lymphocytes; OHCA: out-of-hospital cardiac arrest; GRACE: Global Registry of Acute Coronary Events score; α-HBDH: α-hydroxybutyrate dehydrogenase; BNP: B-type natriuretic peptides.
Comparison of validation results of machine learning models.
| Models | Accuracy | AUC | Recall | Precision | F1 value |
|---|---|---|---|---|---|
| CatBoost | 0.89 | 0.87 | 0.33 | 0.78 | 0.44 |
| RF | 0.89 |
| 0.26 |
| 0.38 |
| XGBoost |
| 0.83 |
| 0.81 |
|
| LR | 0.89 | 0.82 | 0.38 | 0.63 | 0.46 |
| KNN | 0.88 | 0.75 | 0.21 | 0.61 | 0.31 |
| Model with oversampling (SMOTEENN) | |||||
| CatBoost | 0.96 | 0.99 | 0.98 | 0.95 | 0.97 |
| RF | 0.95 | 0.99 | 0.98 | 0.94 | 0.96 |
| XGBoost | 0.94 | 0.98 | 0.98 | 0.92 | 0.95 |
| LR | 0.91 | 0.95 | 0.92 | 0.92 | 0.92 |
| KNN | 0.92 | 0.96 | 0.98 | 0.88 | 0.93 |
| Tradition risk score model | |||||
| GRACE score | 0.84 | 0.80 | 0.46 | 0.59 | 0.51 |
AUC and F1 score: the higher, the better. XGBoost: Extreme Gradient Boosting; RF: random forest; LR: logistic regression; KNN: K-nearest neighbors.
Figure 2ROC analysis result of five classifiers and GRACE for the prediction of 1-year mortality with all available features.
Figure 3ROC analysis result of five classifiers with SMOTEENN.
Figure 4CatBoost model performance visualization. (a) ROC curve. (b) Precision-recall curve. (c) Confusion matrix. (d) Classification report. (e) Feature importance. (f) Decision boundary.
Ensemble of machine learning models.
| Ensemble | Accuracy | AUC | Recall | Precision | F1 value |
|---|---|---|---|---|---|
| RF+CatBoost+XGBoost | 0.90 | 0.87 | 0.36 | 0.72 | 0.47 |
| XGBoost+LR+KNN | 0.90 | 0.84 | 0.33 | 0.72 | 0.43 |
| RF+LR+KNN | 0.89 | 0.86 | 0.30 | 0.71 | 0.39 |
| RF+XGBoost+LR+KNN | 0.90 | 0.86 | 0.37 | 0.73 | 0.46 |
| All | 0.90 | 0.86 | 0.37 | 0.73 | 0.47 |