| Literature DB >> 32021220 |
Yi-Ming Li1, Li-Cheng Jiang2, Jing-Jing He1, Kai-Yu Jia1, Yong Peng1, Mao Chen1.
Abstract
A formal risk assessment for identifying high-risk patients is essential in clinical practice and promoted in guidelines for the management of anterior acute myocardial infarction. In this study, we sought to evaluate the performance of different machine learning models in predicting the 1-year mortality rate of anterior ST-segment elevation myocardial infarction (STEMI) patients and to compare the utility of these models to the conventional Global Registry of Acute Coronary Events (GRACE) risk scores. We enrolled all of the patients aged >18 years with discharge diagnoses of anterior STEMI in the Western China Hospital, Sichuan University, from January 2011 to January 2017. A total of 1244 patients were included in this study. The mean patient age was 63.8±12.9 years, and the proportion of males was 78.4%. The majority (75.18%) received revascularization therapy. In the prediction of the 1-year mortality rate, the areas under the curve (AUCs) of the receiver operating characteristic curves (ROCs) of the six models ranged from 0.709 to 0.942. Among all models, XGBoost achieved the highest accuracy (92%), specificity (99%) and f1 score (0.72) for predictions with the full variable model. After feature selection, XGBoost still obtained the highest accuracy (93%), specificity (99%) and f1 score (0.73). In conclusion, machine learning algorithms can accurately predict the rate of death after a 1-year follow-up of anterior STEMI, especially the XGBoost model.Entities:
Keywords: acute anterior myocardial infarction; machine learning; prediction model
Year: 2020 PMID: 32021220 PMCID: PMC6957091 DOI: 10.2147/TCRM.S236498
Source DB: PubMed Journal: Ther Clin Risk Manag ISSN: 1176-6336 Impact factor: 2.423
Clinical Characteristics of the Study Population
| Characteristics | Total | Patients Survived | Patients Died | P-value |
|---|---|---|---|---|
| No. of patients | 1244 | 1059 | 185 | |
| Age | 63.76±12.92 | 62.16 ± 12.63 | 72.91±10.51 | <0.001 |
| Male | 975 (78.38%) | 855 (80.74%) | 120 (64.80%) | <0.001 |
| smoke | 780 (62.70%) | 680 (64.21%) | 100 (54.05%) | 0.008 |
| Pre-hypertension, n (%) | 580 (46.62%) | 480 (45.89%) | 94 (50.81%) | 0.216 |
| Pre-diabetes mellitus, n (%) | 260 (20.90%) | 211 (19.92%) | 49 (26.49%) | 0.043 |
| Pre-COPD, n (%) | 131 (10.53%) | 92 (8.69%) | 39 (21.08) | <0.001 |
| History of chest pain, n (%) | 257 (20.66%) | 219 (20.68%) | 38 (20.54%) | 0.002 |
| HR, beats/min | 83.83 ± 16.85 | 81.56 ± 15.59 | 95.15 ± 19.18 | <0.001 |
| SBP, mm Hg | 125.29 ± 22.06 | 125.93 ± 22.21 | 121.57 ± 20.80 | 0.013 |
| DBP, mm Hg | 77.21 ± 22.85 | 77.59 ± 24.07 | 75.59 ± 13.90 | 0.296 |
| LVEF, % | 49.36 ± 10.59 | 50.69 ± 10.16 | 41.75 ± 9.78 | <0.001 |
| Cardiac arrest, % | 18 (1.45%) | 16 (1.51%) | 2 (1.09%) | 0.652 |
| GRACE risk score | 176.47 ± 38.60 | 169.99 ± 34.11 | 213.61 ± 40.80 | <0.001 |
| Killip classification ≥2 | 348 (27.97%) | 213 (20.11%) | 135 (72.97%) | <0.001 |
| Serum creatinine, μmol/L | 94.70 ± 63.16 | 87.65 ± 52.52 | 135.10 ± 95.73 | <0.001 |
| Blood glucose, mmol/L | 9.03 ± 4.14 | 8.68 ± 3.75 | 11.04 ± 5.51 | <0.001 |
| Cystatin C, mg/L | 1.12 ± 0.59 | 1.04 ± 0.44 | 1.60 ± 0.99 | <0.001 |
| BNP, pg/mL | 3999.61 ± 7180.38 | 2780.16 ± 5243.11 | 10,980.16 ± 11,518.47 | <0.001 |
| BUN, mg/dL | 7.07 ± 4.23 | 6.48 ± 3.32 | 10.43 ± 6.65 | <0.001 |
| T-Bil, umol/L | 13.84 ± 7.64 | 13.54 ± 7.30 | 15.56 ± 7.20 | <0.001 |
| Fibrinogen, g/L | 3.35 ± 1.28 | 3.28 ± 1.21 | 3.78 ± 1.54 | <0.001 |
| <0.001 | ||||
| Non | 224 (18.01%) | 131 (12.37%) | 93 (50.27%) | |
| Primary PCI | 582 (46.78%) | 521 (49.20%) | 61 (32.97%) | |
| Selective PCI | 377 (30.31%) | 255 (33.52%) | 22 (11.89%) | |
| Thrombolysis | 18 (1.45%) | 17(1.61%) | 1 (0.54%) | |
| Rescue PCI | 5 (0.40%) | 4 (0.37%) | 1 (0.54%) | |
| Selective CABG | 3 (0.24%) | 3 (0.28%) | 0 (0%) | |
| CAG only | 35 (2.81%) | 28 (2.64%) | 7 (3.78%) | |
| NYHA classification ≥2 | 124 (9.97%) | 73 (6.89%) | 51 (27.57%) | <0.001 |
Abbreviations: HR, heart rate; SBP, systolic blood pressures; DBP, diastolic blood pressure; LVEF, left ventricular ejection fraction; GRACE, Global Registry of Acute Coronary Events; BNP, brain natriuretic peptide; BUN, blood urea nitrogen; T-Bil, total bilirubin; PCI, percutaneous coronary intervention; CABG, coronary artery bypass graft; CAG, coronary angiography, NYHA, New York Heart Association.
Figure 1ROC analysis result of six classifiers for the prediction of 1-year mortality with all available features.
Abbreviation: ROC, receiver operating characteristic curve.
Comparison of Validation Results of Six Machine Learning Models
| Models with All Features | ||||
|---|---|---|---|---|
| Models | Accuracy (%) | Sensitivity (%) | Specificity (%) | F1 Score* |
| Logistic regression | 87 | 49 | 95 | 0.57 |
| GaussianNB | 88 | 77 | 90 | 0.68 |
| KNN | 82 | 19 | 95 | 0.27 |
| Decision tree | 86 | 64 | 90 | 0.61 |
| Random forest | 88 | 75 | 91 | 0.69 |
| XGBoost | 92 | 60 | 99 | 0.74 |
| Models after feature selection | ||||
| Logistic regression | 88 | 40 | 98 | 0.53 |
| GaussianNB | 87 | 60 | 92 | 0.61 |
| KNN | 83 | 17 | 97 | 0.26 |
| Decision tree | 90 | 51 | 98 | 0.63 |
| Random forest | 89 | 75 | 92 | 0.71 |
| XGBoost | 92 | 60 | 99 | 0.73 |
| Traditional risk score | ||||
| GRACE risk score | 86 | 16 | 98 | 0.26 |
Note: *F1 score: the higher the better.
Abbreviations: NB, naïve bayes; KNN, k nearest neighbors.
Figure 2ROC analysis result of six classifiers for the prediction of 1-year mortality with 20 top features.
Abbreviation: ROC, receiver operating characteristic curve.