| Literature DB >> 34430603 |
Zhixun Bai1,2,3, Shan Hu1,2, Yan Wang1,2, Wenwen Deng1,2, Ning Gu1,2, Ranzun Zhao1,2, Wei Zhang2, Yi Ma2, Zhenglong Wang2, Zhijiang Liu2, Changyin Shen3, Bei Shi1,2.
Abstract
BACKGROUND: The in-hospital mortality of patients with ST-segment elevation myocardial infarction (STEMI) increases to more than 50% following a cardiogenic shock (CS) event. This study highlights the need to consider the risk of delayed calculation in developing in-hospital CS risk models. This report compared the performances of multiple machine learning models and established a late-CS risk nomogram for STEMI patients.Entities:
Keywords: Machine learning; ST-segment elevation myocardial infarction (STEMI); cardiogenic shock (CS); least absolute shrinkage and selection operator (LASSO)
Year: 2021 PMID: 34430603 PMCID: PMC8350690 DOI: 10.21037/atm-21-2905
Source DB: PubMed Journal: Ann Transl Med ISSN: 2305-5839
Figure 1A flow diagram showing the study process. Training and test dataset generation, model training and performance evaluation. A total of 2,282 patients were recruited in the current study. The data were preprocessed and randomly divided into a training set (70%) and a test set (30%), maintaining similar proportions for the two classes proportions in each set. In the training set, k-fold cross-validation (k=5) was used. STEMI, ST-segment elevation myocardial infarction; CS, cardiogenic shock.
Characteristics of the patients in the training and test datasets
| Characteristics | Total (n=2,282) | Training dataset (n=1,598) | Test dataset (n=684) | P value |
|---|---|---|---|---|
| Group, n [%] | 0.788 | |||
| Noncardiogenic shock | 2,112 [93] | 1,481 [93] | 631 [92] | |
| Cardiogenic shock | 170 [7] | 117 [7] | 53 [8] | |
| Gender, n [%] | 0.484 | |||
| Female | 598 [26] | 426 [27] | 172 [25] | |
| Male | 1,684 [74] | 1,172 [73] | 512 [75] | |
| Age, [IQR], y | 64.0 [53.0, 73.0] | 64.0 [53.0, 72.0] | 64.0 [54.0, 73.0] | 0.643 |
| Inferior wall, n [%] | 0.93 | |||
| No | 1,326 [58] | 930 [58] | 396 [58] | |
| Yes | 956 [42] | 668 [42] | 288 [42] | |
| Anterior wall, n [%] | 0.404 | |||
| No | 1,073 [47] | 761 [48] | 312 [46] | |
| Yes | 1,209 [53] | 837 [52] | 372 [54] | |
| Other, n [%] | 0.797 | |||
| No | 2,224 [97] | 1,556 [97] | 668 [98] | |
| Yes | 58 [3] | 42 [3] | 16 [2] | |
| Right ventricular, n [%] | 0.972 | |||
| No | 2,250 [99] | 1,575 [99] | 675 [99] | |
| Yes | 32 [1] | 23 [1] | 9 [1] | |
| Nonworking days, n [%] | 0.783 | |||
| No | 1,426 [62] | 1,002 [63] | 424 [62] | |
| Yes | 856 [38] | 596 [37] | 260 [38] | |
| Hypertension, n [%] | 0.346 | |||
| No | 1,155 [51] | 798 [50] | 357 [52] | |
| Yes | 1,127 [49] | 800 [50] | 327 [48] | |
| Diabetes mellitus, n [%] | 0.686 | |||
| No | 1,872 [82] | 1,307 [82] | 565 [83] | |
| Yes | 410 [18] | 291 [18] | 119 [17] | |
| Smoker, n [%] | 0.634 | |||
| No | 839 [37] | 582 [36] | 257 [38] | |
| Yes | 1,443 [63] | 1,016 [64] | 427 [62] | |
| Stroke, n [%] | 0.828 | |||
| No | 2,144 [94] | 1,503 [94] | 641 [94] | |
| Yes | 138 [6] | 95 [6] | 43 [6] | |
| CKD, n [%] | 0.352 | |||
| No | 2,060 [90] | 1,436 [90] | 624 [91] | |
| Yes | 222 [10] | 162 [10] | 60 [9] | |
| Delay, n [%] | 0.143 | |||
| FMC ≤12 hours | 1,716 [75] | 1,216 [76] | 500 [73] | |
| FMC >12 hours | 566 [25] | 382 [24] | 184 [27] | |
| WBC, [IQR], ×109/L | 10.5 [8.2, 13.3] | 10.5 [8.2, 13.2] | 10.4 [8.2, 13.4] | 0.737 |
| Neutrophil, [IQR], ×109/L | 8.3 [5.8, 11.0] | 8.3 [5.8, 11.0] | 8.1 [5.9, 11.0] | 0.96 |
| Shock index, [IQR] | 0.6 [0.5, 0.7] | 0.6 [0.5, 0.7] | 0.6 [0.5, 0.7] | 0.704 |
| NLR, [IQR] | 6.2 [3.7, 10.5] | 6.3 [3.7, 10.5] | 6.1 [3.7, 10.1] | 0.367 |
| PLR, [IQR] | 157.0 [109.4, 224.8] | 158.1 [111.2, 229.8] | 153.3 [106.1, 212.7] | 0.084 |
| MLR, [IQR] | 0.5 [0.4, 0.8] | 0.5 [0.4, 0.8] | 0.5 [0.4, 0.8] | 0.653 |
| SIRI, [IQR] | 4.2 [2.3, 7.3] | 4.2 [2.3, 7.4] | 4.1 [2.4, 7.1] | 0.872 |
| SII, [IQR] | 1,248.5 [705.2, 2,167.0] | 1,269.3 [710.2, 2,200.2] | 1,212.2 [701.2, 2,051.6] | 0.251 |
| HB, [IQR], g/L | 137.0 [123.0, 150.0] | 137.0 [123.0, 151.0] | 137.0 [122.0, 150.0] | 0.925 |
| RBC, [IQR], ×1012/L | 4.5 [4.0, 4.9] | 4.5 [4.0, 4.9] | 4.5 [4.0, 4.9] | 0.744 |
| PLT, [IQR], ×109/L | 203.0 [167.0, 248.0] | 205.0 [169.0, 249.0] | 200.0 [163.0, 246.0] | 0.177 |
| ALT, [IQR], U/L | 31.0 [21.0, 48.0] | 30.5 [20.0, 48.0] | 32.0 [21.0, 49.0] | 0.284 |
| AST, n [%] | 0.876 | |||
| <500 U/L | 2,217 [97] | 1,552 [97] | 665 [97] | |
| 500–1,000 U/L | 45 [2] | 31 [2] | 14 [2] | |
| ≥1,000 U/L | 20 [1] | 15 [1] | 5 [1] | |
| GGT, [IQR], IU/L | 34.0 [22.0, 58.0] | 34.0 [22.0, 58.0] | 34.0 [21.0, 59.0] | 0.655 |
| CK, [IQR], U/L | 480.0 [174.0, 1,335.5] | 479.5 [174.2, 1,329.2] | 481.5 [173.5, 1,350.5] | 0.96 |
| CKMB, [IQR], U/L | 51.5 [24.0, 124.0] | 51.0 [24.0, 123.0] | 52.5 [24.0, 128.0] | 0.85 |
| LDH, [IQR], U/L | 373.0 [264.2, 596.8] | 371.5 [265.2, 584.0] | 381.0 [263.0, 617.8] | 0.642 |
| HBDH, [IQR], U/L | 263.0 [173.0, 461.0] | 258.5 [173.2, 456.0] | 272.0 [169.8, 477.2] | 0.63 |
| CTnT, [IQR], ng/L | 796.1 [203.2, 2,563.0] | 809.0 [212.2, 2,563.0] | 778.2 [179.2, 2,564.2] | 0.495 |
| BNP, [IQR], pg/mL | 883.7 [227.2, 2,632.0] | 881.7 [236.6, 2,718.5] | 885.9 [209.0, 2,522.8] | 0.392 |
Values are expressed as medians with IQR for continuous data. Other values are presented as numbers and percentages. CKD, chronic kidney disease; FMC, first medical contact; WBC, white blood cell; Shock index, ratio of HR to SBP; PLR, ratio of platelets to lymphocytes; NLR, ratio of neutrophils to lymphocytes; MLR, ratio of monocytes to lymphocytes; SIRI, systemic inflammatory response index; SII, systemic inflammatory reaction index; HB, hemoglobin; RBC, red blood cell; PLT, platelet; ALT, alanine aminotransferase; AST, aspartate aminotransferase; GGT, gamma glutamyl transferase; CK, creatine kinase; CKMB, creatine kinase MB; LDH, lactate dehydrogenase; HBDH, hydroxybutyrate dehydrogenase; CTnT, cardiac troponin T; BNP, brain natriuretic peptide; IQR, interquartile ranges.
A comparison of the model performances with the training and test datasets
| Model | Training dataset | Test dataset | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Acc | AUC | Recall | Precision | Gini | Acc | AUC | Recall | Precision | Gini | ||
| LR | 0.934 | 0.826 | 0.203 | 0.686 | 0.653 | 0.937 | 0.823 | 0.308 | 0.696 | 0.645 | |
| LASSO | 0.927 | 0.803 | 0.085 | 0.556 | 0.605 | 0.931 | 0.822 | 0.212 | 0.647 | 0.643 | |
| XGBoost | 1 | 1 | 1 | 1 | 1 | 0.927 | 0.782 | 0.25 | 0.541 | 0.566 | |
| LightGBM | 1 | 1 | 1 | 1 | 1 | 0.928 | 0.803 | 0.25 | 0.565 | 0.606 | |
| SVM | 0.928 | 0.767 | 0.042 | 0.833 | 0.535 | 0.927 | 0.778 | 0.077 | 0.667 | 0.557 | |
LR, logistic regression; LASSO, least absolute shrinkage and selection operator; XGBoost, extreme gradient boosting; LightGBM, light gradient boosting machine; SVM, support vector regression; Acc, accuracy; AUC, area under the curve.
Figure 2LASSO risk model that incorporated the 8 independent predictors was developed and presented as a nomogram. (A) Proposed CS nomogram; (B) the nomogram results for patient #6 are illustrated by mapping the values to the covariate scales. The incidence estimated for late CS after admission is 0.352. *, P<0.05; **, P<0.01; ***, P<0.001. CS, cardiogenic shock; LDH, lactate dehydrogenase; AST, aspartate aminotransferase; HB, hemoglobin; WBC, white blood cell; CKD, chronic kidney disease.
Figure 3Performance of the LASSO risk nomogram model. (A) Calibration curves for the predictions of the CS nomogram in the cohort. The solid line represents the performance of the nomogram; a closer fit to the diagonal dotted line represents a better prediction. (B) Predictive accuracy of the LASSO model, GRACE model, simple-ORBI model (based only on admission variables in the ORBI score, such as age, previous stroke/TIA, presentation with cardiac arrest, anterior myocardial infarction, FMC delay >90 min, Killip Class II or III, heart rate >90 beats/min, and the combination of systolic blood pressure <125 mmHg and pulse pressure <45 mmHg) and shock index model for late CS. (C) Decision curve analysis for the CS nomogram. The LASSO nomogram model (red) demonstrated an improved net benefit compared with the simple-ORBI model (green), the GRACE model (blue), and a model based on admission shock index (orange). (D) Clinical impact curve for the LASSO nomogram model. The heavy red solid line shows the total number of patients out of 1,000 who would be deemed high risk for each risk threshold. The blue dashed line shows how many of those patients would be true positive cases. CS, cardiogenic shock; LASSO, least absolute shrinkage and selection operator; GRACE, Global Registry of Acute Coronary Events; ORBI, Brittany Regional Infarction Observatory; TIA, transient ischemic attack; FMC, first medical contact.