| Literature DB >> 33658832 |
Wenhan Wu1,2, Zongguang Zhou1,2.
Abstract
PURPOSE: This study aimed to use traditional statistics and machine learning to develop and validate prediction models for predicting hospital death in patients with AMI and compare these models' performance. PATIENTS AND METHODS: Data were retrieved from the Medical Information Mart for Intensive Care (MIMIC III) electronic clinical database. A total of 338 eligible AMI patients were divided into a training cohort (n = 238) and a validation cohort (n = 100), and all patients were divided into survival groups and nonsurvival groups according to patients' hospital outcomes. The performance of the traditional statistics prediction model and the optimal machine learning prediction model was evaluated and compared with respect to discrimination, calibration, and clinical utility in the validation cohort.Entities:
Keywords: acute mesenteric ischemia; hospital mortality; machine learning; nomogram; prediction model
Year: 2021 PMID: 33658832 PMCID: PMC7920592 DOI: 10.2147/IJGM.S300492
Source DB: PubMed Journal: Int J Gen Med ISSN: 1178-7074
Figure 1Diagram of developing AMI hospital mortality prediction models.
Baseline Patient Demographics, Clinical and Laboratory Characteristics, and Outcomes
| Patient Characteristics | Training Cohort (n = 238) | Validation Cohort (n = 100) | ||||
|---|---|---|---|---|---|---|
| Survivors (n = 159) | Nonsurvivors (n = 79) | P-value | Survivors (n = 62) | Nonsurvivors (n = 38) | P-value | |
| Temperature, Fahrenheit, IQR | 98.5 (97.8–99.6) | 98.1 (97.1–99.0) | 0.182 | 98.2 (97.6–99.5) | 98.7 (98.1–101.0) | 0.286 |
| Heart rate | 91.5±19.0 | 91.6±23.3 | 0.711 | 92.0±20.1 | 90.5±16.2 | 0.700 |
| SBP, mmHg | 124±27 | 108±29 | <0.001 | 123±28 | 109±27 | 0.011 |
| DBP, mmHg | 74±17 | 60±15 | <0.001 | 71±18 | 57±14 | <0.001 |
| RR | 20±7 | 20±5 | 0.894 | 20±4 | 21±5.0 | 0.543 |
| Age | 66.2±14.9 | 71.9±13.2 | 0.004 | 64.8±16.3 | 69.1±14.0 | 0.031 |
| Male (%) | 75 (47.2) | 42 (53.2) | 0.384 | 27 (43.5) | 18 (47.4) | 0.709 |
| Myocardial infarction (%) | 4 (2.5) | 2 (2.5) | 1.000 | 2 (3.2) | 4 (10.5) | 0.290 |
| CHF (%) | 16 (10.1) | 17 (21.5) | 0.016 | 10 (16.1) | 6 (15.6) | 0.964 |
| PVD (%) | 24 (15.1) | 12 (15.2) | 0.985 | 13 (31.0) | 8 (21.1) | 0.992 |
| Dementia (%) | 5 (3.1) | 6 (7.6) | 0.225 | 1 (1.6) | 2 (5.3) | 0.664 |
| COPD (%) | 25 (15.7) | 15 (19.0) | 0.526 | 14 (22.6) | 5 (13.2) | 0.244 |
| CTD (%) | 4 (2.5) | 3 (3.8) | 0.886 | 2 (3.2) | 3 (7.9) | 0.571 |
| Peptic ulcer (%) | 7 (4.4) | 8 (10.1) | 0.153 | 4 (6.5) | 1 (2.6) | 0.705 |
| Diabetes (%) | 41 (25.8) | 21 (16.6) | 0.895 | 15 (24.2) | 14 (36.8) | 0.176 |
| CKD (%) | 21 (13.2) | 24 (30.4) | 0.001 | 7 (11.3) | 10 (26.3) | 0.042 |
| Hemiplegia (%) | 1 (0.6) | 4 (5.1) | 0.077 | 0 (0) | 0 (0) | 1.000 |
| Malignant lymphoma (%) | 0 (0) | 1 (1.3) | 0.332 | 1 (1.6) | 0 (0) | 1.000 |
| Solid tumor (%) | 23 (14.5) | 9 (11.4) | 0.513 | 8 (12.9) | 7 (18.4) | 0.453 |
| Liver disease (%) | 13 (8.2) | 10 (12.7) | 0.270 | 8 (12.9) | 4 (10.5) | 0.970 |
| AIDS (%) | 2 (1.3) | 0 (0) | 1.000 | 2 (3.2) | 0 (0) | 0.524 |
| AF (%) | 27 (17.0) | 13 (17.7) | 0.887 | 11 (17.7) | 7 (18.4) | 0.932 |
| CAD (%) | 44 (27.7) | 21 (26.6) | 0.859 | 17 (27.4) | 13 (34.2) | 0.472 |
| Hypertension (%) | 98 (61.6) | 48 (60.8) | 0.896 | 29 (46.8) | 20 (52.6) | 0.570 |
| Blood, Hematocrit, % | 34.4±6.1 | 32.5±6.9 | 0.029 | 33.7±6.2 | 32.1±5.1 | 0.169 |
| Blood, INR | 1.6±1.0 | 1.8±1.2 | 0.072 | 1.7±0.9 | 1.7±0.8 | 0.750 |
| Blood, MCH, picograms per cell | 30.3±2.5 | 30.7±2.8 | 0.331 | 30.7±2.0 | 30.4±2.8 | 0.370 |
| Blood, MCHC, g/dL | 33.7±1.5 | 33.2±1.3 | 0.014 | 33.6±1.5 | 32.9±1.9 | 0.032 |
| Blood, MCV, femtoliters/cell | 90.2±6.5 | 92.5±7.6 | 0.013 | 91.4±6.1 | 90.3±6.2 | 0.368 |
| Blood, Platelet Count, *109/L | 236±128 | 200±132 | 0.044 | 236±146 | 229±127 | 0.813 |
| Blood, PT, s | 16.7±7.4 | 18.6±8.2 | 0.073 | 16.7±5.8 | 16.4±3.5 | 0.787 |
| Blood, PTT, s | 32.9 (30.5–41.8) | 38.4 (30.1–50.0) | 0.054 | 35.1 (28.3–42.6) | 34.8 (28.1–45.0) | 0.837 |
| Blood, RDW, % | 14.7±1.6 | 15.7±2.3 | <0.001 | 15.6±2.3 | 16.1±2.4 | 0.301 |
| Blood, Red Blood Cells, *1012/L | 3.7 (3.3–4.3) | 3.6 (3.1–4.1) | 0.128 | 3.8 (3.2–4.1) | 3.6 (3.2–3.9) | 0.303 |
| Blood, White Blood Cells, *109/L | 10.0 (6.5–14.9) | 10.2 (6.3–17.3) | 0.927 | 10.9 (6.5–16.5) | 13.7 (9.0–17.2) | 0.303 |
| Blood, Hemoglobin, g/dL | 11.5±2.3 | 10.9±2.1 | 0.064 | 11.3±2.1 | 10.7±1.9 | 0.218 |
| Blood, Glucose, mg/dL | 137 (110–174) | 131 (104–188) | 0.713 | 133 (110–191) | 108 (89–138) | 0.064 |
| Blood, Lactate, mmol/L | 2.0 (1.3–3.2) | 3.0 (2.2–6.1) | <0.001 | 2.1 (1.4–3.4) | 2.9 (2.3–4.9) | 0.007 |
| Blood, PH | 7.37 (7.30–7.42) | 7.31 (7.20–7.39) | 0.008 | 7.36 (7.30–7.44) | 7.32 (7.23–7.36) | 0.007 |
| Blood, Anion gap, mmol/L | 14 (12–16) | 17 (14–21) | <0.001 | 14 (12–16) | 16 (13–19) | 0.064 |
| Blood, Bicarbonate, mmol/L | 22.0 (20.0–25.5) | 21.0 (16.5–25.0) | 0.163 | 22 (19–25) | 21.5 (18–25) | 0.973 |
| Blood, Calcium, mg/dL | 8.1±1.1 | 8.4±1.3 | 0.169 | 7.8±0.8 | 8.2±1.2 | 0.105 |
| Blood, Chloride, mmol/L | 105.7±6.6 | 105.1±6.9 | 0.467 | 106.0±7.1 | 103.7±7.5 | 0.120 |
| Blood, Creatinine, mg/dL | 1.4±1.2 | 2.3±1.9 | <0.001 | 1.3±0.9 | 2.3±2.2 | 0.001 |
| Blood, Magnesium, mg/dL | 1.8±0.5 | 1.9±0.5 | 0.063 | 1.9±0.9 | 1.9±0.3 | 0.874 |
| Blood, Phosphate, mg/dL | 3.7 (3.0–4.2) | 4.0 (3.3–4.9) | 0.302 | 3.3 (2.7–4.4) | 3.8 (2.8–5.0) | 0.276 |
| Blood, Potassium, mmol/L | 4.1±0.7 | 4.3±0.8 | 0.154 | 4.1±0.8 | 4.1±0.8 | 0.984 |
| Blood, Sodium, mmol/L | 139 (136–142) | 139 (136–143) | 0.856 | 139 (135–141) | 138 (135–142) | 0.537 |
| Blood, Urea Nitrogen, mg/dL | 22 (15–34) | 30 (23–44) | 0.002 | 24 (15–40) | 30 (20–41) | 0.837 |
| Surgery | 122 (76.7) | 57 (72.2) | 0.441 | 44 (71.0) | 29 (76.3) | 0.559 |
Abbreviations: AF, atrial fibrillation; CAD, coronary artery disease; CHF, congestive heart failure; CKD, chronic kidney disease; COPD, chronic obstructive pulmonary disease; CTD, connective tissue disease; DBP, diastolic blood pressure; MCH, mean corpuscular hemoglobin; MCHC, mean corpuscular hemoglobin concentration; MCV, mean corpuscular volume; PT, prothrombin time; PTT, partial thromboplastin time; PVD, peripheral vascular disease; RDW, red blood cell distribution width; RR, respiratory rate; SBP, systolic blood pressure.
Univariate Analysis for Potential Risk Factors in the Training Cohort
| Variables | OR (95% CI) | P-value |
|---|---|---|
| SBP, mmHg | 0.976 (0.964–0.988) | <0.001 |
| DBP, mmHg | 0.954 (0.936–0.972) | <0.001 |
| Age | 1.029 (1.009–1.050) | <0.001 |
| CHF | 2.451 (1.163–5.162) | 0.019 |
| CKD | 2.868 (1.476–5.569) | 0.002 |
| Blood, Hematocrit, % | 0.953 (0.912–0.995) | 0.027 |
| Blood, MCHC, g/dL | 0.790 (0.653–0.957) | 0.014 |
| Blood, MCV, femtoliters/cell | 1.051 (1.010–1.094) | 0.013 |
| Blood, Platelet Count, *109/L | 0.998 (0.995–1.000) | 0.036 |
| Blood, RDW, % | 1.339 (1.150–1.560) | <0.001 |
| Blood, Lactate, mmol/L | 1.471 (1.274–1.698) | <0.001 |
| Blood, PH | 0.003 (0.001–0.041) | <0.001 |
| Blood, Anion gap, mmol/L | 1.194 (1.109–1.285) | <0.001 |
| Blood, Creatinine, mg/dL | 1.472 (1.195–1.813) | <0.001 |
| Blood, Urea Nitrogen, mg/dL | 1.020 (1.007–1.032) | 0.001 |
Abbreviations: CHF, congestive heart failure; CKD, chronic kidney disease; DBP, diastolic blood pressure; MCHC, mean corpuscular hemoglobin concentration; MCV, mean corpuscular volume; RDW, red blood cell distribution width; SBP, systolic blood pressure.
Multivariate Logistic Regression Model for Hospital Mortality in the Training Cohort
| Variables | β | SE | OR (95% CI) | P-value |
|---|---|---|---|---|
| DBP | −0.046 | 0.011 | 0.955 (0.934–0.976) | <0.001 |
| Blood, Lactate | 0.341 | 0.088 | 1.407 (1.185–1.671) | <0.001 |
| Blood, pH | −4.705 | 1.849 | 0.009 (0.001–0.339) | 0.011 |
| Blood, Creatinine | 0.421 | 0.118 | 1.524 (1.210–1.919) | <0.001 |
| Blood, RDW | 0.358 | 0.094 | 1.431 (1.190–1.720) | <0.001 |
| Age | 0.047 | 0.014 | 1.048 (1.019–1.077) | 0.001 |
Abbreviations: DBP, diastolic blood pressure; RDW, red blood cell distribution width; OR, odds ratio; CI, confidence interval; SE, standard error.
Figure 2The hospital death risk-prediction nomogram for AMI.
The Comparison of Various Machine Learning Classifiers’ Performance Using Different Variable Selection Methods in the Validation Cohort
| Clinical Variables Selection | Lasso | Boruta | ||||
|---|---|---|---|---|---|---|
| Classification Algorithms | SVM | XGBoost | ELM | SVM | XGBoost | ELM |
| Sensitivity, % | 94.7 | 84.2 | 63.2 | 73.7 | 76.3 | 55.3 |
| Specificity, % | 50.0 | 67.7 | 82.3 | 75.8 | 77.4 | 87.1 |
| AUC, % | 78.3 | 79.6 | 72.7 | 80.9 | 82.9 | 71.2 |
Abbreviations: ELM, extreme learning machine; SVM, support vector machine.
Figure 3Comparison of ROC curves among nomogram and machine learning classifiers for the prediction of hospital death of patients with AMI in the validation cohort.
Figure 4Calibration curves for the nomogram and the optimal machine learning classifier (XGBoost using clinical variables determined by Boruta) in the validation cohort.
Figure 5Decision curves for the nomogram and the optimal machine learning classifier (XGBoost using clinical variables determined by Boruta) in the validation cohort.