| Literature DB >> 35966564 |
Yuchen Gao1, Xiaojie Liu2, Lijuan Wang1, Sudena Wang1, Yang Yu1, Yao Ding1, Jingcan Wang1, Hushan Ao1.
Abstract
Objectives: Postoperative major bleeding is a common problem in patients undergoing cardiac surgery and is associated with poor outcomes. We evaluated the performance of machine learning (ML) methods to predict postoperative major bleeding.Entities:
Keywords: cardiac surgery; coronary artery bypass graft (CABG) surgery; machine learning; major bleeding; prediction model
Year: 2022 PMID: 35966564 PMCID: PMC9366116 DOI: 10.3389/fcvm.2022.881881
Source DB: PubMed Journal: Front Cardiovasc Med ISSN: 2297-055X
Baseline characteristics of the training set.
| Variables | Total ( | No major bleeding ( | Major bleeding ( |
|
| Age (years) | 62 (55–67) | 61 (55–66) | 65 (59–69) | 0.017 |
| Male, n (%) | 559 (76.37) | 524 (77.06) | 35 (67.31) | 0.127 |
| Height (m) | 1.68 (1.62–1.72) | 1.69 (1.63–1.72) | 165 (1.60–1.70) | 0.007 |
| Weight (kg) | 72.00 (65.00–80.00) | 72.00 (65.00–80.00) | 67.25 (62.00–74.25) | 0.001 |
| BSA (m2) | 1.83 ± 0.17 | 1.84 ± 0.17 | 1.76 ± 0.2 | 0.002 |
| Smoking history, n (%) | 375 (51.23) | 349 (51.32) | 26 (50.00) | 0.886 |
| Angina, n (%) | 699 (95.49) | 647 (95.15) | 52 (100.00) | 0.160 |
| Myocardial infarction, n (%) | 62 (8.47) | 57 (8.38) | 5 (9.62) | 0.961 |
| Arrhythmia, n (%) | 22 (3.01) | 20 (2.94) | 2 (3.85) | 0.665 |
| Previous surgery, n (%) | 176 (24.04) | 164 (24.12) | 12 (23.08) | 0.866 |
| Diabetes, n (%) | 272 (37.16) | 260 (38.24) | 12 (23.08) | 0.036 |
| Hyperlipidemia, n (%) | 603 (82.38) | 559 (82.21) | 44 (84.62) | 0.850 |
| Hypertension, n (%) | 459 (62.70) | 434 (63.82) | 25 (48.08) | 0.026 |
| Kidney failure, n (%) | 21 (2.87) | 20 (2.94) | 1 (1.92) | > 0.99 |
| Dialysis, n (%) | 17 (2.32) | 17 (2.50) | 0 (0) | 0.625 |
| Chronic pulmonary disease, n (%) | 11 (1.50) | 11 (1.62) | 0 (0) | > 0.99 |
| Congestive heart failure, n (%) | 10 (1.37) | 8 (1.18) | 2 (3.85) | 0.155 |
| Anemia, n (%) | 183 (25.00) | 162 (23.82) | 21 (40.38) | 0.012 |
| Peripheral vascular disease, n (%) | 174 (23.77) | 161 (23.68) | 13 (25.00) | 0.866 |
| Venous disease, n (%) | 46 (6.28) | 43 (6.32) | 3 (5.77) | > 0.99 |
| Cerebrovascular disease, n (%) | 84 (11.48) | 77 (11.32) | 7 (13.46) | 0.651 |
| Previous PTCA, n (%) | 16 (2.19) | 14 (2.06) | 2 (3.85) | 0.316 |
| Previous thrombolysis, n (%) | 5 (0.68) | 4 (0.59) | 1 (1.92) | 0.309 |
| CHD family history, n (%) | 77 (10.52) | 69 (10.15) | 8 (15.38) | 0.240 |
| Preoperative statin use, n (%) | 307 (41.94) | 286 (42.06) | 21 (40.38) | 0.885 |
| Preoperative anticoagulant use, n (%) | 639 (87.30) | 596 (87.65) | 43 (82.69) | 0.284 |
| Antiplatelet drugs pause < 5 days, n (%) | 12 (1.64) | 10 (1.47) | 2 (3.85) | 0.207 |
| Left main coronary artery disease, n (%) | 134 (18.31) | 127 (18.68) | 7 (13.46) | 0.457 |
| RBC (×1012/L) | 4.44 ± 0.52 | 4.45 ± 0.51 | 4.32 ± 0.54 | 0.064 |
| WBC (×109/L) | 6.36 (5.31–7.41) | 6.36 (5.30–7.41) | 6.31 (5.35–7.41) | 0.934 |
| PLT (×109/L) | 208 (176–247) | 209 (177–249) | 199 (167–228) | 0.066 |
| Platelet distribution width (fL) | 12.30 (11.20–13.80) | 12.30 (11.20–13.80) | 12.35 (11.20–13.90) | 0.841 |
| Platelet volume (fL) | 10.50 (10.00–11.20) | 10.5 (10.00–11.20) | 10.45 (10.07–11.33) | 0.637 |
| Platelet-large cell ration (%) | 29.40 (24.70–34.90) | 29.40 (24.58–34.73) | 28.65 (26.15–35.42) | 0.653 |
| Thrombocytocrit (%) | 0.22 (0.19–0.26) | 0.22 (0.19–0.26) | 0.21 (0.19–0.23) | 0.096 |
| Hemoglobin (g/L) | 137 (126–146) | 137 (127–146) | 137 (120–143) | 0.045 |
| Total protein (g/L) | 66.05 (62.90–70.10) | 66.15 (63.00–70.30) | 65.20 (61.65–68.03) | 0.061 |
| Albumin (g/L) | 40.80 (38.70–43.42) | 40.85 (38.70–43.60) | 40.40 (38.48–42.20) | 0.097 |
| Potassium (mmol/L) | 4.03 (3.77–4.25) | 4.03 (3.77–4.26) | 4.02 (3.72–4.13) | 0.326 |
| Sodium (mmol/L) | 141.10 (139.12–143.01) | 141.17 (139.13–143.01) | 140.66 (138.74–142.74) | 0.630 |
| Calcium (mmol/L) | 2.26 (2.19–2.34) | 2.26 (2.19–2.34) | 2.27 (2.19–2.33) | 0.887 |
| Glucose (mmol/L) | 5.34 (4.69–6.54) | 5.36 (4.69–6.55) | 5.05 (4.64–6.18) | 0.167 |
| BUN (mmol/L) | 5.23 (4.19–6.44) | 5.29 (4.19–6.44) | 5.07 (4.22–6.30) | 0.431 |
| Creatine (μmol/L) | 82.00 (70.48–94.00) | 82.00 (70.27–94.00) | 84.29 (72.30–95.43) | 0.527 |
| GFR (ml/min/1.73 m2) | 84.29 (71.51–93.94) | 84.52 (72.14–94.24) | 78.37 (64.62–92.18) | 0.051 |
| HSCRP (mg/L) | 1.38 (0.66–3.10) | 1.42 (0.67–3.10) | 1.08 (0.50–2.99) | 0.293 |
| NT-proBNP (pg/ml) | 156.35 (62.98–369.52) | 154.05 (62.27–373.00) | 177.75 (86.90–315.28) | 0.532 |
| PT (s) | 13.10 (12.60–13.60) | 13.10 (12.60–13.53) | 13.30 (12.97–13.62) | 0.045 |
| INR (R) | 1.00 (0.96–1.04) | 1.00 (0.95–1.04) | 1.02 (0.99–1.06) | 0.047 |
| CPB or not, n (%) | 477 (65.16) | 444 (65.29) | 33 (63.46) | 0.765 |
| Operation time (h) | 3.80 (3.30–4.50) | 3.80 (3.30–4.40) | 3.90 (3.38–4.93) | 0.275 |
| Blood loss (ml) | 589.54 (75.17) | 587.76 (68.81) | 612.69 (131.82) | 0.021 |
| Intraoperative transfusion, n (%) | 14 (1.91) | 7 (1.03) | 7 (13.46) | < 0.001 |
| Intraoperative urine output (ml/kg/h) | 3.16 (2.02) | 3.13 (2.01) | 3.54 (2.11) | 0.159 |
| Hemoglobin decrease (g/L) | 25.00 (16.00–33.00) | 25.00 (16.00–33.00) | 27.00 (14.25–33.00) | 0.961 |
| Postoperative first Creatine (μmol/L) | 70.27 (60.88–81.90) | 70.31 (60.88–81.80) | 68.97 (60.85–84.61) | 0.707 |
| Postoperative first NT-proBNP (pg/ml) | 602.10 (350.15–1060.75) | 596.30 (342.08–1042.75) | 690.90 (409.08–1124.25) | 0.108 |
| Preoperative hospital LOS (d) | 6 (4–9) | 6 (4–9) | 6 (4–8) | 0.596 |
| TRUST score | 2 (1–3) | 2 (1–3) | 2 (1–3) | 0.002 |
| WILL-BLEED score | 1 (1–3) | 1 (1–3) | 3 (1–4) | 0.013 |
BSA, body surface area; PTCA, percutaneous transluminal coronary angioplasty; CHD, coronary heart disease; RBC, red blood cell; WBC, white blood cell; PLT, platelet; BUN, blood urea nitrogen; GFR, glomerular filtration rate; HSCRP, high sensitivity C reactive protein; NT-proBNP, N-terminal prohormone of brain natriuretic peptide; PT, prothrombin time; INR, international normalized ratio; LOS, length of stay.
Final logistic regression model.
| Variables | OR (95%CI) | |
| BSA | 0.018 (0.002–0.133) | <0.001 |
| Diabetes | 0.537 (0.256–1.055) | 0.083 |
| Hypertension | 0.426 (0.577–0.827) | 0.008 |
| Total protein | 0.931 (0.882–0.981) | 0.008 |
| Creatine | 1.023 (1.004–1.041) | 0.014 |
| NT-proBNP | 1.000 (0.999–1.000) | 0.025 |
| Blood loss | 1.004 (1.000–1.007) | 0.029 |
| Intraoperative transfusion | 21.968 (5.879–84.392) | <0.001 |
OR, odds ratio; NT-proBNP, N-terminal prohormone of brain natriuretic peptide.
*The backward stepwise selection procedure base on Akaike information criterion was performed to select variables.
Prediction models of postoperative major bleeding in the test set.
| Models | AUC (95%CI) | |||
| TRUST | 0.629 (0.517–0.741) | |||
| WILL-BLEED | 0.557 (0.449–0.665) | |||
| Logistic regression | 0.702 (0.577–0.827) | 0.347 | 0.103 | |
| Support vector machine | 0.792 (0.678–0.907) | 0.016 | 0.002 | 0.031 |
| Xgboost | 0.802 (0.691–0.913) | 0.028 | <0.001 | 0.120 |
| Random forest | 0.810 (0.719–0.902) | 0.005 | <0.001 | 0.084 |
| Conditional inference random forest | 0.831 (0.732–0.930) | 0.002 | <0.001 | 0.027 |
| Stochastic gradient boosting | 0.811 (0.739–0.883) | <0.001 | <0.001 | 0.073 |
| Naïve Bayes | 0.687 (0.561–0.813) | 0.468 | 0.059 | 0.842 |
| Bagged CART | 0.791 (0.706–0.876) | 0.008 | <0.001 | 0.098 |
| Boosted classification trees | 0.794 (0.675–0.913) | 0.007 | 0.003 | 0.117 |
AUC, the area under the receiver operating characteristic curve; CART, classification and regression tree.
*Compared with TRUST.
†Compared with WILL-BLEED.
‡Compared with the logistic regression model.
FIGURE 1Receiver-operating-characteristics (ROC) curves of the conventional model and machine learning models for postoperative major bleeding in the test set. The corresponding values of the area under the receiver-operating-characteristics curve (AUC) for each model are presented in Table 2.
FIGURE 2Decision curve analysis of the conventional models and machine learning-based models. The X-axis indicates the threshold probability and Y-axis indicates the net benefit. The red solid line indicates the net benefit of all patients developing postoperative major bleeding events. Ochre’s solid line indicates the net benefit of no patients developing postoperative major bleeding events. LR, logistic regression; CIRF, conditional inference random forest; SGBT, stochastic gradient boosting.
FIGURE 3Calibration belts of (A) logistic regression. (B) Random forest. (C) Conditional inference random forest. (D) Stochastic gradient boosting for postoperative major bleeding prediction in the test set.
FIGURE 4Variable importance of predictors in the ML models. The variable importance is a scaled measure to have a maximum value of 100. The predictors with variable importance of the top 20 are shown. (A) Conditional inference random forest. (B) Xgboost. (C) Random forest. (D) Stochastic gradient boosting.