| Literature DB >> 36172420 |
Arjun Verma1, Yas Sanaiha1, Joseph Hadaya1, Anthony Jason Maltagliati2, Zachary Tran1, Ramin Ramezani3, Richard J Shemin4, Peyman Benharash1,4.
Abstract
Objective: We sought to several develop parsimonious machine learning models to predict resource utilization and clinical outcomes following cardiac operations using only preoperative factors.Entities:
Keywords: AKI, acute kidney injury; AUC, area under the receiver operating characteristic; CABG, coronary artery bypass grafting; COVID-19; GBM, gradient boosted machine; ICU, intensive care unit; LOS, length of stay; ML, machine learning; RF, random forest; STS, Society of Thoracic Surgeons; UCCSC, University of California Cardiac Surgery Consortium; XGBoost, extreme gradient boosting; cardiac surgery; length of stay; machine learning; resource utilization
Year: 2022 PMID: 36172420 PMCID: PMC9510828 DOI: 10.1016/j.xjon.2022.04.017
Source DB: PubMed Journal: JTCVS Open ISSN: 2666-2736
Figure E1Study Consolidated Standards of Reporting Trials diagram. CABG, Coronary artery bypass grafting; UCCSC, University of California Cardiac Surgery Consortium; LOS, length of stay; ICU LOS, intensive care unit length of stay.
Distribution of missing variables in the entire study cohort, as well as between the derivation and validation cohorts
| Parameter | Overall (n = 6316) | Derivation (n = 5028) | Validation (n = 1288) |
|---|---|---|---|
| Age | 0 | 0 | 0 |
| Elective admission | 0 | 0 | 0 |
| Female | 0 | 0 | 0 |
| Height (cm) | 2 | 2 | 0 |
| Weight (kg) | 1 | 1 | 0 |
| Ethnicity | 0 | 0 | 0 |
| Operative type | |||
| Isolated CABG | 0 | 0 | 0 |
| Isolated valve operation | 0 | 0 | 0 |
| CABG + valve | 0 | 0 | 0 |
| Multiple valve | 0 | 0 | 0 |
| Medical conditions | |||
| Atrial fibrillation | 0 | 0 | 0 |
| Cancer | 0 | 0 | 0 |
| Cerebrovascular disease | 0 | 0 | 0 |
| Mild lung disease | 0 | 0 | 0 |
| Severe lung disease | 0 | 0 | 0 |
| Congestive heart failure | 0 | 0 | 0 |
| Diabetes | 0 | 0 | 0 |
| Home oxygen | 0 | 0 | 0 |
| Hypertension | 0 | 0 | 0 |
| Infectious endocarditis | 0 | 0 | 0 |
| Intra-aortic balloon pump | 0 | 0 | 0 |
| Liver disease | 0 | 0 | 0 |
| Mediastinal radiation | 0 | 0 | 0 |
| Peripheral vascular disease | 0 | 0 | 0 |
| Prior myocardial infarction | 0 | 0 | 0 |
| Syncope | 0 | 0 | 0 |
| Thoracic aortic disease | 0 | 0 | 0 |
| Current tobacco use | 0 | 0 | 0 |
| Laboratory values | |||
| Hematocrit (%) | 39 | 37 | 2 |
| International normalized ratio | 275 | 232 | 33 |
| Serum albumin (g/dL) | 940 | 793 | 147 |
| Preoperative creatinine (mg/dL) | 25 | 22 | 3 |
| Ejection fraction (%) | 198 | 178 | 20 |
| Hospital of operation | |||
| Center 1 | 0 | 0 | 0 |
| Center 2 | 0 | 0 | 0 |
| Center 3 | 0 | 0 | 0 |
| Center 4 | 0 | 0 | 0 |
| Center 5 | 0 | 0 | 0 |
CABG, Coronary artery bypass grafting.
Figure E2Schematic representing the algorithmic design of random forest, gradient boosted machines (GBM) and extreme gradient boosting (XGBoost).
Hyperparameters for each machine learning model developed in the present study. Parameters not mentioned here were set as the default value
| Outcome | GBM | RF | XGBoost |
|---|---|---|---|
| Length of stay | {'max_depth': 3, 'max_features': 8, 'n_estimators': 60} | {'max_depth': 6, 'max_features': 7, 'n_estimators': 40} | {'alpha': 1, 'max_depth': 2, 'n_estimators': 30} |
| ICU length of stay | {'max_depth': 5, 'max_features': 2, 'n_estimators': 70} | {'max_depth': 6, 'max_features': 6, 'n_estimators': 40} | {'alpha': 10, 'max_depth': 3, 'n_estimators': 20} |
| Mortality | {'max_depth': 2, 'max_features': 7, 'n_estimators': 20} | {'max_depth': 3, 'max_features': 2, 'n_estimators': 10} | {'alpha': 10, 'max_depth': 2, 'n_estimators': 10} |
| Acute kidney injury | {'max_depth': 3, 'max_features': 10, 'n_estimators': 30} | {'max_depth': 4, 'max_features': 9, 'n_estimators': 50} | {'alpha': 1, 'max_depth': 6, 'n_estimators': 30} |
| Postoperative transfusion | {'max_depth': 2, 'max_features': 5, 'n_estimators': 10} | {'max_depth': 2, 'max_features': 5, 'n_estimators': 10} | {'alpha': 10, 'max_depth': 2, 'n_estimators': 10} |
| Reoperation | {'max_depth': 2, 'max_features': 8, 'n_estimators': 10} | {'max_depth': 3, 'max_features': 10, 'n_estimators': 40} | {'alpha': 0.1, 'max_depth': 2, 'n_estimators': 10} |
GBM, Gradient boosted machine; RF, Random forest; XGBoost, extreme gradient boosting; ICU, intensive care unit.
Baseline patient characteristics of the study cohort
| Parameter | Overall (n = 6316) | Derivation (n = 5028) | Validation (n = 1288) | |
|---|---|---|---|---|
| Age (y) | 63 ± 13 | 63 ± 13 | 64 ± 13 | <.001 |
| Elective admission (%) | 58.5 | 58.3 | 59.3 | .52 |
| Female (%) | 27.5 | 27.7 | 26.6 | .45 |
| Height (cm) | 171 ± 11 | 171 ± 11 | 171 ± 10 | .29 |
| Weight (kg) | 82 ± 19 | 82 ± 19 | 81 ± 20 | .57 |
| Ethnicity (%) | 19.7 | 19.9 | 19.3 | .68 |
| Operative type (%) | ||||
| Isolated CABG | 50.5 | 51.3 | 47.4 | .012 |
| Isolated valve operation | 33.3 | 31.3 | 41.2 | <.001 |
| CABG + valve | 10.6 | 11.1 | 8.3 | .003 |
| Multiple valve | 5.8 | 6.3 | 3.6 | <.001 |
| Medical conditions (%) | ||||
| Atrial fibrillation | 17.6 | 17.6 | 17.7 | .91 |
| Cancer | 6.9 | 7.1 | 6.4 | .37 |
| Cerebrovascular disease | 17.0 | 17.2 | 16.2 | .38 |
| Severe lung disease | 3.3 | 3.2 | 3.9 | .23 |
| Congestive heart failure | 36.1 | 33.8 | 45.0 | <.001 |
| Diabetes | 38.1 | 37.6 | 39.7 | .18 |
| Home oxygen | 3.0 | 3.1 | 2.8 | .59 |
| Hypertension | 77.4 | 77.2 | 78.0 | .57 |
| Infectious endocarditis | 5.9 | 5.9 | 6.1 | .84 |
| Liver disease | 6.4 | 6.8 | 5.0 | .017 |
| Peripheral vascular disease | 9.0 | 8.5 | 11.1 | .003 |
| Prior myocardial infarction | 31.2 | 31.7 | 29.3 | .09 |
| Laboratory values | ||||
| Hematocrit (% blood volume) | 39 ± 6 | 39 ± 6 | 39 ± 6 | .01 |
| International normalized ratio | 1.13 ± 0.3 | 1.13 ± 0.3 | 1.12 ± 0.2 | .26 |
| Serum albumin (g/dL) | 3.9 ± 0.6 | 3.9 ± 0.6 | 3.9 ± 0.6 | .008 |
| Preoperative creatinine (mg/dL) | 1.4 ± 1.7 | 1.4 ± 1.6 | 1.5 ± 1.9 | <.001 |
| Ejection fraction (%) | 56 ± 12 | 56 ± 12 | 57 ± 12 | .21 |
| Hospital of operation (%) | ||||
| Center 1 | 32.7 | 31.7 | 36.5 | .001 |
| Center 2 | 24.0 | 24.7 | 21.4 | .014 |
| Center 3 | 19.1 | 19.1 | 18.9 | .84 |
| Center 4 | 14.0 | 14.9 | 10.5 | <.001 |
| Center 5 | 10.2 | 9.5 | 12.7 | <.001 |
Values are presented as mean ± SD or n. CABG, Coronary artery bypass grafting.
Covariates selected for final model development
| Selected covariates |
|---|
| Baseline characteristics |
| Age |
| Elective operation |
| Height (cm) |
| Weight (kg) |
| Operation type |
| Isolated CABG |
| Isolated valve operation |
| CABG + valve |
| Multiple valve |
| Medical conditions |
| Preoperative atrial fibrillation |
| Cerebrovascular disease |
| Mild lung disease |
| Severe lung disease |
| Congestive heart failure |
| Home oxygen |
| Previous myocardial infarction |
| Hematocrit |
| International normalized ratio |
| Serum albumin |
| Preoperative creatinine |
| Ejection fraction |
| Hospital characteristics |
| Annual hospital volume |
| No. of cardiac surgeons on staff |
| No. of low-acuity beds |
CABG, Coronary artery bypass grafting.
Resource utilization and clinical outcomes stratified by derivation and validation cohorts
| Outcome | Overall (n = 6316) | Derivation (n = 5028) | Validation (n = 1288) | |
|---|---|---|---|---|
| Resource use | ||||
| Length of stay (d) | 8 (6-13) | 8 (6-13) | 8 (5-12) | .008 |
| ICU length of stay (h) | 74 (47-116) | 75 (47-117) | 68 (43-99) | <.001 |
| Clinical end points | ||||
| Mortality | 0.9 | 1.0 | 0.7 | .39 |
| Acute kidney injury | 1.5 | 1.5 | 1.7 | .54 |
| Postoperative transfusion | 27.7 | 28.8 | 23.1 | <.001 |
| Reoperation | 8.6 | 9.1 | 6.9 | .014 |
Values are presented as median (interquartile range) or %. ICU, Intensive care unit.
Figure 1Coefficient of determination (R2) versus covariate set size in the prediction of in-hospital length of stay. LR, Linear regression; GBM, gradient boosted machine.
Performance of each algorithm when predicting resource utilization and clinical outcomes in the validation cohort
| Outcome | Linear | Logistic | GBM | RF | XGBoost | STS |
|---|---|---|---|---|---|---|
| Resource use | – | |||||
| Length of stay | 0.42 | – | 0.47 | 0.47 | 0.47 | – |
| ICU length of stay | 0.017 | – | 0.078 | 0.054 | 0.082 | |
| Clinical end point | ||||||
| Mortality | – | 0.68 | 0.68 | 0.7 | 0.72 | 0.91 |
| Acute kidney injury | – | 0.77 | 0.79 | 0.8 | 0.8 | 0.84 |
| Postoperative transfusion | – | 0.69 | 0.68 | 0.68 | 0.67 | – |
| Reoperation | – | 0.78 | 0.79 | 0.8 | 0.78 | 0.76 |
GBM, Gradient boosted machine; RF, random forest; XGBoost, extreme gradient boosting; STS, Society of Thoracic Surgeons risk score; ICU, intensive care unit.
Regressions were evaluated using the coefficient of determination (R2).
Binary classifiers were assessed with the area under the receiver operating characteristic.
Figure E3Calibration plot of observed versus predicted length of stay in days. R, Coefficient of determination; GBM, gradient boosted machine; LR, Linear regression.
Cross-validated model performance metrics for each algorithm and outcome
| Outcome | Linear | Logistic | GBM | RF | XGBoost | STS | ||
|---|---|---|---|---|---|---|---|---|
| Resource use | ||||||||
| Length of stay | 0.41 (0.41-0.41) | – | 0.42 (0.42-0.42) | 0.41 (0.40-0.41) | 0.42 (0.42-0.42) | – | <.001 | – |
| ICU length of stay | 0.15 (0.15-0.15) | – | 0.23 (0.23-0.23) | 0.21 (0.21-0.21) | 0.22 (0.22-0.22) | – | <.001 | – |
| Clinical end point | ||||||||
| Mortality | – | 0.67 (0.67-0.68) | 0.69 (0.68-0.70) | 0.69 (0.68-0.70) | 0.69 (0.69-0.70) | 0.91 (0.91-0.92) | <.001 | <.001 |
| Acute kidney injury | – | 0.67 (0.67-0.68) | 0.76 (0.75-0.77) | 0.76 (0.76-0.77) | 0.74 (0.73-0.75) | 0.84 (0.83-0.86) | <.001 | <.001 |
| Postoperative transfusion | – | 0.71 (0.71-0.72) | 0.73 (0.73-0.73) | 0.71 (0.71-0.71) | 0.73 (0.73-0.74) | – | <.001 | – |
| Reoperation | – | 0.81 (0.80-0.81) | 0.8 (0.79-0.80) | 0.80 (0.80-0.80) | 0.79 (0.79-0.80) | 0.76 (0.76-0.77) | .99 | <.001 |
Values are presented as mean (95% CI). GBM, Gradient boosted machine; RF, random forest; XGBoost, extreme gradient boosting; STS, Society of Thoracic Surgeons risk score; ICU, intensive care unit.
Models with continuous output were evaluated using the coefficient of determination (R2).
Binary classifiers were assessed with the area under the receiver operating characteristic.
Figure 2Interpretation of gradient boosted machine (GBM)-based model for prediction of length of stay (LOS) (days) using SHapley summary plots. The y-axis is ordered by increasing feature importance, and the x-axis is the marginal effect of each parameter on predicted LOS. Red dots show the influence of high feature values on predicted LOS, whereas blue dots show the influence of low feature values. CABG, Coronary artery bypass grafting.
Figure 3Interpretation of gradient boosted machine (GBM)-based model for prediction of intensive care unit length of stay (ICU LOS) (hours) using SHapley summary plots. The y-axis is ordered by increasing feature importance, and the x-axis is the marginal effect of each parameter on predicted ICU LOS. Red dots show the influence of high feature values on predicted ICU LOS, whereas blue dots show the influence of low feature values. CABG, Coronary artery bypass grafting.
Cross-validated Brier scores for binary classifiers
| Outcome | Logistic | GBM | RF | XGBoost | STS |
|---|---|---|---|---|---|
| Mortality | 0.0096 (0.0095-0.0096) | 0.0105 (0.0104-0.0107) | 0.0095 (0.0095-0.0095) | 0.0095 (0.0094-0.0095) | 0.0075 (0.0073-0.0077) |
| Acute kidney injury | 0.0146 (0.0145-0.0146) | 0.0161 (0.0159-0.0163) | 0.0144 (0.0143-0.0144) | 0.0145 (0.0145-0.0146) | 0.0175 (0.0175-0.0175) |
| Postoperative transfusion | 0.1827 (0.1823-0.1831) | 0.1781 (0.1777-0.1784) | 0.1894 (0.1892-0.1896) | 0.1755 (0.1751-0.1759) | – |
| Reoperation | 0.0727 (0.0725-0.0729) | 0.076 (0.0759-0.0761) | 0.0739 (0.0738-0.0741) | 0.0746 (0.0744-0.0747) | 0.0603 (0.0601-0.0604) |
GBM, Gradient boosted machine; RF, Random forest. XGBoost, extreme gradient boosting; STS, Society of Thoracic Surgeon Risk Score.
Evaluation of Brier score for each predictive model using data from the validation cohort
| Outcome | Logistic | GBM | RF | XGBoost | STS |
|---|---|---|---|---|---|
| Mortality | 0.0069 | 0.0083 | 0.0069 | 0.0067 | 0.0070 |
| Acute kidney injury | 0.0168 | 0.0165 | 0.0162 | 0.0175 | 0.0170 |
| Postoperative transfusion | 0.1733 | 0.1708 | 0.1715 | 0.1751 | – |
| Reoperation | 0.0586 | 0.0605 | 0.0594 | 0.0612 | 0.0597 |
GBM, Gradient boosted machine; RF, Random forest; XGBoost, extreme gradient boosting; STS, Society of Thoracic Surgeon Risk Score.
Figure 4Compared with linear regression, machine learning models exhibited superior performance in the estimation of length of stay following cardiac operations. CABG, Coronary artery bypass grafting; LOS, length of stay; ICU LOS, intensive care unit length of stay; EF, ejection fraction.