| Literature DB >> 35482751 |
Joseph Hadaya1, Arjun Verma1, Yas Sanaiha1, Ramin Ramezani2, Nida Qadir3, Peyman Benharash1.
Abstract
BACKGROUND: Emergency general surgery (EGS) operations are associated with substantial risk of morbidity including postoperative respiratory failure (PRF). While existing risk models are not widely utilized and rely on traditional statistical methods, application of machine learning (ML) in prediction of PRF following EGS remains unexplored.Entities:
Mesh:
Year: 2022 PMID: 35482751 PMCID: PMC9049563 DOI: 10.1371/journal.pone.0267733
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Relationship between covariates and AUC.
Impact of number of model covariates on area under the receiver operating characteristic (AUC).
Comparison of patient and hospital characteristics stratified by the development of postoperative respiratory failure (PRF).
| No PRF (n = 1,003,703) | PRF (n = 88,308) | ||
|---|---|---|---|
| Female (%) | 60.3 | 52.1 | <0.001 |
| Age (years) | 55.0±18.4 | 68.4±14.0 | <0.001 |
| Primary Insurer (%) | <0.001 | ||
| Private | 36.8 | 17.3 | |
| Medicare | 36.7 | 68.4 | |
| Medicaid | 17.2 | 9.4 | |
| Other Payer | 9.2 | 4.9 | |
| Income Quartile (%) | <0.001 | ||
| Fourth (Highest) | 19.0 | 17.0 | |
| Third | 25.0 | 24.1 | |
| Second | 27.9 | 29.2 | |
| First (Lowest) | 28.2 | 29.8 | |
| Operation (%) | <0.001 | ||
| Large Bowel Resection | 15.3 | 43.9 | |
| Small Bowel Resection | 8.7 | 21.3 | |
| Cholecystectomy | 62.1 | 18.1 | |
| Repair of Perforated Ulcer | 1.6 | 7.0 | |
| Lysis of Adhesions | 7.4 | 6.9 | |
| Appendectomy | 5.0 | 2.8 | |
| Elixhauser Comorbidity Index | 2.2±1.9 | 5.1±2.2 | <0.001 |
| Medical Conditions (%) | |||
| Arrhythmia | 12.7 | 40.5 | <0.001 |
| Chronic Liver Disease | 7.3 | 13.3 | <0.001 |
| Chronic Lung Disorder | 13.8 | 31.5 | <0.001 |
| Coagulopathy | 3.5 | 21.0 | <0.001 |
| Congestive Heart Failure | 5.1 | 26.4 | <0.001 |
| Coronary Artery Disease | 10.3 | 23.6 | <0.001 |
| End Stage Renal Disease | 1.3 | 4.7 | <0.001 |
| Hypothyroidism | 10.4 | 14.3 | <0.001 |
| Malignancy | 7.5 | 14.6 | <0.001 |
| Neurologic Disorder | 3.7 | 22.0 | <0.001 |
| Valve Disorder | 2.9 | 6.9 | <0.001 |
| Hospital Bed Size (%) | <0.001 | ||
| Large | 53.3 | 56.3 | |
| Medium | 29.1 | 28.6 | |
| Small | 17.6 | 15.0 | |
| Teaching Hospital (%) | 63.3 | 66.0 | <0.001 |
Categorical variables reported as frequency and continuous as mean and standard deviation.
*Other payer includes uninsured and self-pay.
Unadjusted and adjusted outcomes stratified by presence of postoperative respiratory failure (PRF).
| Outcome | No PRF (n = 1,003,703) | PRF (n = 88,308) | Estimate (Odds Ratio or β coefficient) | ||
|---|---|---|---|---|---|
| Mortality | 1.0 | 22.0 | <0.001 | 10.4 (9.8–11.1) | <0.001 |
| Non-home Discharge | 7.2 | 48.2 | <0.001 | 3.5 (3.4–3.6) | <0.001 |
| Length of Stay (days) | 3 (2–6) | 11 (7–19) | <0.001 | 3.1 (3.0–3.2) | <0.001 |
| Hospitalization Costs ($1,000) | 13.1 (9.4–19.4) | 39.1 (24.8–65.3) | <0.001 | 11.9 (11.6–12.3) | <0.001 |
Unadjusted outcomes reported as incidence per 100 (mortality and non-home discharge), or median and interquartile range (length of stay and costs). Adjusted outcomes reported as odds ratios or β coefficient for PRF vs. no PRF with 95% confidence intervals.
Fig 2Receiver operating characteristics (2A) and precision recall curves (2B) for logistic regression and XGBoost with complete set of covariates.
Performance metrics for logistic regression and machine learning based models.
| Complete Set | Sparse Set | |||
|---|---|---|---|---|
| Metric | LR (95% CI) | XGBoost (95% CI) | LR (95% CI) | XGBoost (95% CI) |
| AUC | 0.894 (0.892–0.896) | 0.900 (0.899–0.901) | 0.836 (0.833–0.839) | 0.836 (0.833–0.839) |
| Recall | 0.265 (0.261–0.269) | 0.270 (0.268–0.272) | 0.154 (0.151–0.157) | 0.152 (0.150–0.154) |
| Precision | 0.603 (0.597–0.609) | 0.636 (0.631–0.641) | 0.646 (0.637–0.655) | 0.651 (0.644–0.658) |
| Balanced Accuracy | 0.624 (0.622–0.626) | 0.628 (0.627–0.629) | 0.572 (0.571–0.573) | 0.572 (0.571–0.573) |
| Brier Score | 0.058 (0.057–0.059) | 0.057 (0.056–0.058) | 0.063 (0.062–0.064) | 0.063 (0.062–0.064) |
Metrics reported as mean with 95% confidence intervals (95% CI) and obtained via 10-fold cross validation. Complete set refers to the inclusion of all covariates in model development, while sparse set refers to the inclusion of 3 comorbidities (congestive heart failure, neurologic disorder, coagulopathy) and 6 emergency general surgery operative categories. LR, Logistic regression; AUC, Area under the receiver operating characteristic curve; XGBoost, Extreme gradient boosting.
Fig 3Calibration curves for logistic regression and XGBoost for complete (3A) and sparse (3B) feature sets. Complete set refers to the inclusion of all covariates in model development, while sparse set refers to the inclusion of 3 comorbidities (congestive heart failure, neurologic disorder, coagulopathy) and 6 emergency general surgery operative categories.
Fig 4Feature importance of XGBoost model in predicting postoperative respiratory failure with (4A) and without (4B) the inclusion of the Elixhauser Comorbidity Index in models.