| Literature DB >> 29881759 |
Lucas W Thornblade1, David R Flum1, Abraham D Flaxman1.
Abstract
BACKGROUND: Recurrent diverticulitis is the most common reason for elective colon surgery and, although professional societies now recommend against early resection, its use continues to rise. Shared decision making decreases use of low-value surgery but identifying which patients are most likely to elect surgery has proven difficult. We hypothesized that Machine Learning algorithms using health care utilization (HCU) data can predict future clinical events including early resection for diverticulitis. STUDYEntities:
Keywords: Colorectal Surgery; Decision Making; Diverticulitis; General Surgery; Patient Preference
Year: 2018 PMID: 29881759 PMCID: PMC5983027 DOI: 10.5334/egems.193
Source DB: PubMed Journal: EGEMS (Wash DC) ISSN: 2327-9214
Figure 1Timeline of health care utilization data in relation to diagnosis of diverticulitis and corresponding periods A–D.
Demographics and utilization parameters of patients in the lowest and highest quintiles for likelihood of undergoing future elective surgery for diverticulitis from the Gradient Boosting Machine algorithm.
| All patients (n = 84,791) | Lowest Likelihood Quintile (n = 16,958) | Highest Likelihood Quintile (n = 16,958) | p-value | |
|---|---|---|---|---|
| age, mean ± SD | 51.0 ± 8.6 | 51.3 ± 8.8 | 50.0 ± 8.7 | |
| female, % | 52.4% | 54.9% | 53.5% | 0.18 |
| Elective surgery, n (%) | 947 (1.2%) | 5 (0.3%) | 49 (3.0) | |
| Days in health care, mean ± SD | 57.2 ± 46.9 | 57.0 ± 45.8 | 58.2 ± 46.0 | 0.34 |
| ER visits, mean ± SD | 1.1 ± 2.1 | 0.8 ± 2.0 | 1.8 ± 2.3 | |
| Outpatient visits, mean ± SD | 33.9 ± 30.4 | 34.3 ± 29.6 | 34.6 ± 31.1 | 0.96 |
| Unique diagnoses, mean ± SD | 29.8 ± 18.9 | 29.3 ± 18.7 | 30.8 ± 18.2 | |
| Prescriptions, mean ± SD | 43.4 ± 51.8 | 42.2 ± 50.3 | 43.4 ± 50.2 | 0.25 |
| Diverticulitis claims, mean ± SD | 7.6 ± 11.0 | 1.4 ± 1.4 | 22.9 ± 14.2 | |
| Total health care costs, median (Interquartile Range) | – | $25,298 | $44,475 | |
*For all health care data from 104 weeks prior to diagnosis until 24 weeks following diagnosis of diverticulitis.
Figure 2Receiver operating curves (ROC) of three machine learning algorithms using data from 104 weeks before to 24 weeks following diagnosis of diverticulitis: Gradient Boosting Machine, Random Forest, and Penalized Logistic Regression. Gray shading indicates 95% Uncertainty Interval.
Figure 3Receiver operating curves for Gradient Boosting Machine performance using data from Period A (104 weeks) Period B (116 weeks) Period C (128 weeks), and Period D (152 weeks). Gray shading indicates 95% Uncertainty Interval.
Figure 4Trends of model performance with the addition of HCU data after diagnosis for four machine learning algorithms including Gradient Boosting Machine, Random Forest, and Penalized Logistic Regression. Gray shading indicates 95% Uncertainty Interval.