| Literature DB >> 29295255 |
Cheng Gao1, Abel N Kho2, Catherine Ivory3, Sarah Osmundson4, Bradley A Malin1, You Chen1.
Abstract
Obstetric care refers to the care provided to patients during ante-, intra-, and postpartum periods. Predicting length of stay (LOS) for these patients during their hospitalizations can assist healthcare organizations in allocating hospital resources more effectively and efficiently, ultimately improving maternal care quality and reducing costs to patients. In this paper, we investigate the extent to which LOS can be forecast from a patient's medical history. We introduce a machine learning framework to incorporate a patient's prior conditions (e.g., diagnostic codes) as features in a predictive model for LOS. We evaluate the framework with three years of historical billing data from the electronic medical records of 9188 obstetric patients in a large academic medical center. The results indicate that our framework achieved an average accuracy of 49.3%, which is higher than the baseline accuracy 37.7% (that relies solely on a patient's age). The most predictive features were found to have statistically significant discriminative ability. These features included billing codes for normal delivery (indicative of shorter stay) and antepartum hypertension (indicative of longer stay).Entities:
Keywords: Electronic Health Records; Length of Stay; Obstetrics
Mesh:
Year: 2017 PMID: 29295255 PMCID: PMC5860660
Source DB: PubMed Journal: Stud Health Technol Inform ISSN: 0926-9630
Figure 1The process by which the LOS predictive model is composed and discriminative features are discovered.
Summary Statistics for ICD-9 codes, age and LOS in the 2010–2011 period
| # of ICD-9 codes | Age | LOS | |||
|---|---|---|---|---|---|
| 1 year | 2 years | 3 years | |||
| 4.3 | 5.5 | 6.4 | 31.8 | 72 | |
| 1 | 1 | 1 | 14 | 1.6 | |
| 60 | 80 | 90 | 68 | 1311 | |
Figure 2LOS Frequency distribution for study subjects.
Comparative Models
| Model | Description |
|---|---|
| Age | |
| One year of ICD-9 codes | |
| Two years of ICD-9 codes | |
| Three years of ICD-9 codes |
Model evaluation strategies
| Strategy | # Trees | % Predictors | |
|---|---|---|---|
| A | Vary | Constant | Constant |
| B | Constant | Vary | Constant |
| C | Constant | Constant | Vary |
Figure 3The accuracy of models as a function of the number of trees in the random forest.
Figure 4Model accuracy as a function of the number of features retained.
Figure 5Model accuracy as a function of the number of decision trees. (τ = 12 hours; predictor set = top 10%)
Figure 7Model accuracy as a function of percent of predictors. (τ = 12 hours; Number of trees = 50).
Figure 6Model accuracy as a function of the LOS threshold. (Number of trees = 50; Predictor set = top 10%).
Model comparison for all patients and patients with LOS ≥ 96
| Model | Accuracy | |
|---|---|---|
| All patients (n = 7683) | LOS ≥ 96 hours (n = 1505) | |
| 37.7% | 0.00% | |
| 49.3% | 5.8% | |
Summary for the top 10 most predictive ICD-9 codes
| Importance Rank | ICD-9 code | Description | # of patients (with code vs. without code) | Mean LOS (with code vs. without code) | p-value |
|---|---|---|---|---|---|
| 1 | 650 | Normal delivery | 2281 vs. 6907 | 58.1 vs. 76.5 | <0.001 |
| 2 | 659.63 | Elderly multigravida with antepartum condition or complication | 1153 vs. 8035 | 81.0 vs. 70.6 | <0.001 |
| 3 | V28.81 | Encounter for fetal anatomic survey | 1899 vs. 7298 | 75.3 vs. 71.1 | 0.001 |
| 4 | 285.9 | Unspecified anemia | 343 vs. 8845 | 80.5 vs. 71.6 | 0.044 |
| 5 | V28.89 | Other specified antenatal screening | 1580 vs. 7608 | 73.7 vs. 71.6 | 0.043 |
| 6 | V28.4 | Antenatal screening for fetal growth retardation using ultrasonics | 384 vs. 8804 | 85.5 vs. 71.3 | <0.001 |
| 7 | V23.9 | Unspecified high-risk pregnancy | 531 vs. 8657 | 89.3 vs. 70.9 | <0.001 |
| 8 | V23.82 | Supervision of high-risk pregnancy of elderly multigravida | 137 vs. 9051 | 85.4 vs. 71.7 | 0.035 |
| 9 | 642.93 | Unspecified hypertension antepartum | 89 vs. 9099 | 109.8 vs. 71.6 | <0.001 |
| 10 | V76.2 | Screening for malignant neoplasm of the cervix | 582 vs. 8606 | 76.5 vs. 71.6 | 0.008 |