| Literature DB >> 34068467 |
Samuel Davis1, Sara Nourazari2, Rachel Granovsky3, Nasser Fard1.
Abstract
Identifying patients with a low likelihood of paying their bill serves the needs of patients and providers alike: aligning government programs with their target beneficiaries while minimizing patient frustration and reducing waste among emergency physicians by streamlining the billing process. The goal of this study was to predict the likelihood of patients paying the balance of their emergency department visit bill within 90 days of receipt. Three machine learning methodologies were applied to predict payment: logistic regression, decision tree, and random forest. Models were trained and performance was measured using 1,055,941 patients with non-zero balances across 27 EDs from 1 August 2015 to 31 July 2017. The decision tree accurately predicted 87% of unsuccessful payments, providing significant opportunities to identify patients in need of financial assistance.Entities:
Keywords: Medicaid; emergency department; health equity; healthcare finance; predictive modeling
Year: 2021 PMID: 34068467 PMCID: PMC8150762 DOI: 10.3390/healthcare9050556
Source DB: PubMed Journal: Healthcare (Basel) ISSN: 2227-9032
Twelve categorial features used to model payment results.
| # | Feature | Grouping |
|---|---|---|
| 1 | Physician Only | Yes, No |
| 2 | Primary CPT | 99283, 99284, 99285, 99291, “all other” codes |
| 3 | # of CPT codes | 1, 2+ |
| 4 | Patient Age | 0–17, 18–24, 25–34, 35–44, 45–54, 55–64, 65–79, 80+ |
| 5 | Gender | Female, Male |
| 6 | Out of state | No, Yes |
| 7 | Work phone | No, Yes |
| 8 | Primary insurance | Medicare, Medicaid, Private, Other |
| 9 | Secondary insurance | No, Yes |
| 10 | Relationship | Self, Spouse, Other |
| 11 | Marital status | Single, Married, Other |
| 12 | Patient responsibility | Under USD 25, USD 25–100, USD 100–300, USD 300–600, USD 600+ |
Figure 1Confusion Matrix. Correct predictions are either TP or TN, while incorrect predictions are either FP or FN. Sensitivity is the ratio of TP to all P, which represents the strength of the model at identifying positive results. The specificity is the ratio of TN to all N, which represents the strength of the model at identifying negative results. The PPV represents the likelihood of a positive prediction being correct, while NPV represents the likelihood of a negative prediction being correct.
Feature analysis showing the proportion paid for each level and chi-square test results for each feature.
| Characteristic | Paid | Unpaid | Total | Paid % | Chi-Square Statistic | |
|---|---|---|---|---|---|---|
| 39.0% | ||||||
| Physician | ||||||
| Y | 299,037 | 420,085 | 719,122 | 41.6% | 6143.1 | <0.001 |
| N | 113,172 | 223,647 | 336,819 | 33.6% | ||
| Primary CPT | ||||||
| 99283 | 88,734 | 168,796 | 257,530 | 34.5% | 4201.4 | <0.001 |
| 99284 | 144,804 | 229,637 | 374,441 | 38.7% | ||
| 99285 | 149,148 | 208,494 | 357,642 | 41.7% | ||
| 99291 | 16,327 | 20,699 | 37,026 | 44.1% | ||
| Other | 13,196 | 16,106 | 29,302 | 45.0% | ||
| # of CPT Codes | ||||||
| 1 | 250,777 | 457,637 | 708,414 | 35.4% | 11,966.4 | <0.001 |
| 2+ | 161,432 | 186,095 | 347,527 | 46.5% | ||
| Patient Age | ||||||
| 0–17 | 49,604 | 64,075 | 113,679 | 43.6% | 93,785.6 | <0.001 |
| 18–24 | 38,064 | 96,132 | 134,196 | 28.4% | ||
| 25–34 | 49,603 | 150,315 | 199,918 | 24.8% | ||
| 35–44 | 47,077 | 117,222 | 164,299 | 28.7% | ||
| 45–54 | 60,400 | 103,120 | 163,520 | 36.9% | ||
| 55–64 | 71,525 | 68,523 | 140,048 | 51.1% | ||
| 65–79 | 60,764 | 34,084 | 94,848 | 64.1% | ||
| 80+ | 35,172 | 10,261 | 45,433 | 77.4% | ||
| Gender | ||||||
| Female | 217,199 | 315,202 | 532,401 | 40.8% | 1396.1 | <0.001 |
| Male | 195,010 | 328,530 | 523,540 | 37.2% | ||
| Out of State | ||||||
| N | 367,791 | 587,716 | 955,507 | 38.5% | 1255.7 | <0.001 |
| Y | 44,418 | 56,016 | 100,434 | 44.2% | ||
| Work Phone | ||||||
| N | 353,854 | 552,261 | 906,115 | 39.1% | 0.6 | 0.635 |
| Y | 58,355 | 91,471 | 149,826 | 38.9% | ||
| Primary Insurance | ||||||
| Medicare | 107,134 | 69,267 | 176,401 | 60.7% | 132,260.1 | <0.001 |
| Medicaid | 25,541 | 45,450 | 70,991 | 36.0% | ||
| Private | 232,763 | 279,271 | 512,034 | 45.5% | ||
| Other | 21,662 | 26,278 | 47,940 | 45.2% | ||
| None | 25,109 | 223,466 | 248,575 | 10.1% | ||
| Secondary Insurance | ||||||
| N | 319,321 | 585,117 | 904,438 | 35.3% | 36,874.9 | <0.001 |
| Y | 92,888 | 58,615 | 151,503 | 61.3% | ||
| Relationship | ||||||
| Self | 332,186 | 573,757 | 905,943 | 36.7% | 15,370.9 | <0.001 |
| Spouse | 34,364 | 26,924 | 61,288 | 56.1% | ||
| Other | 45,659 | 43,051 | 88,710 | 51.5% | ||
| Marital Status | ||||||
| Single | 187,185 | 396,415 | 583,600 | 32.1% | 28,347.0 | <0.001 |
| Married | 165,646 | 168,597 | 334,243 | 49.6% | ||
| Other | 59,378 | 78,720 | 138,098 | 43.0% | ||
| Patient Responsibility | ||||||
| <USD 25 | 83,229 | 17,508 | 100,737 | 82.6% | 113,249.8 | <0.001 |
| USD 25–100 | 104,811 | 110,896 | 215,707 | 48.6% | ||
| USD 100–300 | 80,917 | 166,145 | 247,062 | 32.8% | ||
| USD 300–600 | 81,676 | 199,839 | 281,515 | 29.0% | ||
| >USD 600 | 61,576 | 149,344 | 210,920 | 29.2% | ||
LR results, including odds ratio (OR), confidence interval (CI), and p-value for each non-base level by feature.
| Characteristic | OR | CI (95%) | Characteristic | OR | CI (95%) | ||
|---|---|---|---|---|---|---|---|
| Intercept | 0.36 | (0.35–0.37) | <0.001 | Work phone | |||
| Physician | N | base | base | base | |||
| Y | base | base | base | Y | 1.17 | (1.16–1.19) | <0.001 |
| N | 0.90 | (0.89–0.91) | <0.001 | Primary Insurance | |||
| Primary CPT | Medicare | 0.49 | (0.48–0.50) | <0.001 | |||
| 99283 | 0.99 | (0.97–1.00) | 0.146 | Medicaid | 0.91 | (0.89–0.93) | <0.001 |
| 99284 | base | base | base | Private | base | base | base |
| 99285 | 1.08 | (1.07–1.10) | <0.001 | Other | 1.02 | (0.99–1.05) | 0.131 |
| 99291 | 1.00 | (0.97–1.03) | 0.858 | None | 0.23 | (0.22–0.23) | <0.001 |
| Other | 1.13 | (1.09–1.16) | <0.001 | Secondary Insurance | |||
| # of CPT Codes | N | base | base | base | |||
| 1 | base | base | base | Y | 1.66 | (1.64–1.69) | <0.001 |
| 2+ | 1.00 | (0.99–1.02) | 0.580 | Relationship | |||
| Patient Age | Self | base | base | base | |||
| 0–17 | 1.97 | (1.92–2.01) | <0.001 | Spouse | 1.22 | (1.20–1.25) | <0.001 |
| 18–24 | 1.15 | (1.12–1.17) | <0.001 | Other | 1.36 | (1.34–1.39) | <0.001 |
| 25–34 | 0.97 | (0.95–0.99) | <0.001 | Marital Status | |||
| 35–44 | base | base | base | Single | base | base | base |
| 45–54 | 1.30 | (1.28–1.33) | <0.001 | Married | 1.61 | (1.59–1.63) | <0.001 |
| 55–64 | 2.16 | (2.12–2.20) | <0.001 | Other | 0.87 | (0.85–0.88) | <0.001 |
| 65–79 | 3.53 | (3.45–3.62) | <0.001 | Patient Responsibility | |||
| 80+ | 7.64 | (7.39–7.90) | <0.001 | <USD 25 | 7.56 | (7.39–7.74) | <0.001 |
| Gender | USD 25–100 | 1.18 | (1.17–1.20) | <0.001 | |||
| Female | base | base | base | USD 100–300 | base | base | base |
| Male | 0.99 | (0.98–1.00) | 0.004 | USD 300–600 | 1.06 | (1.04–1.07) | <0.001 |
| Out of State | >USD 600 | 1.01 | (1.00–1.03) | 0.138 | |||
| N | base | base | base | ||||
| Y | 1.34 | (1.32–1.36) | <0.001 | ||||
Figure 2Abridged DT results; the complete tree has 131 nodes, of which 66 are terminal.
Overall model results. RF performed the best in all metrics except specificity, where DT performed best.
| Experiment | Accuracy | Sensitivity | Specificity | PPV | NPV |
|---|---|---|---|---|---|
|
| 71.3% | 49.7% | 85.3% | 68.6% | 72.4% |
|
| 71.7% | 47.9% | 87.2% | 70.7% | 72.1% |
|
| 72.4% | 49.9% | 86.9% | 71.1% | 72.9% |