| Literature DB >> 32334595 |
Madhu Mazumdar1,2, Jung-Yi Joyce Lin1,2, Wei Zhang3, Lihua Li1,2, Mark Liu2, Kavita Dharmarajan4, Mark Sanderson5, Luis Isola2, Liangyuan Hu6,7.
Abstract
BACKGROUND: The Oncology Care Model (OCM) was developed as a payment model to encourage participating practices to provide better-quality care for cancer patients at a lower cost. The risk-adjustment model used in OCM is a Gamma generalized linear model (Gamma GLM) with log-link. The predicted value of expense for the episodes identified for our academic medical center (AMC), based on the model fitted to the national data, did not correlate well with our observed expense. This motivated us to fit the Gamma GLM to our AMC data and compare it with two other flexible modeling methods: Random Forest (RF) and Partially Linear Additive Quantile Regression (PLAQR). We also performed a simulation study to assess comparative performance of these methods and examined the impact of non-linearity and interaction effects, two understudied aspects in the field of cost prediction.Entities:
Keywords: Generalized linear model; Machine learning; Oncology care model; Quantile regression; Risk-adjustment model
Mesh:
Year: 2020 PMID: 32334595 PMCID: PMC7183716 DOI: 10.1186/s12913-020-05148-y
Source DB: PubMed Journal: BMC Health Serv Res ISSN: 1472-6963 Impact factor: 2.655
Fig. 1Observed versus expected expenses for the OCM model
Model characteristics for Gamma GLM, PLAQR, and Random Forest (RF) models
| Gamma GLM | PLAQR | Random Forest | |
|---|---|---|---|
| Distribution assumption | Parametric | Semi-parametric | Nonparametric |
| Estimate | Mean | Quantile | Mean |
| Ability to model skewed outcome | Yes | Yes | Yes |
| Non-linear effect | Needs to be specified through model diagnostics | Needs to be specified (B-spline) | Data-driven detection; pre-specification not needed |
| Interaction effect | Needs to be specified through model diagnostics | Needs to be specified through model diagnostics | Data-driven detection; pre-specification not needed |
| Software | R, SAS, STATA | R | R, SAS |
Summary statistics for actual expenses from OCM data at Mount Sinai (N = 4205) and simulated distributions
| Mean | Median | 90th percentile | Skewness | Kurtosis | |
|---|---|---|---|---|---|
| OCM data | $27,329 | $20,552 | $61,885 | 1.0 | 0.6 |
| Gamma | $27,023 | $19,378 | $59,058 | 1.5 | 2.3 |
| Weibull | $28,488 | $20,498 | $63,108 | 1.5 | 2.4 |
| Heteroscedastic log-normal | $28,653 | $19,700 | $62,159 | 2.8 | 21.5 |
| Heavy tail | $39,368 | $18,844 | $91,454 | 5.7 | 53.2 |
Fig. 2Actual expenses in Mount Sinai OCM data versus simulated data. Simulated data included distribution from exponential (Gamma and Weibull) and non-exponential families (Heteroscedastic log-normal and heavy-tailed). The red vertical line indicates the 90th percentile for each distribution
Fig. 3a-c. Accuracy of predicted cost in correctly and incorrectly specified models for simulated data. 3A: RMSE, 3B: MAPE, 3C: CA; Performance metrices estimated from 1000 bootstrapped samples. Gamma GLM, PLAQR estimating 50th percentile on original scale, and RF with original scale of cost are shown
Fig. 4a-c. Accuracy of predicted cost in correctly specified models for Mount Sinai OCM data. 4A: RMSE, 4B: MAPE, 4C: CA; Performance metrics estimated from the 1000 bootstrapped samples. Results using Gamma GLM, PLAQR estimating 50th percentile on original scale, and RF with original scale of cost are shown
RMSE and MAPE of Random Forest (RF), Gamma GLM and PLAQR implemented on OCM data in different percentiles
| RMSE | 0 - 20th | 20 - 40th | 40 - 60th | 60 - 80th | 80th - 90th | 90th - 100th | |
|---|---|---|---|---|---|---|---|
| RF | $3661 | $8444 | $8246 | $6583 | $11,174 | $17,411 | |
| Gamma GLM | $5881 | $16,548 | $18,291 | $15.246 | $21,446 | $33,276 | |
| PLAQR | $5312 | $15,333 | $14,826 | $12,715 | $21,995 | $37,253 | |
| MAPE | RF | $2436 | $6048 | $6712 | $5008 | $9044 | $15,406 |
| Gamma GLM | $4098 | $11,033 | $14,362 | $11,501 | $18,477 | $29,234 | |
| PLAQR | $2668 | $10,749 | $11,933 | $9760 | $18,620 | $34,102 |