Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Machine learning versus regression modelling in predicting individual healthcare costs from a representative sample of the nationwide claims database in France.

Literature DB >> 34373958

Machine learning versus regression modelling in predicting individual healthcare costs from a representative sample of the nationwide claims database in France.

Alexandre Vimont^1,2, Henri Leleu³, Isabelle Durand-Zaleski⁴.

Abstract

BACKGROUND: Innovative provider payment methods that avoid adverse selection and reward performance require accurate prediction of healthcare costs based on individual risk adjustment. Our objective was to compare the performances of a simple neural network (NN) and random forest (RF) to a generalized linear model (GLM) for the prediction of medical cost at the individual level.
METHODS: A 1/97 representative sample of the French National Health Data Information System was used. Predictors selected were: demographic information; pre-existing conditions, Charlson comorbidity index; healthcare service use and costs. Predictive performances of each model were compared through individual-level (adjusted R-squared (adj-R2), mean absolute error (MAE) and hit ratio (HiR)), and distribution-level metrics on different sets of covariates in the general population and by pre-existing morbid condition, using a quasi-Monte Carlo design.
RESULTS: We included 510,182 subjects alive on 31st December, 2015. Mean annual costs were 1894€ (standard deviation 9326€) (median 393€, IQ range 95€; 1480€), including zero-claim subjects. All models performed similarly after adjustment on demographics. RF model had better performances on other sets of covariates (pre-existing conditions, resource counts and past year costs). On full model, RF reached an adj-R2 of 47.5%, a MAE of 1338€ and a HiR of 67%, while GLM and NN had an adj-R2 of 34.7% and 31.6%, a MAE of 1635€ and 1660€, and a HiR of 58% and 55 M, respectively. RF model outperformed GLM and NN for most conditions and for high-cost subjects.
CONCLUSIONS: RF should be preferred when the objective is to best predict medical costs. When the objective is to understand the contribution of predictors, GLM was well suited with demographics, conditions and base year cost.

Entities: Chemical

Keywords: Cost containment; Healthcare costs; Healthcare management; Machine learning; Neural network; Predictive analytics; Random forest

Mesh：

Year: 2021 PMID： 34373958 DOI： 10.1007/s10198-021-01363-4

Source DB: PubMed Journal: Eur J Health Econ ISSN： 1618-7598

Keyword Cloud
References

18 in total

1. Regression Trees Identify Relevant Interactions: Can This Improve the Predictive Performance of Risk Adjustment?

Authors: Florian Buchner; Jürgen Wasem; Sonja Schillo
Journal: Health Econ Date: 2015-10-26 Impact factor: 3.046

2. Optimal quality reporting in markets for health plans.

Authors: Jacob Glazer; Thomas G McGuire
Journal: J Health Econ Date: 2005-12-13 Impact factor: 3.883

3. Risk-adjusted capitation payment systems for health insurance plans in a competitive market.

Authors: Leida M Lamers; Rene Cja van Vliet; Wynand Pmm van de Ven
Journal: Expert Rev Pharmacoecon Outcomes Res Date: 2003-10 Impact factor: 2.217

4. Exploring the predictive power of interaction terms in a sophisticated risk equalization model using regression trees.

Authors: S H C M van Veen; R C van Kleef; W P M M van de Ven; R C J A van Vliet
Journal: Health Econ Date: 2017-05-23 Impact factor: 3.046

5. A Machine Learning Framework for Plan Payment Risk Adjustment.

Authors: Sherri Rose
Journal: Health Serv Res Date: 2016-02-19 Impact factor: 3.402

6. Alternative evaluation metrics for risk adjustment methods.

Authors: Sungchul Park; Anirban Basu
Journal: Health Econ Date: 2018-03-26 Impact factor: 3.046

7. How Does Risk Selection Respond to Risk Adjustment? New Evidence from the Medicare Advantage Program.

Authors: Jason Brown; Mark Duggan; Ilyana Kuziemko; William Woolston
Journal: Am Econ Rev Date: 2014-10

8. Demand elasticities and service selection incentives among competing private health plans.

Authors: Randall P Ellis; Bruno Martins; Wenjia Zhu
Journal: J Health Econ Date: 2017-12 Impact factor: 3.883

9. A quasi-Monte-Carlo comparison of parametric and semiparametric regression methods for heavy-tailed and non-normal data: an application to healthcare costs.

Authors: Andrew M Jones; James Lomas; Peter T Moore; Nigel Rice
Journal: J R Stat Soc Ser A Stat Soc Date: 2015-10-15 Impact factor: 2.483

10. Machine learning approaches for predicting high cost high need patient expenditures in health care.

Authors: Chengliang Yang; Chris Delcher; Elizabeth Shenkman; Sanjay Ranka
Journal: Biomed Eng Online Date: 2018-11-20 Impact factor: 2.819