| Literature DB >> 32357871 |
Jeremy A Irvin1, Andrew A Kondrich2, Michael Ko3, Pranav Rajpurkar2, Behzad Haghgoo2, Bruce E Landon4,5, Robert L Phillips6, Stephen Petterson7, Andrew Y Ng2, Sanjay Basu5,8,9.
Abstract
BACKGROUND: Risk adjustment models are employed to prevent adverse selection, anticipate budgetary reserve needs, and offer care management services to high-risk individuals. We aimed to address two unknowns about risk adjustment: whether machine learning (ML) and inclusion of social determinants of health (SDH) indicators improve prospective risk adjustment for health plan payments.Entities:
Keywords: Machine learning; Risk estimation; Social determinants of health
Year: 2020 PMID: 32357871 PMCID: PMC7195714 DOI: 10.1186/s12889-020-08735-0
Source DB: PubMed Journal: BMC Public Health ISSN: 1471-2458 Impact factor: 3.295
Fig. 1Data Set Selection Flow Diagram. Administrative claims data was obtained from a single national private insurer. The dotted arrow means predictors were optionally incorporated but no members were added or excluded
Statistics of ZIP Code-Level Social Determinants of Health
| SDH Variable | Mean [Median] (Std) |
|---|---|
| Median Income in the Past 12 Months, $ | 26,546 [25727] (6230) |
| Families Under 0.5 Ratio of Income to Poverty Level in the Past 12 Months, % | 4.7 [4.3] (2.7) |
| Families Between 0.5 and 0.74 Ratio of Income to Poverty Level in the Past 12 Months, % | 3.2 [3.0] (1.7) |
| Families Between 0.75 and 0.99 Ratio of Income to Poverty Level in the Past 12 Months, % | 3.6 [3.4] (1.6) |
| Families Received Food Stamps/Snap in the Past 12 months, % | 14.2 [13.6] (6.6) |
| Population Unemployed, % | 5.4 [5.2] (1.9) |
| Gini Index of Income Inequality | 45.2 [45.1] (3.6) |
| Population Obtained High School Diploma, % | 43.0 [42.9] (4.8) |
| Population Obtained Bachelor’s Degree, % | 16.1 [15.2] (6.2) |
| Population Speak English Less than “Very Well”, % | 10.5 [5.6] (12.5) |
| Families with Single Parent, % | 22.9 [22.7] (6.2) |
| Population Without Health Insurance Coverage, % | 11.3 [10.6] (4.9) |
| Population African American, % | 9.9 [4.5] (13.5) |
| Population Asian, % | 2.9 [1.2] (4.7) |
| Population American Indian and Alaska Native, % | 1.5 [0.3] (6.4) |
| Population Hispanic or Latino, % | 11.9 [5.5] (15.8) |
| Population White, % | 71.8 [77.7] (22.3) |
SDH variables were obtained from the 2012–2016 American Community Survey 5-year estimates from the U.S. Census Bureau
Characteristics of Members in the Dataset Subsets
| Characteristic | Training Set | Test Set |
|---|---|---|
| Members Total, No. | 1,058,479 | 117,616 |
| Female Total, No. (%) | 517,364 (48.9%) | 57,469 (48.9%) |
| Members from ZIP codes without measured SDH variablesa, No. (%) | 1074 (0.1%) | 115 (0.1%) |
| Population statistics, mean [median] (SD) | ||
| Age, y | 41.1 [41.0] (13.1) | 41.1 [41.0] (13.1) |
| 2017 Annual Cost, $ | 6946 [861] (28,240) | 6868 [855] (27,826) |
| 2017 Top-coded Annual Costb, $ | 6762 [861] (23,822) | 6677 [855] (23,536) |
The training set was used to develop the models and the test set was used to evaluate the models
aThe SDH variables of these members were imputed with the median values of SDH variables over all ZIP codes, and an additional indicator variable was used to identify whether members fall into this category
bStatistics of cost when top-coding at $400,000 (values higher than $400,000 were replaced with $400,000)
Performance Measures of the Prospective Linear and Machine Learning Models on the Test Set
| Evaluation Metric | No SDH | SDH |
|---|---|---|
| R2 (95% CI)a | ||
| Linear | 0.327 (0.300, 0.353) | 0.327 (0.300, 0.354) |
| ML | 0.388 (0.357, 0.420) | 0.387 (0.357, 0.419) |
| MAE (95% CI)b | ||
| Linear | 6992 (6889, 7094) | 6991 (6889, 7094) |
| ML | 6637 (6539, 6735) | 6634 (6536, 6732) |
| C-statistic (95% CI)c | ||
| Linear | 0.703 (0.701, 0.705) | 0.700 (0.699, 0.702) |
| ML | 0.717 (0.715, 0.718) | 0.716 (0.714, 0.717) |
Comparison of performance measures between linear regression and machine learning prospective risk adjustment models, predicting 2017 yearly top-coded spending from 2016 characteristics. The SDH model additionally includes SDH variables obtained from U.S. Census data (see Table 1)
aConfidence intervals for R2 were constructed using the nonparametric bootstrap [21]
bConfidence intervals for MAE were constructed using a paired t-test
cConfidence intervals for C-statistic were constructed using a jackknife procedure [25]
Predictive Ratio and Net Compensation Values of Prospective Machine Learning Models on SDH-Based Subgroups in the Test Set
| Model Predictive Ratiob and Net Compensationc | ||||
|---|---|---|---|---|
| Subgroup | No. (%) | 2017 Spending ($)a | ML (95% CI) | ML with SDH (95% CI) |
| Total | 117,616 (100) | 6677 | 1.000 (0.976, 1.024) | 1.000 (0.976, 1.024) |
| 0 (− 105, 105) | 0 (− 105, 105) | |||
| Poverty | ||||
| Median Income in the Past 12 Months, $ | 4923 (4.2) | 10,818 | 1.017 (0.915, 1.120) | 1.006 (0.905, 1.108) |
| − 183 (− 836, 470) | −67 (− 729, 595) | |||
| Families Under 0.5 Ratio of Income to Poverty Level in the Past 12 Months, % | 7932 (6.7) | 9344 | 0.966 (0.882, 1.050) | 0.948 (0.865, 1.031) |
| 331 (− 138, 801) | 510 (33, 987) | |||
| Families Between 0.5 and 0.74 Ratio of Income to Poverty Level in the Past 12 Months, % | 6651 (5.7) | 8952 | 1.010 (0.912, 1.108) | 0.988 (0.892, 1.084) |
| −89 (− 599, 420) | 109 (− 408, 627) | |||
| Families Between 0.75 and 0.99 Ratio of Income to Poverty Level in the Past 12 Months, % | 7194 (6.1) | 9395 | 1.052 (0.956, 1.148) | 1.010 (0.919, 1.101) |
| − 467 (− 977, 43) | −94 (− 613, 425) | |||
| Families Received Food Stamps/Snap in the Past 12 months, % | 9009 (7.7) | 9001 | 1.028 (0.941, 1.115) | 0.996 (0.912, 1.079) |
| − 247 (− 684, 191) | 39 (− 409, 487) | |||
| Population Unemployed, % | 10,278 (8.7) | 7055 | 0.961 (0.886, 1.036) | 0.957 (0.882, 1.032) |
| 289 (−71, 649) | 316 (−51, 683) | |||
| Gini Index of Income Inequality | 16,155 (13.7) | 6138 | 1.054 (0.985, 1.122) | 1.021 (0.955, 1.087) |
| − 312 (− 578, −46) | − 126 (− 393, 140) | |||
| Education | ||||
| Population Obtained High School Diploma, % | 9482 (8.1) | 7555 | 0.987 (0.900, 1.073) | 0.974 (0.889, 1.058) |
| 102 (− 324, 529) | 205 (− 227, 637) | |||
| Population Obtained Bachelor’s Degree, % | 4169 (3.5) | 11,338 | 1.032 (0.923, 1.142) | 1.027 (0.917, 1.136) |
| −353 (− 1139, 433) | − 294 (− 1080, 492) | |||
| Other | ||||
| Population Speak English Less than “Very Well”, % | 23,659 (20.1) | 5453 | 1.023 (0.963, 1.083) | 0.989 (0.932, 1.046) |
| − 124 (− 346, 98) | 61 (−161, 283) | |||
| Families with Single Parent, % | 9097 (7.7) | 9880 | 0.993 (0.910, 1.076) | 0.978 (0.896, 1.060) |
| 65 (− 397, 527) | 224 (− 246, 693) | |||
| Population Without Health Insurance Coverage, % | 13,656 (11.6) | 8333 | 1.066 (0.990, 1.142) | 0.990 (0.921, 1.059) |
| − 516 (− 885, − 147) | 83 (− 287, 454) | |||
Comparison of machine learning prospective risk adjustment models without and with the addition of SDH indicators as predictors (see Table 1 for a complete list of SDH indicators). The predictions for each model were adjusted so that the mean of the predictions over the total test population was equal to the mean of the actual costs, resulting in a predictive ratio of exactly 1.0 over the total test set population. Subgroups were composed of members in the lowest decile of ZIP codes with respect to the corresponding SDH variable (see Supplementary Information Table S1). Only socioeconomic variables are considered in this subgroup analysis, and results on age and sex subgroups are shown in the Supplementary Information
aSpending included all healthcare utilization in 2017 of members with full enrollment in 2016 and 2017. Values larger than $400,000 were replaced with $400,000
bPredictive ratio for a subgroup was computed as the ratio of the mean of observed to the mean of predicted spending over the subgroup. Approximate confidence intervals for predictive ratios were computed with the delta method [40]
cNet compensation for a subgroup was computed as the mean difference between predicted and observed spending in the subgroup. Confidence intervals were estimated using a paired t-test