| Literature DB >> 33090223 |
Alejandro Schuler1, Liam O'Súilleabháin1, Gina Rinetti-Vargas1, Patricia Kipnis1,2, Fernando Barreda1, Vincent X Liu1,3, Oleg Sofrygin1, Gabriel J Escobar1.
Abstract
Importance: Prediction models are widely used in health care as a way of risk stratifying populations for targeted intervention. Most risk stratification has been done using a small number of predictors from insurance claims. However, the utility of diverse nonclinical predictors, such as neighborhood socioeconomic contexts, remains unknown. Objective: To assess the value of using neighborhood socioeconomic predictors in the context of 1-year risk prediction for mortality and 6 different health care use outcomes in a large integrated care system. Design, Setting, and Participants: Diagnostic study using data from all adults age 18 years or older who had Kaiser Foundation Health Plan membership and/or use in the Kaiser Permantente Northern California: a multisite, integrated health care delivery system between January 1, 2013, and June 30, 2014. Data were recorded before the index date for each patient to predict their use and mortality in a 1-year post period using a test-train split for model training and evaluation. Analyses were conducted in fall of 2019. Main Outcomes and Measures: One-year encounter counts (doctor office, virtual, emergency department, elective hospitalizations, and nonelective), total costs, and mortality.Entities:
Mesh:
Year: 2020 PMID: 33090223 PMCID: PMC7582126 DOI: 10.1001/jamanetworkopen.2020.17109
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Variables Included in Each of the Predictor Categories
| Variablea | Total number of variables |
|---|---|
| Administrative | 123 |
| Age | 1 |
| Sex | 1 |
| Diagnosis cost group (predicted cost score) | 4 |
| RCCs | 117 |
| EHR | 18 |
| HbA1c | 4 |
| BMI | 5 |
| abLAPS score | 4 |
| COPS2 | 4 |
| Online patient portal registration status (yes/no) | 1 |
| nSES | 27 |
| NDI | 1 |
| Transportation | 6 |
| Housing | 2 |
| Job availability | 2 |
| Food access | 2 |
| Crime | 2 |
| Environmental factors | 11 |
| Walkability | 1 |
Abbreviations: abLAPS, laboratory-based acute physiology score; BMI, body mass index; COPS2, Comorbidity Point Score, version 2; EHR, electronic health record; NDI, Neighborhood Deprivation Index; nSES, neighborhood socioeconomic status; RCCs, related clinical conditions.
The provenance, meaning, and summary statistics for each of the nSES variables are described in the eAppendix in the Supplement.
Summary Statistics for the Test Cohort
| Variable | Mean (SD) | |
|---|---|---|
| Preperiod | Postperiod | |
| Population, total No. | 1 475 559 | 1 465 343 |
| Age, y | 47.20 (17.39) | 47.00 (17.24) |
| Male sex, % | 48 | 48 |
| Charlson score | 0.49 (1.24) | 0.47 (1.19) |
| COPS2 | 13.16 (12.94) | 12.88 (12.08) |
| abLAPS | 1.94 (4.50) | 1.88 (4.34) |
| Diabetes (RCC 7), % | 7.5 | 8.1 |
| HbA1c | 4.98 (0.62) | 4.98 (0.61) |
| CHF (RCC 59), % | 1.5 | 1.7 |
| COPD (RCC 77), % | 7.8 | 8.9 |
| Invasive cancer (RCCs 2,3,4), % | 1.7 | 1.8 |
| CKD (RCC 87), % | 4.5 | 4.9 |
| Major infection (sepsis, community-acquired pneumonia, RCC 1), % | 8.2 | 8.7 |
| BMI | 25.43 (6.40) | 25.43 (6.40) |
| NDI | −0.16 (0.84) | −0.16 (0.84) |
| In-person outpatient visits | 3.59 (5.89) | 3.50 (5.89) |
| Virtual visits | 3.30 (6.89) | 3.25 (6.86) |
| ED visits | 0.24 (0.83) | 0.24 (0.87) |
| ED visits (≥4), % | 0.93 | 1 |
| Elective hospitalization | 0.03 (0.21) | 0.03 (0.21) |
| Elective hospitalization (≥2), % | 0.32 | 0.32 |
| Elective hospitalization (ever), % | 2.7 | 2.9 |
| Nonelective hospitalization | 0.04 (0.26) | 0.04 (0.28) |
| Nonelective hospitalization (≥2), % | 0.56 | 0.66 |
| Nonelective hospitalization (ever), % | 2.7 | 2.9 |
| Cost | 4054.88 (14449.06) | 4254.68 (16878.31) |
| ≥$15 000, % | 5.9 | 6 |
| ≥$100 000, % | 0.28 | 0.35 |
Abbreviations: abLAPS, laboratory-based acute physiology score; BMI, body mass index (calculated as weight in kilograms divided by height in meters squared); CHF, congestive heart failure; CKD, chronic kidney disease; COPD, chronic obstructive pulmonary disease; COPS2, Comorbidity Point Score, version 2; ED, emergency department; HbA1c, hemoglobin A1c; NDI, Neighborhood Deprivation Index; RCC, related clinical conditions.
SI conversion factor: To convert HbA1C to proportion of total hemoglobin, multiply by 0.01.
Figure. Performance Measures of All Models Across All Predictors Sets and Outcomes
Higher is better for all measures except Brier score. AUPRC indicates area under the precision-recall curve; AUROC, area under the receiver operator curve; EHR, electronic health record; LASSO, least absolute shrinkage and selection operator; nSES, neighborhood socioeconomic status.
Area Under the Receiver Operator Curve of the Best Model (Neural Network) Across All Outcomes and Predictor Setsa
| Model | Cost | Mortality | Office visits | Emergency department | Nonelective hospital | Elective hospital | Virtual visits |
|---|---|---|---|---|---|---|---|
| Administrative | 0.85 | 0.94 | 0.83 | 0.73 | 0.85 | 0.79 | 0.86 |
| +nSES | 0.84 | 0.94 | 0.83 | 0.73 | 0.85 | 0.79 | 0.86 |
| +HER | 0.85 | 0.94 | 0.83 | 0.73 | 0.86 | 0.79 | 0.86 |
| +EHR +nSES | 0.84 | 0.94 | 0.83 | 0.73 | 0.86 | 0.79 | 0.86 |
Abbreviations: +nSES, plus neighborhood socioeconomic status; +EHR, plus electronic health records; +EHR +nSES, plus both EHR and nSES.
Cross-validation standard errors for all figures in this table are below 0.001 due to the size of the data. A full table of performance measures for all methods, predictor sets, and outcomes is available in the eAppendix in the Supplement.