| Literature DB >> 34592712 |
Raghav Ramachandran1, Michael J McShea1, Stephanie N Howson1, Howard S Burkom1, Hsien-Yen Chang2, Jonathan P Weiner2, Hadi Kharrazi2.
Abstract
BACKGROUND: A high proportion of health care services are persistently utilized by a small subpopulation of patients. To improve clinical outcomes while reducing costs and utilization, population health management programs often provide targeted interventions to patients who may become persistent high users/utilizers (PHUs). Enhanced prediction and management of PHUs can improve health care system efficiencies and improve the overall quality of patient care.Entities:
Keywords: comorbidity patterns; health care; health care costs; health care services; latent class analysis; persistent high users; persistent high utilizers; population health analytics; prediction models; unsupervised clustering; utilization prediction
Year: 2021 PMID: 34592712 PMCID: PMC8663459 DOI: 10.2196/31442
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Selection process of the study population. JHHC: Johns Hopkins Health Care; EDC: expanded diagnostic cluster.
Characteristics of the study populations.
| Characteristic | Overall study population (N=164,221) | Non-PHUa population (n=155,862) | PHU population (n=8359) | |
|
| ||||
|
| 0-17 | 100,811 (61.4) | 99,352 (63.7) | 1459 (17.5) |
|
| 18-64 | 62,396 (38.0) | 55,666 (35.7) | 6730 (80.5) |
|
| 65+ | 1014 (0.6) | 844 (0.5) | 170 (2.0) |
| Age (years), mean (SD) | 19.79 (17.43) | 18.79 (16.82) | 38.51 (18.01) | |
| Male, n (%) | 72,418 (44.1) | 69,683 (44.7) | 2735 (32.7) | |
|
| ||||
|
| White | 41,219 (25.1) | 38,762 (24.9) | 2,457 (29.4) |
|
| Black | 53,872 (32.8) | 50,993 (32.7) | 2,879 (34.4) |
|
| Otherb | 149 (0.1) | 143 (0.1) | 6 (0.1) |
|
| Missingc | 68,981 (42.0) | 65,964 (42.3) | 3017 (36.1) |
|
| ||||
|
| 0 | 158,763 (96.7) | 151,971 (97.5) | 6792 (81.3) |
|
| 1-5 | 5,366 (3.3) | 3,866 (2.5) | 1500 (17.9) |
|
| 6-10 | 74 (<0.1) | 20 (<0.1) | 54 (0.6) |
|
| 11+ | 18 (<0.1) | 5 (<0.1) | 13 (0.2) |
|
| ||||
|
| 0 | 3,690 (2.2) | 3,663 (2.4) | 27 (0.3) |
|
| 1-5 | 95,372 (58.1) | 94,138 (60.4) | 1234 (14.8) |
|
| 6-10 | 33,745 (20.5) | 32,317 (20.7) | 1428 (17.1) |
|
| 11+ | 31,414 (19.1) | 25,744 (16.5) | 5670 (67.8) |
aPHU: persistent high users.
b“Other”describes members of known race/ethnicity not equal to Asian, Hispanic, White, or Black.
c“Missing” describes members with empty values for race.
Model fit statistics for latent class analysis models with 2 to 6 classes (N=164,221).
| Model | G2a | AICb | BICc |
| 2-class model | 5,487,702 | 9,113,315 | 9,116,888 |
| 3-class model | 5,213,964 | 8,839,935 | 8,845,300 |
| 4-class model | 5,088,223 | 8,714,552 | 8,721,708 |
| 5-class model | 4,934,192 | 8,560,878 | 8,569,826 |
| 6-class model | 4,874,634 | 8,501,679 | 8,512,419 |
aG2: likelihood ratio/deviance statistic.
bAIC: Akaike information criterion.
cBIC: Bayesian information criterion.
Figure 2Latent class item-response probabilities for the full population (N=164,221).
Figure 3Latent class item-response probabilities for the otitis media subpopulation (n=24,992).
Figure 6Latent class item-response probabilities for the acute upper respiratory infection subpopulation (n=53,232).
Figure 4Latent class item-response probabilities for the mental health subpopulation (n=34,456).
Figure 5Latent class item-response probabilities for the musculoskeletal subpopulation (n=24,799).
Comparing classification metrics for predicting persistent high user/utilizer (PHU) status.
| Metric | Full study population (N=164,221) | Otitis media (n=24,992) | Mental health (n=34,456) | MSKa (n=24,799) | acute URIb (n=53,232) | ||||
|
| ACGc | LCA-LRMd | LCA-LRM | LCA-LRM | LCA-LRM | LCA-LRM | |||
| PPVe (%) | 48.60 | 38.53 | 44.40 | 39.91 | 42.74 | 41.28 | |||
| Sensitivity (%) | 47.90 | 38.72 | 47.23 | 62.43 | 55.14 | 44.99 | |||
| F1-score (%) | 48.20 | 38.62 | 45.77 | 48.69 | 48.15 | 43.05 | |||
| Percentile (threshold) | 95th (0.33) | 95th (0.33) | 95th (0.18) | 80th (0.25) | 95th (0.53) | 95th (0.23) | |||
| PHUs (%) | 5.1 | 5.1 | 4.5 | 12.8 | 15.5 | 4.6 | |||
aMSK: musculoskeletal.
bURI: upper respiratory infection.
cACG: Adjusted Clinical Groups; latent class analysis results not included in the model.
dLCA-LRM: latent class analysis-logistic regression model; latent class probabilities included as predictors in the model.
ePPV: positive predictive value.