| Literature DB >> 27742603 |
Alvin Rajkomar1, Joanne Wing Lan Yim, Kevin Grumbach, Ami Parekh.
Abstract
BACKGROUND: Characterizing patient complexity using granular electronic health record (EHR) data regularly available to health systems is necessary to optimize primary care processes at scale.Entities:
Keywords: ambulatory care; health care economics and organizations; machine learning; medical informatics; patient acceptance of health care; primary health care; risk adjustment
Year: 2016 PMID: 27742603 PMCID: PMC5086026 DOI: 10.2196/medinform.6530
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Flowchart of data from the electronic health record to the algorithm. PCWC: Primary care work cluster.
Figure 2Flowchart of decision rules and clustering algorithms that demonstrate how patients were classified into different utilization phenotypes and primary work group clusters.
Figure 3The fractions of all patients assigned to the primary care work clusters in 4 selected clinics and their unweighted and weighted panel sizes. The distribution of patients across clusters was unique to each clinic, and because each cluster is weighted differently, the difference between weighted and unweighted panel sizes differed for each clinic as well. The geriatric clinic, which has 41% of its population assigned to the high work cluster, had a weighted panel size that was more than twice the unweighted size.
Figure 4Equations that define how scaling factor w was defined. We constrain the total weighted population size (the right hand side) to be equal to the total unweighted population size in (a). We solve for w in (b) PCP: primary care physician.
Patient characteristics of each utilization phenotypes (inactive through group D) in the training set (N=24,324).
| Characteristics | Utilization phenotype | ||||
| Inactive | A | B | C | D | |
| Size of group (n) | 3986 | 5343 | 6991 | 3000 | 2452 |
| Age, years, mean (SD) | 41.9 (17.3) | 47.7 (14.7) | 53.7 (16.8) | 56.6 (16.4) | 59.9 (17.3) |
| Male, n (%) | 1551 (38.9) | 2057 (38.5) | 2678 (38.3) | 1083 (36.1) | 922 (37.6) |
| White, n (%) | 1814 (45.5) | 2875 (53.8) | 3293 (47.1) | 1635 (54.5) | 1324 (54) |
| Asian, n (%) | 694 (17.4) | 1095 (20.5) | 1734 (24.8) | 675 (22.5) | 596 (24.3) |
| Black, n (%) | 379 (9.5) | 289 (5.4) | 587 (8.4) | 222 (7.4) | 184 (7.5) |
| Commercial, n (%) | 2738 (68.7) | 4266 (79.9) | 4348 (62.2) | 1731 (57.7) | 1113 (45.4) |
| Medicare or Medicaid, n (%) | 1068 (26.8) | 992 (18.6) | 2545 (36.4) | 1245 (41.5) | 1324 (54.0) |
| Other payer, n (%) | 180 (4.5) | 85 (2) | 98 (1) | 24 (1) | 15 (1) |
| Active medications at PCPa visit, mean (SD) | 0 (0) | 2.3 (2.9) | 5 (3.6) | 5.5 (4.3) | 8.1 (6) |
| Primary care visits, mean (SD) | 0 (0) | 0.7 (0.5) | 2.6 (1.3) | 2.1 (1.4) | 2.9 (2.3) |
| Weighted primary care visits, mean (SD) | 0 (0) | 0.7 (0.6) | 3.2 (1.8) | 2.8 (1.9) | 4.3 (3.7) |
| No-show visits, mean (SD) | 0.1 (0.4) | 0.2 (0.7) | 0.5 (1) | 0.6 (1.2) | 1.4 (2.1) |
| Urgent care visits, mean (SD) | 0 (0) | 0.1 (0.5) | 0.2 (0.5) | 0.2 (0.6) | 0.2 (0.6) |
| Telephone encounters, mean (SD) | 0 (0) | 0.4 (0.7) | 1.7 (1.8) | 1.4 (1.6) | 2.3 (2.5) |
| Emergency department visits, mean (SD) | 0 (0) | 0 (0) | 0.2 (0.5) | 0.2 (0.5) | 0.3 (0.7) |
| Emergent hospitalizations, mean (SD) | 0 (0) | 0 (0) | 0 (0.2) | 0 (0.3) | 0.1 (0.5) |
| Elective hospitalizations, mean (SD) | 0 (0) | 0 (0) | 0 (0) | 0 (0.2) | 0.1 (0.3) |
| Specialist visits (capped), mean (SD) | 0 (0) | 1 (1.2) | 1 (1) | 5.5 (1.4) | 14 (5.3) |
| Infusion visits, mean (SD) | 0 (0) | 0 (0.5) | 0 (0.4) | 0.1 (1) | 0.7 (4.1) |
| Transfusion visits, mean (SD) | 0 (0) | 0 (0.8) | 0 (0.2) | 0.1 (0.8) | 0.4 (2.6) |
| Radiology or procedure visits, mean (SD) | 0 (0) | 0.4 (0.8) | 0.6 (1) | 1.2 (1.4) | 2.2 (2.6) |
| Secure electronic messages to patient, mean (SD) | 0 (0) | 0.7 (1.4) | 2.3 (4.3) | 4 (6.4) | 6.8 (11.1) |
| Secure electronic messages from patient, mean (SD) | 0 (0) | 0.9 (1.8) | 2.8 (5.3) | 5 (8.2) | 8.9 (15) |
aPCP: primary care physician.
Patient characteristics of each utilization phenotype (group E to G) in the training set (N=24,324). The total column includes data from phenotypes in Table 1.
| Characteristics | Utilization phenotype | |||
| E | F | G | Total sample | |
| Size of group (n) | 2082 | 430 | 40 | 24,324 |
| Age, years, mean (SD) | 65.1 (16.8) | 67.4 (16.3) | 60.5 (14.4) | 52.7 (17.9) |
| Male, n (%) | 716 (34.4) | 158 (36.7) | 8 (20) | 9170 (37.7) |
| White, n (%) | 799 (38.4) | 191 (44.4) | 14 (35) | 11,943 (49.1) |
| Asian, n (%) | 525 (25.2) | 76 (18) | 4 (10) | 5400 (22.2) |
| Black, n (%) | 385 (18.5) | 102 (23.7) | 17 (43) | 2165 (8.9) |
| Commercial, n (%) | 431 (20.7) | 26 (6) | 3 (8) | 14,665 (60.3) |
| Medicare or Medicaid, n (%) | 1628 (78.2) | 402 (93.5) | 37 (93) | 9219 (38.0) |
| Other payer, n (%) | 23 (1) | 2 (1) | N/Aa | 440 (1.8) |
| Active medications at PCPb visit, mean (SD) | 11 (5) | 15.7 (6.1) | 16.2 (9.3) | 4.7 (5.2) |
| Primary care visits, mean (SD) | 7 (2.8) | 11.5 (4.5) | 33.2 (10) | 2.3 (3) |
| Weighted primary care visits, mean (SD) | 10.7 (4.4) | 19.1 (7.8) | 53.1 (15.6) | 3.2 (4.7) |
| No-show visits, mean (SD) | 1.8 (2.4) | 4.3 (4.9) | 6.2 (5.3) | 0.7 (1.6) |
| Urgent care visits, mean (SD) | 0.2 (0.7) | 0.5 (1.2) | 1.2 (2.4) | 0.1 (0.5) |
| Telephone encounters, mean (SD) | 5.9 (3.8) | 19.4 (10.3) | 18.5 (22.5) | 1.9 (3.8) |
| Emergency department visits, mean (SD) | 0.5 (1) | 1.6 (2.9) | 1.8 (2.1) | 0.2 (0.7) |
| Emergent hospitalizations, mean (SD) | 0.2 (0.5) | 0.9 (1.7) | 0.9 (1.5) | 0.1 (0.4) |
| Elective hospitalizations, mean (SD) | 0 (0.2) | 0.1 (0.4) | 0 (0.2) | 0 (0.1) |
| Specialist visits (capped), mean (SD) | 4.4 (3.3) | 11.5 (8.2) | 7.6 (9.2) | 3.2 (4.9) |
| Infusion visits, mean (SD) | 0.1 (2.2) | 0.1 (1.1) | 0 (0.2) | 0.1 (1.5) |
| Transfusion visits, mean (SD) | 0 (0.6) | 0.5 (3.3) | 0.2 (1.3) | 0.1 (1.1) |
| Radiology or procedure visits, mean (SD) | 1.5 (1.7) | 2.7 (3) | 2.5 (2.9) | 0.8 (1.5) |
| Secure electronic messages to patient, mean (SD) | 3.4 (7.8) | 5.6 (14.6) | 5 (14.5) | 2.4 (6.1) |
| Secure electronic messages from patient, mean (SD) | 4.4 (10.4) | 8.1 (22.3) | 8.6 (25.5) | 3.1 (8.1) |
aN/A: not applicable.
bPCP: primary care physician.
Log-linear model using demographic variables and baseline utilization phenotype to predict subsequent year primary care telephone encounters and office visits among patients in the test set.
| Model predictors | Adjusted | AICa |
| Age-sexb | .166 | 60,780 |
| Payerc | .128 | 61,495 |
| Naïve phenotypes (NP)d | .259 | 57,724 |
| Primary care cluster utilization phenotype (UP)e | .330 | 55,088 |
| Age-sex and payer | .209 | 59,450 |
| Age-sex, payer, and NP | .343 | 54,813 |
| Age-sex, payer, and UP | .394 | 52,769 |
aAIC: Akaike information criterion.
bAge-sex bins are categorical variables of the combination of male or female with the following age groups: 18-34, 35-49, 50-64, 65-69, 70-84, and 85-115 years.
cPayers are defined as commercial, Medicare or Medicaid, or other.
dThe naïve phenotype is a categorical variable that is obtained by summing the total number of health care encounters in the baseline year. These values were rank ordered and divided into 7 percentiles.
eThe utilization phenotype is a categorical variable encoding 1 of the 7 phenotype clusters created by our algorithm.
Figure 5The change of weighted panel size for various primary care providers with more than 150 patients. The panel size increases on average by 12.8%. *These 2 primary care physician (PCPs), who are geriatricians, had a panel size increase of more than 100%.