| Literature DB >> 26380564 |
Adam Steventon1, Richard Grieve2, Jasjeet S Sekhon3.
Abstract
Various approaches have been used to select control groups in observational studies: (1) from within the intervention area; (2) from a convenience sample, or randomly chosen areas; (3) from areas matched on area-level characteristics; and (4) nationally. The consequences of the decision are rarely assessed but, as we show, it can have complex impacts on confounding at both the area and individual levels. We began by reanalyzing data collected for an evaluation of a rapid response service on rates of unplanned hospital admission. Balance on observed individual-level variables was better with external than local controls, after matching. Further, when important prognostic variables were omitted from the matching algorithm, imbalances on those variables were also minimized using external controls. Treatment effects varied markedly depending on the choice of control area, but in the case study the variation was minimal after adjusting for the characteristics of areas. We used simulations to assess relative bias and means-squared error, as this could not be done in the case study. A particular feature of the simulations was unexplained variation in the outcome between areas. We found that the likely impact of unexplained variation for hospital admissions dwarfed the benefits of better balance on individual-level variables, leading us to prefer local controls in this instance. In other scenarios, in which there was less unexplained variation in the outcome between areas, bias and mean-squared error were optimized using external controls. We identify some general considerations relevant to the choice of control population in observational studies.Entities:
Keywords: Program evaluation; Propensity score matching; Quasi-experiments
Year: 2015 PMID: 26380564 PMCID: PMC4565881 DOI: 10.1007/s10742-014-0135-8
Source DB: PubMed Journal: Health Serv Outcomes Res Methodol ISSN: 1387-3741
Standardized differences before and after matching (%), with all individual baseline variables included in the genetic matching
| Strategy 1: Local controls | Strategy 2: Random areas | Strategy 3: Matched area | Strategy 4: National | |||||
|---|---|---|---|---|---|---|---|---|
| Before |
| Before |
| Before |
| Before |
| |
| Mean predictive risk score | 59.6 |
| 55.9 |
| 55.5 |
| 56.6 |
|
| Mean age | 56.9 |
| 51.7 |
| 49.2 |
| 53.3 |
|
| Mean number of unplanned admissionsb | 48.5 |
| 42.5 |
| 37.5 |
| 43.2 |
|
| Female gender | 22.5 |
| 24.8 |
| 30.4 |
| 26.1 |
|
| Mean socioeconomic deprivation score | −1.5 |
| 21.8 |
| 13.8 |
| 20.1 |
|
| Cancer prevalence | −3.2 |
| 5.2 |
| 2.6 |
| 2.5 |
|
| Diabetes prevalence | 3.5 |
| 11.3 |
| 6.2 |
| 9.4 |
|
| Congestive heart failure prevalence | 13.0 |
| 11.0 |
| 8.5 |
| 11.3 |
|
| Ischemic heart disease prevalence | 11.5 |
| 12.8 |
| 16.5 |
| 13.4 |
|
| Mean number of chronic conditions | 26.0 |
| 25.3 |
| 24.1 |
| 24.1 |
|
| Mean number of planned admissions | 13.2 |
| 14.6 |
| 11.9 |
| 14.6 |
|
| Mean (absolute) standardized difference | 23.6 |
| 25.9 |
| 23.3 |
| 25.0 |
|
Negative values imply that the variable was lower on average in the intervention than matched control group
aFor reasons of space, standardized differences for strategy 2 are the medians over all 32 possible geographies. However, there was substantial variation depending on the choice of geography. Age, for example, showed standardized differences that ranged from 38.3 to 66.6 % before matching, depending on which area was chosen, and from −12.5 to 20.0 % after matching. Ranges for the predictive risk score were from 46.7 to 61.3 % before matching, and from 0.4 to 12.5 % after matching. For the number of unplanned admissions, ranges were from 34.7 to 48.3 % before matching, and from −9.9 to 11.5 % after matching
bAdmission counts are over the year prior to enrollment
Fig. 1Area-level variables. The first panel shows the values of each of the 43 area-level variables under each strategy, while the second panel shows relative differences from the intervention area. Figures for Strategy 2 (‘random area’) are medians over all 32 possible geographies. Figures for Strategy 4 (‘national’) are weighted means over the 32 geographies (weighted for population size)
Estimated treatment effects
| Strategy 1: Local controls | Strategy 2: Random areas | Strategy 3: Matched area | Strategy 4: National | |
|---|---|---|---|---|
| Relative risk (95 % confidence interval) | ||||
| With all baseline variables included in the genetic matching | 2.07 (1.46–2.94) | Median: 2.23 (1.52–3.28) Minimum: 1.35 (1.00–1.82) Maximum: 3.87 (2.43–6.16) | 1.93 (1.37–2.73) | 2.07 (1.47–2.92) |
| With age and predictive risk score omitted from the genetic matching | 2.76 (1.89–4.03) | Median: 2.42 (1.61–3.63) Minimum: 1.66 (1.21–2.27) Maximum: 3.87 (2.32–6.44) | 2.32 (1.57–3.42) | 2.23 (1.55–3.21) |
| Absolute risk difference (95 % confidence interval) | ||||
| With all baseline variables included in the genetic matching | 0.28 (0.27–0.29) | Median: 0.30 (0.28–0.31) Minimum: 0.14 (0.13–0.15) Maximum: 0.40 (0.39–0.41) | 0.26 (0.25–0.27) | 0.28 (0.27–0.29) |
| With age and predictive risk score omitted from the genetic matching | 0.34 (0.33–0.35) | Median: 0.31 (0.30–0.33) Minimum: 0.21 (0.20–0.23) Maximum: 0.40 (0.39–0.41) | 0.31 (0.29–0.32) | 0.30 (0.28–0.31) |
The simulation design and relationships assumed in the base case scenario
| Level | Observation | Structure | Strength of relationship with intervention assignment ( | Strength of relationship with outcome ( | |
|---|---|---|---|---|---|
|
| Individual | Observed |
are correlated | 0.5 | 0.30 |
|
| Individual | Unobserved | Range 0.1–0.3 | 0.15 | |
|
| Area | Observed | Independent | n/a | 0.01 |
|
| Area | Observed | Determines the mean of | n/a | 0.05 |
|
| Area | Unobserved | Independent | n/a | 0.06 |
Balance in observed person-level confounder,
|
|
| Means (SDs) after matching | Standardized differences (%) after matching | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Treated | Matched controls from strategy: | Strategy: | ||||||||
| (1) | (2) Random areas | (3) Matched area | (4) | (1) | (2) Random areas | (3) Matched area | (4) | |||
| 10 | 0.1 | 1.455 (0.099) | 1.453 (0.099) | 1.454 (0.099) | 1.454 (0.099) | 1.455 (0.099) | 0.18 | 0.12 | 0.12 | 0.00 |
| 0.2 | 1.471 (0.100) | 1.469 (0.099) | 1.470 (0.099) | 1.470 (0.099) | 1.471 (0.100) | 0.19 | 0.13 | 0.13 | 0.00 | |
| 0.3 | 1.486 (0.098) | 1.484 (0.098) | 1.484 (0.098) | 1.484 (0.098) | 1.486 (0.098) | 0.20 | 0.13 | 0.13 | 0.00 | |
|
| 0.1 | 1.344 (0.056) | 1.342 (0.056) | 1.343 (0.056) | 1.343 (0.056) | 1.344 (0.056) | 0.22 | 0.07 | 0.07 | 0.00 |
|
|
( |
( |
( |
( |
( |
|
|
|
| |
| 0.3 | 1.363 (0.056) | 1.361 (0.056) | 1.362 (0.056) | 1.362 (0.056) | 1.363 (0.056) | 0.24 | 0.08 | 0.07 | 0.00 | |
| 50 | 0.1 | 1.244 (0.044) | 1.241 (0.043) | 1.243 (0.043) | 1.243 (0.043) | 1.244 (0.044) | 0.30 | 0.05 | 0.05 | 0.00 |
| 0.2 | 1.250 (0.043) | 1.247 (0.043) | 1.250 (0.043) | 1.250 (0.043) | 1.250 (0.043) | 0.31 | 0.05 | 0.05 | 0.00 | |
| 0.3 | 1.256 (0.043) | 1.253 (0.043) | 1.255 (0.043) | 1.255 (0.043) | 1.256 (0.043) | 0.33 | 0.05 | 0.05 | 0.00 | |
Italic part shows the base case scenario
Balance in unobserved person-level confounder,
|
|
| Means (SDs) after matching | Standardized differences (%) after matching | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Treated | Matched controls from strategy: | Strategy: | ||||||||
| (1) | (2) Random areas | (3) Matched area | (4) | (1) | (2) Random areas | (3) Matched area | (4) | |||
| 10 | 0.1 | 1.176 (0.101) | 1.080 (0.110) | 0.097 (1.004) | 1.040 (0.251) | 0.090 (0.201) | 9.66 | 109.03 | 13.76 | 109.64 |
| 0.2 | 1.261 (0.100) | 1.071 (0.109) | 0.094 (1.003) | 1.046 (0.251) | 0.094 (0.202) | 19.30 | 118.23 | 21.81 | 118.31 | |
| 0.3 | 1.345 (0.100) | 1.062 (0.109) | 0.093 (1.010) | 1.047 (0.251) | 0.100 (0.201) | 28.74 | 127.31 | 30.27 | 126.61 | |
|
| 0.1 | 1.132 (0.058) | 1.036 (0.076) | 0.071 (0.998) | 1.015 (0.240) | 0.068 (0.164) | 9.63 | 106.74 | 11.75 | 107.03 |
|
|
( |
( |
( |
( |
( |
|
|
|
| |
| 0.3 | 1.260 (0.057) | 0.976 (0.077) | 0.071 (0.996) | 1.022 (0.237) | 0.072 (0.165) | 28.91 | 121.01 | 24.26 | 120.93 | |
| 50 | 0.1 | 1.094 (0.045) | 0.998 (0.075) | 0.040 (1.005) | 1.001 (0.238) | 0.048 (0.157) | 9.65 | 106.03 | 9.29 | 105.21 |
| 0.2 | 1.139 (0.044) | 0.948 (0.075) | 0.062 (1.003) | 0.998 (0.235) | 0.050 (0.158) | 19.36 | 108.91 | 14.31 | 110.20 | |
| 0.3 | 1.183 (0.044) | 0.900 (0.075) | 0.052 (0.997) | 0.998 (0.238) | 0.051 (0.156) | 28.83 | 115.30 | 18.93 | 115.41 | |
Italic part shows the base case scenario
Fig. 2Box plots of the estimated treatment effects based on 20,000 replications from the simulation experiment. The horizontal red line represents the true treatment effect. Saturation = 30 % and throughout (Color figure online)
Bias (mean-squared error) as a percentage of the population
| Strategy 1: Local controls | Strategy 2: Random areas | Strategy 3: Matched area | Strategy 4: National | |
|---|---|---|---|---|
| Central assumptions ( | ||||
| Simple confounding | 0.04 (6.51) | −0.02 (6.71) | 0.02 (6.67) | −0.02 (6.50) |
| No area-level variation | 0.26 (5.92) | 1.64 (11.18) | 0.24 (5.90) | 1.68 (8.97) |
| No unexplained area-level variation | 0.26 (5.61) | 2.23 (14.97) | 0.23 (5.71) | 2.18 (10.89) |
| |
( |
( |
( |
( |
| High unexplained area-level variation | 0.17 (4.49) | 4.74 (41.39) | 2.76 (19.69) | 4.68 (27.79) |
| Sensitivity analysis: low saturation ( | ||||
| Simple confounding | −0.02 (19.49) | −0.05 (19.36) | 0.01 (19.48) | 0.03 (19.72) |
| No area-level variation | 0.26 (16.86) | 1.60 (22.76) | 0.25 (17.26) | 1.66 (21.10) |
| No unexplained area-level variation | 0.25 (16.74) | 2.19 (26.26) | 0.29 (16.25) | 2.18 (22.66) |
| Base case scenario | 0.22 (15.53) | 2.70 (29.17) | 0.78 (17.28) | 2.65 (24.64) |
| High unexplained area-level variation | 0.20 (12.54) | 4.59 (49.36) | 2.69 (28.80) | 4.56 (37.29) |
| Sensitivity analysis: high saturation ( | ||||
| Simple confounding | 0.02 (4.06) | 0.00 (4.07) | −0.01 (4.04) | −0.03 (4.06) |
| No area-level variation | 0.26 (3.75) | 1.66 (8.81) | 0.20 (3.80) | 1.65 (6.57) |
| No unexplained area-level variation | 0.24 (3.51) | 2.25 (13.06) | 0.24 (3.61) | 2.18 (8.59) |
| Base case scenario | 0.23 (3.36) | 2.74 (15.80) | 0.74 (4.31) | 2.70 (11.05) |
| High unexplained area-level variation | 0.20 (2.73) | 4.89 (41.92) | 2.79 (18.49) | 4.78 (26.57) |
Italic part shows the base case scenario. = 0.2 throughout
Factors to consider when selecting control populations
| A situation in which local controls are preferred: |
| Low or moderate intervention saturation; and |
| Low risk of unobserved confounding at the individual level |
| A situation in which controls from a matched area are preferred: |
| High intervention saturation means there is a limited supply of controls from within the local area; |
| Unobserved confounding is likely at the individual level, and the unobserved confounder is a relatively strong predictor of treatment assignment; |
| The distribution of the unobserved confounder is likely to be similar in the matched control area to that in the intervention area; and |
| Area-level variation in outcomes either does not exist or can be largely explained by observed area-level variables that are accounted for in the matching |
| Other considerations include the relative population sizes of the areas, spillover effects and differences in measurement |