| Literature DB >> 27340369 |
Stephen O'Neill1, Noémi Kreif1, Richard Grieve1, Matthew Sutton2, Jasjeet S Sekhon3.
Abstract
Difference-in-differences (DiD) estimators provide unbiased treatment effect estimates when, in the absence of treatment, the average outcomes for the treated and control groups would have followed parallel trends over time. This assumption is implausible in many settings. An alternative assumption is that the potential outcomes are independent of treatment status, conditional on past outcomes. This paper considers three methods that share this assumption: the synthetic control method, a lagged dependent variable (LDV) regression approach, and matching on past outcomes. Our motivating empirical study is an evaluation of a hospital pay-for-performance scheme in England, the best practice tariffs programme. The conclusions of the original DiD analysis are sensitive to the choice of approach. We conduct a Monte Carlo simulation study that investigates these methods' performance. While DiD produces unbiased estimates when the parallel trends assumption holds, the alternative approaches provide less biased estimates of treatment effects when it is violated. In these cases, the LDV approach produces the most efficient and least biased estimates.Entities:
Keywords: Difference-in-differences; Matching; Pay-for-performance; Policy evaluation; Synthetic control method
Year: 2016 PMID: 27340369 PMCID: PMC4869762 DOI: 10.1007/s10742-016-0146-8
Source DB: PubMed Journal: Health Serv Outcomes Res Methodol ISSN: 1387-3741
Fig. 1Comparison of surgery within 48 h of emergency admission for hip fracture for participating hospitals to a non-participating hospitals, b the synthetic control, and c the matched controls
Fig. 2Comparison of mortality within 30 days of emergency admission for hip fracture for participating hospitals to a non-participating hospitals, b the synthetic control, and c the matched controls
BPT case study results: ATT on process and outcome measures according to method
| DiDa | LDV | Synthetic controls | Matching + DiD | |
|---|---|---|---|---|
| Surgery within 48 h | 0.0403 | 0.0539 | 0.0482 | 0.0488 |
| Dead within 30 days | −0.0080 | −0.0052 | −0.0051 | −0.0071 |
| Emergency re-admissions, 30 days | 0.0003 | 0.0008 | 0.0028 | 0.0047 |
| Usual residence, 56 days | 0.0228 | 0.0087 | 0.0104 | 0.0124 |
For each method, the analysis adjusted for the following covariates: proportion of patients in age groups defined in 5 year increments from 60 to 105, the proportion of males and the proportion admitted from their usual residence
Reported p values are for the null of a true ATT = 0. For DiD and LDV, asymptotic normality is assumed. For Matching +DiD, reported p-values are conditional on the matched data. For Synthetic controls, reported p-values were calculated using placebo-tests in a procedure akin to permutation tests (Abadie et al. 2010). This procedure involves iteratively resampling from the control pool, and in each iteration re-assigning each control unit as a ‘placebo treated unit’, with a probability according to the proportion of treated units in the original sample. The synthetic control method as described in Sect. 3.2.3 was then applied on these ‘placebo data’ and an ATT calculated for the placebo treated versus control units. This iterative process was repeated 200 times, to report a distribution of ATTs under the null hypothesis. The p value for the ATT was calculated according to the proportion of the replicates in which the absolute value of the placebo-ATT exceeded the estimated ATT. It should be noted that the p value based on placebo tests relate to falsification tests, while the p-values reported for the other methods relate to sampling uncertainty. Hence the p values are not directly comparable
aMcDonald et al. (2012) report similar results for their DiD estimation which was based on patient level data, including year and hospital fixed effects and using robust, unclustered standard errors. Here we conduct the analysis at the hospital trust level using quarterly data, weighting by number of admissions and cluster by hospital trust
Monte Carlo simulations: summary of parameter values across the scenarios
| Scenario | Scenario description | Total periods | Std. deviation of epsilon ( | Settings for λ | Serial correlation ( | ||
|---|---|---|---|---|---|---|---|
| Trend | Amplitude ( | Wave length ( | |||||
| A | Parallel trends holds | {3, 10, 30} | 10 | 0 | 0 | 0 | 0 |
| B | Parallel trends fails | {3, 10, 30} | 10 | 10 | 2 | 4 | 0 |
| C | Parallel trends fails + serial correlation | {3, 10, 30} | 10 | 10 | 2 | 4 | 0.7 |
| D | Parallel trends fails + high variance | {3, 10, 30} | 50 | 10 | 2 | 4 | 0 |
Across all scenarios: effect of covariates ( and Average Treatment effect Serial correlation: )
Time-varying effect of unobserved confounders:
Fig. 3Monte Carlo simulation results: bias (%) and distribution of the estimates: a Scenario A—parallel trends. b Scenario B—non parallel trends, no serial correlation (ρ = 0), low outcome variation (σe = 10). c Scenario C—non parallel trends and high serial correlation (ρ = 0.7). d Scenario D—non parallel trends and high outcome variation (σe = 50)
Monte Carlo simulation: bias (%) and RMSE for estimation of the ATT (true value of 10)
| Scenario | Description | Periods | Bias (%) | RMSE | ||||
|---|---|---|---|---|---|---|---|---|
| 3 | 10 | 30 | 3 | 10 | 30 | |||
| A | Parallel trends holds | DiD | 1 | −1 | −1 | 2 | 2 | 2 |
| Synthetic controls | 63 | 33 | 26 | 7 | 6 | 5 | ||
| LDV | 32 | 23 | 16 | 4 | 3 | 3 | ||
| Matching + DiD | 27 | 16 | 7 | 4 | 3 | 3 | ||
| B | Parallel trends fails | DiD | 127 | 57 | 132 | 13 | 6 | 13 |
| Synthetic controls | 75 | 34 | 37 | 13 | 8 | 8 | ||
| LDV | 53 | 5 | −2 | 6 | 3 | 3 | ||
| Matching + DiD | 69 | 18 | 26 | 9 | 5 | 5 | ||
| C | Parallel trends fails + serial correlation (ρ = 0.7) | DiD | 127 | 57 | 132 | 13 | 6 | 13 |
| Synthetic controls | 23 | 17 | 20 | 6 | 4 | 5 | ||
| LDV | 5 | −3 | −4 | 1 | 1 | 1 | ||
| Matching + DiD | 29 | 12 | 21 | 4 | 2 | 3 | ||
| D | Parallel trends fails + high variance | DiD | 129 | 52 | 128 | 16 | 10 | 15 |
| Synthetic controls | 419 | 189 | 176 | 47 | 30 | 25 | ||
| LDV | 355 | 165 | 90 | 37 | 20 | 16 | ||
| Matching + DiD | 301 | 124 | 106 | 34 | 20 | 18 | ||