| Literature DB >> 27867250 |
Aidan G O'Keeffe1, Gianluca Baio1.
Abstract
Regression discontinuity designs (RD designs) are used as a method for causal inference from observational data, where the decision to apply an intervention is made according to a 'decision rule' that is linked to some continuous variable. Such designs are being increasingly developed in medicine. The local average treatment effect (LATE) has been established as an estimator of the intervention effect in an RD design, particularly where a design's 'decision rule' is not adhered to strictly. Estimating the variance of the LATE is not necessarily straightforward. We consider three approaches to the estimation of the LATE: two-stage least squares, likelihood-based and a Bayesian approach. We compare these under a variety of simulated RD designs and a real example concerning the prescription of statins based on cardiovascular disease risk score.Entities:
Keywords: causal inference; local average treatment effect; regression discontinuity design; two‐stage least squares
Year: 2016 PMID: 27867250 PMCID: PMC5111792 DOI: 10.1111/sjos.12224
Source DB: PubMed Journal: Scand Stat Theory Appl ISSN: 0303-6898 Impact factor: 1.396
Figure 1Example sharp and fuzzy RD design plots. The dashed vertical line represents the intervention threshold. ‘Untreated’ labels subjects who do not receive the intervention; ‘Treated’ labels subjects who receive the intervention.
Figure 2Example plot of a simulated dataset where N = 1000. The discontinuity in outcome variable at the threshold can be clearly observed. Red icons (marked ‘Untreated’) indicate subjects who do not receive the intervention, and blue icons (marked ‘Treated’) indicate subjects who receive the intervention. The dashed vertical line represents the intervention threshold.
Two‐stage least squares estimates ( ), maximum likelihood estimates ( ) and Bayesian estimates ( ) of the LATE, together with corresponding variance estimates from the simulated datasets with a variety of chosen RD design bandwidths and dataset sizes
|
|
|
|
|
|
|
|
| V(
| V(
|
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
| 0.05 | −2.00 | −2.00 | −2.01 | 0.0316 | 0.0130 | 0.0131 | 0.0191 | 0.0155 | 0.0153 |
| 0.10 | −2.00 | −2.00 | −2.00 | 0.0156 | 0.0064 | 0.0064 | 0.0167 | 0.0058 | 0.0058 |
| 0.15 | −2.00 | −2.00 | −2.01 | 0.0103 | 0.0042 | 0.0042 | 0.0137 | 0.0041 | 0.0041 |
| 0.20 | −2.00 | −2.00 | −2.01 | 0.0077 | 0.0032 | 0.0032 | 0.0113 | 0.0031 | 0.0031 |
| 0.25 | −2.00 | −2.00 | −2.01 | 0.0061 | 0.0025 | 0.0025 | 0.0094 | 0.0023 | 0.0023 |
|
| |||||||||
| 0.05 | −2.00 | −2.00 | −2.03 | 0.1727 | 0.0729 | 0.0775 | 0.0741 | 0.0683 | 0.0662 |
| 0.10 | −2.00 | −2.00 | −2.02 | 0.0807 | 0.0334 | 0.0343 | 0.0552 | 0.0294 | 0.0288 |
| 0.15 | −2.00 | −2.00 | −2.01 | 0.0532 | 0.0219 | 0.0225 | 0.0514 | 0.0261 | 0.0261 |
| 0.20 | −2.00 | −2.00 | −2.01 | 0.0395 | 0.0162 | 0.0165 | 0.0465 | 0.0194 | 0.0193 |
| 0.25 | −2.00 | −2.00 | −2.02 | 0.0314 | 0.0128 | 0.0130 | 0.0421 | 0.0146 | 0.0145 |
|
| |||||||||
| 0.05 | −1.99 | −1.99 | −2.07 | 0.4251 | 0.1948 | 0.2274 | 0.1541 | 0.1712 | 0.1498 |
| 0.10 | −2.00 | −1.99 | −2.04 | 0.1751 | 0.0755 | 0.0802 | 0.0965 | 0.0715 | 0.0692 |
| 0.15 | −2.00 | −2.00 | −2.04 | 0.1104 | 0.0462 | 0.0479 | 0.0888 | 0.0471 | 0.0461 |
| 0.20 | −2.00 | −2.00 | −2.02 | 0.0811 | 0.0340 | 0.0349 | 0.0826 | 0.0370 | 0.0363 |
| 0.25 | −2.00 | −2.00 | −2.03 | 0.0640 | 0.0268 | 0.0274 | 0.0764 | 0.0299 | 0.0296 |
|
| |||||||||
| 0.05 | −1.98 | −1.96 | −2.13 | 2.4527 | 25.2280 | 50.0041 | 6.3297 | 0.9389 | 0.6909 |
| 0.10 | −1.96 | −1.96 | −2.08 | 0.4048 | 0.2020 | 0.2171 | 0.1880 | 0.1715 | 0.1684 |
| 0.15 | −1.99 | −1.99 | −2.06 | 0.2360 | 0.1007 | 0.1104 | 0.1528 | 0.0992 | 0.0977 |
| 0.20 | −2.01 | −2.00 | −2.06 | 0.1746 | 0.0734 | 0.0776 | 0.1443 | 0.0769 | 0.0757 |
| 0.25 | −2.01 | −2.01 | −2.07 | 0.1351 | 0.0560 | 0.0583 | 0.1392 | 0.0611 | 0.0584 |
The non‐adherence probability is set at 0.1. denotes the sample variance of the maximum likelihood estimates, and denotes the sample variance of the two‐stage least square estimates.
LATE, local average treatment effect; RD, regression discontinuity.
Two‐stage least squares estimates ( ), maximum likelihood estimates ( ) and Bayesian estimates ( ) of the LATE, together with corresponding variance estimates from the simulated datasets with a variety of chosen RD design bandwidths and dataset sizes
|
|
|
|
|
|
|
|
| V(
| V(
|
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
| 0.05 | −2.00 | −2.00 | −1.99 | 0.0851 | 0.0244 | 0.0248 | 0.0499 | 0.0240 | 0.0239 |
| 0.10 | −2.01 | −2.01 | −2.00 | 0.0415 | 0.0117 | 0.0118 | 0.0455 | 0.0122 | 0.0123 |
| 0.15 | −2.01 | −2.01 | −2.00 | 0.0272 | 0.0076 | 0.0077 | 0.0392 | 0.0084 | 0.0083 |
| 0.20 | −2.01 | −2.01 | −2.01 | 0.0205 | 0.0058 | 0.0058 | 0.0325 | 0.0062 | 0.0062 |
| 0.25 | −2.01 | −2.01 | −2.00 | 0.0163 | 0.0046 | 0.0046 | 0.0270 | 0.0050 | 0.0050 |
|
| |||||||||
| 0.05 | −1.97 | −1.96 | −1.95 | 0.7070 | 0.3314 | 0.4726 | 0.2008 | 0.2318 | 0.1860 |
| 0.10 | −1.98 | −1.97 | −1.96 | 0.2366 | 0.0732 | 0.0769 | 0.1419 | 0.0759 | 0.0721 |
| 0.15 | −1.99 | −1.99 | −1.97 | 0.1479 | 0.0437 | 0.0452 | 0.1361 | 0.0464 | 0.0457 |
| 0.20 | −1.99 | −1.99 | −1.97 | 0.1078 | 0.0315 | 0.0321 | 0.1282 | 0.0347 | 0.0339 |
| 0.25 | −2.00 | −2.00 | −1.98 | 0.0846 | 0.0245 | 0.0249 | 0.1171 | 0.0268 | 0.0268 |
|
| |||||||||
| 0.05 | −2.12 | −1.99 | −1.92 | 2.8039 | 19.6660 | 398.6478 | 0.4603 | 0.8190 | 0.7911 |
| 0.10 | −1.98 | −1.98 | −1.97 | 0.4996 | 0.1693 | 0.1997 | 0.2328 | 0.1608 | 0.1455 |
| 0.15 | −2.01 | −2.00 | −1.98 | 0.2935 | 0.0888 | 0.0959 | 0.2175 | 0.0826 | 0.0767 |
| 0.20 | −2.02 | −2.01 | −1.98 | 0.2190 | 0.0655 | 0.0678 | 0.2129 | 0.0613 | 0.0598 |
| 0.25 | −2.02 | −2.02 | −1.99 | 0.1707 | 0.0505 | 0.0519 | 0.2031 | 0.0498 | 0.0481 |
|
| |||||||||
| 0.05 | −2.02 | −2.05 | −1.93 | 7.0890 | 63.3852 | 412.9690 | 104.0435 | 5.3853 | 2.6314 |
| 0.10 | −2.19 | −2.17 | −1.94 | 2.9853 | 99.5390 | 545.0271 | 0.5564 | 2.2327 | 3.2587 |
| 0.15 | −2.01 | −2.01 | −1.96 | 0.6987 | 0.2678 | 0.9292 | 0.3597 | 0.2184 | 0.1650 |
| 0.20 | −1.97 | −1.98 | −1.97 | 0.5690 | 0.3189 | 0.3365 | 0.3470 | 0.1703 | 0.1528 |
| 0.25 | −2.00 | −2.00 | −1.98 | 0.3622 | 0.1194 | 0.1259 | 0.4500 | 0.0966 | 0.0914 |
The non‐adherence probability is set at 0.2. denotes the sample variance of the maximum likelihood estimates, and denotes the sample variance of the two‐stage least square estimates.
LATE, local average treatment effect; RD, regression discontinuity.
Figure 3Scatter plot showing 10‐year cardiovascular disease (CVD) risk score versus low‐density lipoprotein (LDL) cholesterol level for The Health Improvement Network data subset. Patients who received statins are denoted ‘Treated’, and those who did not receive statins are denoted ‘Untreated’. The threshold (10‐year CVD risk score of 20%) is marked by a vertical dashed line.
Two‐stage least squares estimates ( ), maximum likelihood estimates ( ) and Bayesian estimates ( ) of the LATE for the effect of statins on LDL cholesterol level, together with corresponding variance estimates with a variety of chosen RD design bandwidths
| Bandwidth ( |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| 0.05 | −3.66 | −3.23 | −1.70 | 1.4908 | 3.5536 | 3.8357 | 0.1921 |
| 0.10 | −2.53 | −2.50 | −1.68 | 0.3384 | 0.4237 | 0.4150 | 0.1812 |
| 0.15 | −2.11 | −2.09 | −1.75 | 0.1641 | 0.1733 | 0.1685 | 0.1829 |
| 0.20 | −1.98 | −1.99 | −1.72 | 0.1187 | 0.1210 | 0.1216 | 0.1677 |
| 0.25 | −2.00 | −2.02 | −1.74 | 0.1157 | 0.1187 | 0.1193 | 0.1312 |
The data used were taken from the THIN database.
LATE, local average treatment effect; RD, regression discontinuity; THIN, The Health Improvement Network.