| Literature DB >> 31708716 |
Frank Windmeijer1,2, Helmut Farbmacher3, Neil Davies2,4, George Davey Smith2,4.
Abstract
We investigate the behavior of the Lasso for selecting invalid instruments in linear instrumental variables models for estimating causal effects of exposures on outcomes, as proposed recently by Kang et al. Invalid instruments are such that they fail the exclusion restriction and enter the model as explanatory variables. We show that for this setup, the Lasso may not consistently select the invalid instruments if these are relatively strong. We propose a median estimator that is consistent when less than 50% of the instruments are invalid, and its consistency does not depend on the relative strength of the instruments, or their correlation structure. We show that this estimator can be used for adaptive Lasso estimation, with the resulting estimator having oracle properties. The methods are applied to a Mendelian randomization study to estimate the causal effect of body mass index (BMI) on diastolic blood pressure, using data on individuals from the UK Biobank, with 96 single nucleotide polymorphisms as potential instruments for BMI. Supplementary materials for this article are available online.Entities:
Keywords: Causal inference; Instrumental variables estimation; Invalid instruments; Lasso; Mendelian randomization.
Year: 2018 PMID: 31708716 PMCID: PMC6817329 DOI: 10.1080/01621459.2018.1498346
Source DB: PubMed Journal: J Am Stat Assoc ISSN: 0162-1459 Impact factor: 5.033
Estimation results for 2SLS and Lasso estimators for β; L = 10, s = 3, .
| av. # instr | freq. all | ||||||
|---|---|---|---|---|---|---|---|
| selected as invalid | invalid instr | ||||||
| β | bias | std dev | rmse | mad | [min, max] | selected | |
| 2SLS | 0.2966 | 0.0808 | 0.3074 | 0.2944 | 0 | 0 | |
| 2SLS or | 0.0063 | 0.0843 | 0.0845 | 0.0570 | 3 | 1 | |
| Lasso | 0.1384 | 0.0965 | 0.1687 | 0.1352 | 6.41 [2,9] | 0.990 | |
| Post-Lasso | 0.1169 | 0.1136 | 0.1630 | 0.1143 | |||
| Lasso | 0.2206 | 0.0847 | 0.2363 | 0.2174 | 3.16 [0,8] | 0.664 | |
| Post-Lasso | 0.0905 | 0.1243 | 0.1537 | 0.0994 | |||
| 2SLS | 0.3019 | 0.0387 | 0.3044 | 0.3007 | 0 | 0 | |
| 2SLS or | 0.0047 | 0.0422 | 0.0424 | 0.0285 | 3 | 1 | |
| Lasso | 0.0721 | 0.0509 | 0.0882 | 0.0705 | 6.64 [3,9] | 1 | |
| Post-Lasso | 0.0617 | 0.0577 | 0.0845 | 0.0644 | |||
| Lasso | 0.1140 | 0.0430 | 0.1218 | 0.1165 | 3.76 [3,8] | 1 | |
| Post-Lasso | 0.0277 | 0.0521 | 0.0590 | 0.0387 | |||
| 2SLS | 0.2996 | 0.0177 | 0.3002 | 0.2992 | 0 | 0 | |
| 2SLS or | 0.0006 | 0.0182 | 0.0182 | 0.0126 | 3 | 1 | |
| Lasso | 0.0317 | 0.0236 | 0.0395 | 0.0311 | 6.44 [3,9] | 1 | |
| Post-Lasso | 0.0272 | 0.0267 | 0.0380 | 0.0282 | |||
| Lasso | 0.0479 | 0.0187 | 0.0514 | 0.0489 | 3.81 [3,9] | 1 | |
| Post-Lasso | 0.0118 | 0.0238 | 0.0265 | 0.0176 |
NOTE: Results from 1000 MC replications; β = 0; ρ = 0.25; a = 0.2; .
Estimation results for estimators of β; L = 10, .
| av. # instr | freq. all | |||||
|---|---|---|---|---|---|---|
| selected as invalid | invalid instr | |||||
| β | bias | std dev | rmse | mad | [min, max] | selected |
| Post-Lasso | 0.2696 | 0.0583 | 0.2759 | 0.2718 | 5.06 [0,9] | 0.03 |
| Post-Lasso | 0.2658 | 0.0429 | 0.2692 | 0.2651 | 0.45 [0,8] | 0 |
| 0.1128 | 0.0936 | 0.1466 | 0.1129 | |||
| ALasso | 0.1735 | 0.0952 | 0.1979 | 0.1830 | 3.73 [0,9] | 0.48 |
| Post-ALasso | 0.1324 | 0.1321 | 0.1870 | 0.1591 | ||
| ALasso | 0.2586 | 0.0420 | 0.2620 | 0.2568 | 0.46 [0,6] | 0.04 |
| Post-ALasso | 0.2428 | 0.0787 | 0.2552 | 0.2568 | ||
| Post-Lasso | 0.3004 | 0.0308 | 0.3020 | 0.3023 | 8.89 [3,9] | 0.01 |
| Post-Lasso | 0.2910 | 0.0352 | 0.2931 | 0.2932 | 6.58 [0,9] | 0.00 |
| 0.0634 | 0.0500 | 0.0808 | 0.0649 | |||
| ALasso | 0.0600 | 0.0527 | 0.0798 | 0.0596 | 4.42 [3,9] | 0.998 |
| Post-ALasso | 0.0360 | 0.0626 | 0.0722 | 0.0442 | ||
| ALasso | 0.1656 | 0.0489 | 0.1726 | 0.1668 | 3.07 [0,6] | 0.89 |
| Post-ALasso | 0.0281 | 0.0774 | 0.0823 | 0.0348 | ||
| Post-Lasso | 0.3197 | 0.0120 | 0.3199 | 0.3202 | 8.97 [8,9] | 0 |
| Post-Lasso | 0.3202 | 0.0122 | 0.3204 | 0.3204 | 8.70 [7,9] | 0 |
| 0.0278 | 0.0226 | 0.0358 | 0.0284 | |||
| ALasso | 0.0153 | 0.0222 | 0.0270 | 0.0190 | 3.92 [3,9] | 1 |
| Post-ALasso | 0.0092 | 0.0253 | 0.0269 | 0.0177 | ||
| ALasso | 0.0661 | 0.0212 | 0.0694 | 0.0668 | 3.02 [3,6] | 1 |
| Post-ALasso | 0.0010 | 0.0186 | 0.0187 | 0.0129 |
NOTE: Results from 1000 MC replications; a = 0.2; β = 0; ρ = 0.25.
Results for post-(A)Lasso 2SLS estimators for β; L = 10, s = 3.
| av. # instr | freq. all | |||||||
|---|---|---|---|---|---|---|---|---|
| selected as invalid | invalid instr | |||||||
| bias | std dev | rmse | mad | [min, max] | selected | |||
| post-Lasso | ||||||||
| 500 | 0.0896 | 0.1252 | 0.1539 | 0.1007 | 2.56 [0,5] | 0.391 | ||
| 2000 | 0.0055 | 0.0430 | 0.0434 | 0.0286 | 3.02 [3,5] | 1 | ||
| 10,000 | 0.0009 | 0.0186 | 0.0186 | 0.0129 | 3.02 [3,5] | 1 | ||
| post-ALasso | ||||||||
| 500 | 0.2172 | 0.1091 | 0.2431 | 0.2471 | 0.86 [0,5] | 0.07 | ||
| 2000 | 0.0173 | 0.0677 | 0.0699 | 0.0303 | 3.05 [1,5] | 0.93 | ||
| 10,000 | 0.0008 | 0.0186 | 0.0186 | 0.0129 | 3.01 [3,5] | 1 | ||
NOTE: Results from 1000 MC replications; β = 0; a = 0.2; ; ρ = 0.25.
Figure 1.(a–c) Rejection frequencies of robust Wald tests for H0: β = 0 at 10% level as a function of sample size, in steps of 500. Equal strength instruments design, Post-Lasso in (a), Post-ALasso in (b). Unequal strength instruments design, Post-ALasso in (c). Based on 1000 MC replications for each sample size.
Estimation results, the effect of on
| estimate | rob st err | # instr | ||
|---|---|---|---|---|
| selected as invalid | ||||
| OLS | 0.206 | 0.003 | ||
| 2SLS | 0.087 | 0.016 | 0 | 0.0000 |
| Lasso | 0.126 | 56 | ||
| Post-Lasso | 0.145 | 0.033 | 1.0000 | |
| Lasso | 0.111 | 20 | ||
| Post-Lasso | 0.142 | 0.020 | 0.6435 | |
| Post-Lasso | 0.122 | 0.018 | 12 | 0.0123 |
| median, | 0.148 | |||
| ALasso | 0.158 | 54 | ||
| Post-ALasso | 0.161 | 0.029 | 1.0000 | |
| ALasso | 0.131 | 17 | ||
| Post-ALasso | 0.151 | 0.019 | 0.4091 | |
| Post-ALasso | 0.163 | 0.018 | 11 | 0.0102 |
NOTE: Sample size n = 105,276; L = 96.