| Literature DB >> 30565280 |
Yanchun Bao1, Paul S Clarke1, Melissa Smart1, Meena Kumari1.
Abstract
The "some invalid, some valid instrumental variable estimator" (sisVIVE) is a lasso-based method for instrumental variables (IVs) regression of outcome on an exposure. In principle, sisVIVE is robust to some of the IVs in the analysis being invalid, in the sense of being related to the outcome variable through pathways not mediated by the exposure. In this paper, we consider the application of sisVIVE to a Mendelian randomization study in which multiple genetic variants are used as IVs to estimate the causal effect of body mass index on personal income in the presence of unobserved confounding. In addition to analyzing data from the large-scale longitudinal household survey Understanding Society, we conduct a simulation study to (a) assess the performance of sisVIVE in relation to that of competing robust methods like "MR-Egger" and "MR-Median" and (b) identify scenarios under which its absolute performance is poor. We find that sisVIVE outperforms alternative robust methods, in terms of mean-square error, across a wide range of scenarios, but that its performance is poor in absolute terms when the presence of indirect pleiotropy leads to failure of the "InSIDE" condition, which is not explicitly required for identification. We argue that this is because the consistency criterion for sisVIVE does not identify the true causal effect when InSIDE fails.Entities:
Keywords: MR-Egger; MR-Median; Mendelian randomization; instrumental variables; pleiotropic bias; sisVIVE
Mesh:
Year: 2018 PMID: 30565280 PMCID: PMC6492086 DOI: 10.1002/sim.8066
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Figure 1Scatterplot of SNP‐BMI association against SNP‐Income association of 71 SNPs under one‐sample and two‐sample strategies. Under one‐sample (left), both SNP‐BMI and SNP‐Income associations are from UKHLS, under two‐sample (right), SNP‐BMI associations are from GWAS study. BMI, body mass index; GWAS, genomewide association study; SNP, single‐nucleotide polymorphism; UKHLS, Understanding Society: the UK Longitudinal Household Study [Colour figure can be viewed at wileyonlinelibrary.com]
Estimates of the effect of BMI on personal income using n = 8047 cases from the UK Household Longitudinal Study
| Estimate | Std. Error | P‐value | |
|---|---|---|---|
|
| −0.032 | 0.011 | 0.002*** |
|
| |||
| SPRS | −0.112 | 0.082 | 0.173 |
| IPRS | −0.049 | 0.066 | 0.458 |
| 2SLS | −0.048 | 0.064 | 0.449 |
| LIML | −0.058 | 0.081 | 0.470 |
| Weighted MR‐Egger | −0.126 | 0.115 | 0.272 |
| Egger test | 0.003 | 0.004 | 0.411 |
| Weighted MR‐Median | −0.137 | 0.097 | 0.157 |
| sisVIVE | −0.048 | ‐ | ‐ |
| Number of invalid IVs detected by sisVIVE | 0 | ||
|
| |||
| EPRS | −0.155 | 0.076 | 0.041** |
| 2SLS | −0.154 | 0.071 | 0.030** |
| Weighted MR‐Egger | −0.311 | 0.179 | 0.082* |
| Egger test | 0.005 | 0.005 | 0.302 |
| Weighted MR‐Median | −0.292 | 0.106 | 0.006*** |
| sisVIVE | −0.141 | ‐ | ‐ |
| No. invalid IVs detected by sisVIVE | 0 |
Abbreviations: BMI, body mass index; EPRS, externally weighted polygenic risk score; IPRS, internally weighted polygenic risk score; IVs, instrumental variables; LIML, limited information maximum likelihood; MR, Mendelian randomization; sisVIVE, some invalid, some valid instrumental variable estimator; SPRS, simple polygenic risk score; 2SLS, two‐stage least squares.
Mean FSI% and FSO% of sisVIVE for 1000 simulation data sets
| One Sample | Two Sample | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| True: | Precise: | Imprecise: | |||||||
|
|
|
| |||||||
| Scenarios | No. IVs | MFSI | MFSO | MFSI | MFSO | MFSI | MFSO | MFSI | MFSO |
| Invalid | (%) | (%) | (%) | (%) | (%) | (%) | (%) | (%) | |
| All IVs are valid | 0 | ‐ | 6.9 | ‐ | 0 | ‐ | 0 | ‐ | 0.42 |
| InSIDE holds and | 10 | 30.1 | 2.9 | 29.9 | 1.7 | 30.3 | 1.8 | 30.0 | 2.1 |
| Direct Pleiotropy | 20 | 20.6 | 9.2 | 21.2 | 6.8 | 21.3 | 6.7 | 20.8 | 7.9 |
| (balanced) | 30 | 17.0 | 13.3 | 17.3 | 13.3 | 17.2 | 13.4 | 17.3 | 14.7 |
| InSIDE holds and | 10 | 36.1 | 1.9 | 30.5 | 2.4 | 31.0 | 2.2 | 31.6 | 2.2 |
| Direct Pleiotropy | 20 | 25.3 | 12.8 | 24.9 | 13.0 | 24.8 | 12.5 | 23.7 | 8.9 |
| (positive) | 30 | 21.2 | 31.9 | 23.3 | 32.3 | 23.0 | 30.4 | 21.0 | 18.6 |
| InSIDE fails and | 10 | 31.9 | 41.3 | 35.7 | 42.7 | 34.4 | 43.9 | 10.6 | 40.8 |
| Direct Pleiotropy | 20 | 39.2 | 42.1 | 32.5 | 54.2 | 32.6 | 54.2 | 23.1 | 66.3 |
| (positive) | 30 | 37.6 | 44.5 | 29.5 | 59.5 | 29.1 | 59.3 | 22.4 | 70.6 |
Abbreviations: IVs, instrumental variables; FSI, False Select In; FSO, False Select Out; MFSI, mean FSI; MFSO, mean FSO; sisVIVE, some invalid, some valid instrumental variable estimator.
Simulation results for multiple instruments when 10 SNPs are invalid, U∼N(0, 1), εXi ∼N(0, 1), εYi ∼N(0, 1), MC step = 1000, and sample size = 10,000
| (A) | |||||
|---|---|---|---|---|---|
| 10 Pleiotropic SNPs | |||||
| (Positive Direct Pleiotropy, InSIDE Holds) | |||||
| True value | −0.2 | ||||
| Mean (SD) | Mean SE | MSE | Coverage, % | Power, % | |
|
| |||||
| SPRS | 0.342 (0.145) | 0.092 | 0.315 | 9.0 | 86.4 |
| IPRS | 0.270 (0.124) | 0.150 | 0.236 | 7.0 | 42.5 |
| Weighted Egger | 0.199 (0.258) | 0.257 | 0.225 | 64.8 | 13.3 |
| Weighted Median | 0.076 (0.110) | 0.114 | 0.088 | 30.6 | 9.2 |
| sisVIVE‐SPRS | −0.106 (0.115) | 0.106 | 0.022 | 81.2 | 15.1 |
| sisVIVE‐IPRS | 0.042 (0.083) | 0.078 | 0.065 | 14.5 | 11.6 |
|
| |||||
| EPRS | 0.251 (0.136) | 0.084 | 0.222 | 2.0 | 75.4 |
| 2SLS | 0.255 (0.139) | 0.192 | 0.226 | 24.7 | 15.3 |
| Weighted Egger | −0.214 (0.483) | 0.473 | 0.233 | 93.7 | 6.4 |
| Weighted Median | −0.099 (0.154) | 0.152 | 0.034 | 90.3 | 10.6 |
| sisVIVE‐SPRS | −0.130 (0.118) | 0.108 | 0.019 | 85.3 | 20.0 |
| sisVIVE‐2SLS | −0.128 (0.109) | 0.096 | 0.017 | 84.2 | 29.5 |
|
| |||||
|
| |||||
|
| |||||
|
|
|
|
|
| |
|
| |||||
| SPRS | 0.574 (0.079) | 0.047 | 0.606 | 0 | 100 |
| IPRS | 0.938 (0.078) | 0.056 | 1.301 | 0 | 100 |
| Weighted Egger | 1.093 (0.097) | 0.061 | 1.682 | 0 | 100 |
| Weighted Median | 0.998 (0.115) | 0.065 | 1.449 | 0 | 100 |
| sisVIVE‐SPRS | 0.517 (0.145) | 0.115 | 0.535 | 2.3 | 91.5 |
| sisVIVE‐IPRS | 0.761 (0.140) | 0.058 | 0.944 | 0.1 | 99.4 |
|
| |||||
| EPRS | 0.968 (0.080) | 0.031 | 1.371 | 0 | 100 |
| 2SLS | 0.970 (0.079) | 0.057 | 1.376 | 0 | 100 |
| Weighted Egger | 1.149 (0.104) | 0.057 | 1.830 | 0 | 100 |
| Weighted Median | 1.016 (0.116) | 0.066 | 1.492 | 0 | 100 |
| sisVIVE‐SPRS | 0.550 (0.149) | 0.077 | 0.585 | 0.1 | 98.8 |
| sisVIVE‐2SLS | 0.814 (0.133) | 0.051 | 1.062 | 0 | 99.7 |
Abbreviations: EPRS, externally weighted polygenic risk score; IPRS, internally weighted polygenic risk score; SD, standard deviation; SE, standard error; sisVIVE, some invalid, some valid instrumental variable estimator; SPRS, simple polygenic risk score; 2SLS, two‐stage least squares.