Stephen Burgess, Frank Dudbridge, Simon G Thompson.
Abstract
Mendelian randomization is the use of genetic instrumental variables to obtain causal inferences from observational data. Two recent developments for combining information on multiple uncorrelated instrumental variables (IVs) into a single causal estimate are as follows: (i) allele scores, in which individual-level data on the IVs are aggregated into a univariate score, which is used as a single IV, and (ii) a summary statistic method, in which causal estimates calculated from each IV using summarized data are combined in an inverse-variance weighted meta-analysis. To avoid bias from weak instruments, unweighted and externally weighted allele scores have been recommended. Here, we propose equivalent approaches using summarized data and also provide extensions of the methods for use with correlated IVs. We investigate the impact of different choices of weights on the bias and precision of estimates in simulation studies. We show that allele score estimates can be reproduced using summarized data on genetic associations with the risk factor and the outcome. Estimates from the summary statistic method using external weights are biased towards the null when the weights are imprecisely estimated; in contrast, allele score estimates are unbiased. With equal or external weights, both methods provide appropriate tests of the null hypothesis of no causal effect even with large numbers of potentially weak instruments. We illustrate these methods using summarized data on the causal effect of low-density lipoprotein cholesterol on coronary heart disease risk. It is shown that a more precise causal estimate can be obtained using multiple genetic variants from a single gene region, even if the variants are correlated. © 2015 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.
Keywords: Mendelian randomization; aggregated data; allele score; causal inference; genetic risk score; genetic variants; instrumental variables; summarized data; weak instruments
Year: 2015 PMID: 26661904 PMCID: PMC4832315 DOI: 10.1002/sim.6835
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Summary of instrumental variable (IV) estimation methods discussed in this paper.
| Method | Equation(s) | Comments |
|---|---|---|
| Individual‐level data | | |
| Two‐stage least squares | | Commonly used method in IV analysis (Section …) |
| Allele score | | Combine IVs into a single score, and use the score as a single IV in a two‐stage least squares (or equivalently, ratio) method (Section …) |
| Summarized data, uncorrelated IVs | | |
| Allele score | | The allele score estimate obtained using individual‐level data can be approximated using summarized data (Section …) |
| Summary statistic (inverse‐variance weighted) | | The summary statistic estimate combines the estimates from each IV in an inverse‐variance weighted formula (Section …) … weighted linear regression through the origin using the precisions of the IV associations with the outcome as weights. |
| Likelihood‐based method | | The likelihood‐based method fits a model for the summarized data using either maximum likelihood or Bayesian methods for inference (Section …) |
| Summarized data, correlated IVs | | |
| Allele score | | The allele score estimate with summarized data is not affected by correlation between the IVs, although the estimate's precision is altered (Section …) |
| Summary statistic (inverse‐variance weighted) | | With correlated variants, the summary statistic formula can be used to test for a causal effect (although the standard error of the expression must be modified, Section …) … causal effect. |
| Weighted generalized linear regression | | With correlated variants, a weighting matrix can be obtained using the standard errors of the IV associations with the outcome and the correlations between the variants. The coefficient from weighted generalized linear regression using this weighting matrix provides an estimate of the causal effect (Section …) |
| Likelihood‐based method | | Correlation between summarized estimates can be incorporated into the likelihood model for the summarized data (Appendix A.3). |
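The table above describes the summary statistic estimate both as an inverse‐variance weighted formula and as a weighted linear regression through the origin. The following is a minimal sketch of that equivalence, written with the commonly used form of the inverse‐variance weighted estimator because the paper's own equations did not survive in this record. The function names and the four hypothetical variants are assumptions for illustration only. The summarized‐data allele score is written here as a precision‐weighted ratio, which is likewise an assumption rather than a quotation of the paper's equation.

```python
import numpy as np

def ivw_estimate(bx, by, se_by):
    """Inverse-variance weighted estimate and its fixed-effect standard error."""
    bx, by, se_by = (np.asarray(a, dtype=float) for a in (bx, by, se_by))
    precision = 1.0 / se_by**2
    denom = np.sum(bx**2 * precision)
    return np.sum(bx * by * precision) / denom, np.sqrt(1.0 / denom)

def wls_through_origin(bx, by, se_by):
    """Regress by on bx with no intercept, weighting by 1 / se_by**2."""
    bx, by, se_by = (np.asarray(a, dtype=float) for a in (bx, by, se_by))
    root_w = 1.0 / se_by  # square root of the regression weights
    coef, *_ = np.linalg.lstsq((bx * root_w)[:, None], by * root_w, rcond=None)
    return coef[0]

def allele_score_summarized(w, bx, by, se_by):
    """Assumed form: precision-weighted ratio of score-outcome to score-exposure."""
    w, bx, by, se_by = (np.asarray(a, dtype=float) for a in (w, bx, by, se_by))
    precision = 1.0 / se_by**2
    return np.sum(w * by * precision) / np.sum(w * bx * precision)

# Hypothetical summarized data for four uncorrelated variants (not from the paper).
bx = [0.08, 0.05, 0.11, 0.03]        # associations with the risk factor
by = [0.020, 0.008, 0.025, 0.004]    # associations with the outcome
se_by = [0.010, 0.012, 0.009, 0.015] # standard errors of by

ivw, ivw_se = ivw_estimate(bx, by, se_by)
wls = wls_through_origin(bx, by, se_by)
score_equal = allele_score_summarized(np.ones(4), bx, by, se_by)
score_as_ivw = allele_score_summarized(bx, bx, by, se_by)
print(ivw, ivw_se, wls, score_equal, score_as_ivw)
```

Under these assumptions, setting the allele score weights equal to the risk factor associations recovers the inverse‐variance weighted estimate, which is one way to see how the two summarized‐data approaches relate.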
Comparison of allele score methods for uncorrelated IVs.
| α | R² | F | OLS | Crudely weighted | Equally weighted | Externally weighted |
|---|---|---|---|---|---|---|
| Using individual‐level data | | | | | | |
| Positive confounding | | | | | | |
| 0.05 | 0.010 | 3.3 | 0.697 | 0.346 (0.136) | 0.198 (0.178) | 0.199 (0.205) |
| 0.10 | 0.030 | 10.2 | 0.687 | 0.246 (0.080) | 0.199 (0.089) | 0.198 (0.090) |
| 0.20 | 0.102 | 37.9 | 0.650 | 0.212 (0.042) | 0.199 (0.044) | 0.199 (0.043) |
| Negative confounding | | | | | | |
| 0.05 | 0.010 | 3.3 | −0.297 | 0.052 (0.135) | 0.201 (0.178) | 0.200 (0.205) |
| 0.10 | 0.030 | 10.2 | −0.287 | 0.151 (0.080) | 0.198 (0.089) | 0.198 (0.090) |
| 0.20 | 0.102 | 37.9 | −0.250 | 0.186 (0.042) | 0.199 (0.044) | 0.199 (0.043) |
| Using summarized data | | | | | | |
| Positive confounding | | | | | | |
| 0.05 | 0.010 | 3.3 | 0.697 | 0.346 (0.171) | 0.198 (0.204) | 0.199 (0.235) |
| 0.10 | 0.030 | 10.2 | 0.687 | 0.246 (0.093) | 0.199 (0.101) | 0.198 (0.102) |
| 0.20 | 0.102 | 37.9 | 0.650 | 0.212 (0.048) | 0.199 (0.050) | 0.199 (0.049) |
| Negative confounding | | | | | | |
| 0.05 | 0.010 | 3.3 | −0.297 | 0.052 (0.133) | 0.201 (0.168) | 0.200 (0.194) |
| 0.10 | 0.030 | 10.2 | −0.287 | 0.151 (0.076) | 0.198 (0.083) | 0.198 (0.084) |
| 0.20 | 0.102 | 37.9 | −0.250 | 0.186 (0.039) | 0.199 (0.042) | 0.199 (0.041) |
Median estimates over 10 000 simulations of β = 0.2 (median standard errors) from simulation study with 15 uncorrelated instrumental variables (IVs) varying direction of confounding (β ) as shown by median observational estimate (OLS) and average strength of IV (α; strength is also expressed by the mean values of the R² and F statistics), using allele score methods with crude weights (derived from the data under analysis), equal weights (unweighted analysis) and external weights (equivalent to estimates derived from an independent sample of equal size to the data under analysis), calculated from individual‐level and summarized data.
Comparison of allele score methods for uncorrelated instrumental variables (IVs).
| α | Crudely weighted | Equally weighted | Externally weighted |
|---|---|---|---|
| Using individual‐level data | | | |
| Positive confounding | | | |
| 0.05 | 0.342 (0.140) | 0.179 (0.193) | 0.172 (0.235) |
| 0.10 | 0.244 (0.081) | 0.195 (0.091) | 0.194 (0.093) |
| 0.20 | 0.211 (0.042) | 0.199 (0.045) | 0.198 (0.044) |
| Negative confounding | | | |
| 0.05 | 0.056 (0.139) | 0.219 (0.193) | 0.230 (0.242) |
| 0.10 | 0.154 (0.081) | 0.203 (0.091) | 0.203 (0.092) |
| 0.20 | 0.187 (0.042) | 0.200 (0.045) | 0.200 (0.044) |
| Using summarized data | | | |
| Positive confounding | | | |
| 0.05 | 0.342 (0.174) | 0.179 (0.214) | 0.172 (0.258) |
| 0.10 | 0.244 (0.095) | 0.195 (0.103) | 0.194 (0.105) |
| 0.20 | 0.211 (0.049) | 0.199 (0.051) | 0.198 (0.050) |
| Negative confounding | | | |
| 0.05 | 0.056 (0.136) | 0.219 (0.179) | 0.230 (0.223) |
| 0.10 | 0.154 (0.077) | 0.203 (0.085) | 0.203 (0.086) |
| 0.20 | 0.187 (0.040) | 0.200 (0.042) | 0.200 (0.041) |
Mean estimates (mean standard errors) over 10000 simulations of β =0.2 from simulation study with 15 uncorrelated IVs varying direction of confounding (β ) and average strength of IV (α), using allele score methods with crude weights (derived from the data under analysis), equal weights (unweighted analysis) and external weights (equivalent to estimates derived from an independent sample of equal size to the data under analysis), calculated from individual‐level and summarized data.
Further comparison of allele score methods for uncorrelated instrumental variables (IVs).
| α | R² | F | Crudely weighted, individual levelᵃ | Crudely weighted, summarized data | Equally weighted, individual level | Equally weighted, summarized data | Externally weighted, individual level | Externally weighted, summarized data |
|---|---|---|---|---|---|---|---|---|
| K = 5 | | | | | | | | |
| Positive confounding | | | | | | | | |
| 0.05 | 0.003 | 3.3 | 0.340 (0.245) | 0.340 (0.305) | 0.200 (0.314) | 0.200 (0.362) | 0.208 (0.357) | 0.208 (0.415) |
| 0.10 | 0.010 | 10.1 | 0.244 (0.141) | 0.244 (0.165) | 0.201 (0.154) | 0.201 (0.177) | 0.198 (0.156) | 0.198 (0.179) |
| 0.20 | 0.036 | 37.6 | 0.211 (0.073) | 0.211 (0.084) | 0.198 (0.077) | 0.199 (0.088) | 0.199 (0.075) | 0.199 (0.086) |
| Negative confounding | | | | | | | | |
| 0.05 | 0.003 | 3.3 | 0.065 (0.244) | 0.066 (0.241) | 0.205 (0.315) | 0.205 (0.298) | 0.201 (0.359) | 0.201 (0.341) |
| 0.10 | 0.010 | 10.1 | 0.159 (0.141) | 0.159 (0.134) | 0.200 (0.155) | 0.201 (0.145) | 0.199 (0.157) | 0.199 (0.147) |
| 0.20 | 0.036 | 37.6 | 0.188 (0.073) | 0.188 (0.069) | 0.200 (0.077) | 0.200 (0.072) | 0.199 (0.076) | 0.199 (0.071) |
| K = 15 | | | | | | | | |
| Positive confounding | | | | | | | | |
| 0.05 | 0.010 | 3.3 | 0.346 (0.136) | 0.346 (0.171) | 0.198 (0.180) | 0.198 (0.204) | 0.199 (0.205) | 0.199 (0.235) |
| 0.10 | 0.030 | 10.2 | 0.246 (0.080) | 0.246 (0.093) | 0.199 (0.089) | 0.199 (0.101) | 0.198 (0.090) | 0.198 (0.102) |
| 0.20 | 0.102 | 37.9 | 0.212 (0.042) | 0.212 (0.048) | 0.199 (0.044) | 0.199 (0.050) | 0.199 (0.043) | 0.199 (0.049) |
| Negative confounding | | | | | | | | |
| 0.05 | 0.010 | 3.3 | 0.052 (0.135) | 0.052 (0.133) | 0.201 (0.178) | 0.201 (0.168) | 0.200 (0.205) | 0.200 (0.194) |
| 0.10 | 0.030 | 10.2 | 0.151 (0.080) | 0.151 (0.076) | 0.198 (0.089) | 0.198 (0.083) | 0.198 (0.090) | 0.198 (0.084) |
| 0.20 | 0.102 | 37.9 | 0.186 (0.042) | 0.186 (0.039) | 0.199 (0.044) | 0.199 (0.042) | 0.199 (0.043) | 0.199 (0.041) |
| K = 25 | | | | | | | | |
| Positive confounding | | | | | | | | |
| 0.05 | 0.016 | 3.3 | 0.348 (0.104) | 0.348 (0.131) | 0.199 (0.138) | 0.199 (0.157) | 0.198 (0.159) | 0.198 (0.181) |
| 0.10 | 0.048 | 10.2 | 0.247 (0.061) | 0.247 (0.072) | 0.199 (0.069) | 0.199 (0.078) | 0.199 (0.069) | 0.199 (0.079) |
| 0.20 | 0.158 | 37.6 | 0.213 (0.033) | 0.213 (0.037) | 0.200 (0.034) | 0.200 (0.039) | 0.200 (0.033) | 0.200 (0.038) |
| Negative confounding | | | | | | | | |
| 0.05 | 0.016 | 3.3 | 0.051 (0.104) | 0.051 (0.102) | 0.201 (0.137) | 0.201 (0.129) | 0.200 (0.158) | 0.200 (0.148) |
| 0.10 | 0.048 | 10.2 | 0.152 (0.062) | 0.153 (0.058) | 0.200 (0.069) | 0.200 (0.064) | 0.200 (0.070) | 0.200 (0.065) |
| 0.20 | 0.158 | 37.6 | 0.187 (0.032) | 0.188 (0.030) | 0.200 (0.034) | 0.201 (0.032) | 0.200 (0.034) | 0.200 (0.031) |
Median estimates over 10000 simulations of β = 0.2 (median standard errors) from simulation study with uncorrelated IVs varying the number of IVs (K = 5, 15, 25), direction of confounding (β ) and average strength of IV (α; strength is also expressed by the mean values of the R² and F statistics) using allele score methods with crude weights (derived from the data under analysis), equal weights (unweighted analysis) and external weights (equivalent to estimates derived from an independent sample of equal size to the data under analysis) calculated from individual‐level and summarized data.
a Median estimates and standard errors calculated from individual‐level data using a crudely weighted allele score were equal to those from a (multivariable) two‐stage least squares method to at least three decimal places in almost all simulated datasets.
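Footnote a states that the crudely weighted allele score and multivariable two‐stage least squares agree closely. The sketch below illustrates why the point estimates coincide when the crude weights are the first‐stage regression coefficients: the score then equals the first‐stage fitted values up to a constant. The simulated data, sample size, effect sizes and variable names are hypothetical and are not the paper's simulation design, and the sketch does not address standard errors.

```python
import numpy as np

rng = np.random.default_rng(1)
n, k = 2000, 5

# Hypothetical individual-level data: k independent variants, one confounder u.
g = rng.binomial(2, 0.3, size=(n, k)).astype(float)
u = rng.normal(size=n)
x = g @ np.full(k, 0.1) + u + rng.normal(size=n)
y = 0.2 * x + u + rng.normal(size=n)

# First stage: regress the risk factor on all variants (with intercept).
G = np.column_stack([np.ones(n), g])
first_stage, *_ = np.linalg.lstsq(G, x, rcond=None)

# Crudely weighted allele score: weights taken from the data under analysis.
score = g @ first_stage[1:]

# Ratio estimate using the score as a single IV.
ratio_est = np.cov(score, y)[0, 1] / np.cov(score, x)[0, 1]

# Two-stage least squares: regress the outcome on the first-stage fitted values.
x_hat = G @ first_stage
second_stage, *_ = np.linalg.lstsq(np.column_stack([np.ones(n), x_hat]), y, rcond=None)
tsls_est = second_stage[1]

print(ratio_est, tsls_est)
```

Because the crude weights are estimated in the same data, this estimate inherits the weak instrument bias of two‐stage least squares, which is consistent with the crudely weighted columns in the table above.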
Comparison of summarized data methods for uncorrelated IVs.
| α | Imprecise external weights (from 5000 individuals): Median | Imprecise: SD | Imprecise: SE | Imprecise: Coverage | Imprecise: Power | Precise external weights (from 50000 individuals): Median | Precise: SD | Precise: SE | Precise: Coverage | Precise: Power | Oracle weights: Median | Oracle: SD | Oracle: SE | Oracle: Coverage | Oracle: Power |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Allele score method using summarized data | | | | | | | | | | | | | | | |
| Positive confounding | | | | | | | | | | | | | | | |
| 0.05 | 0.200 | 0.239 | 0.219 | 97.3 | 14.8 | 0.198 | 0.192 | 0.201 | 97.0 | 17.6 | 0.198 | 0.186 | 0.197 | 97.1 | 18.0 |
| 0.10 | 0.200 | 0.093 | 0.102 | 97.1 | 52.1 | 0.201 | 0.089 | 0.098 | 97.0 | 55.6 | 0.201 | 0.088 | 0.098 | 97.0 | 56.0 |
| 0.20 | 0.199 | 0.044 | 0.049 | 97.2 | 97.8 | 0.199 | 0.043 | 0.049 | 97.2 | 98.0 | 0.199 | 0.043 | 0.049 | 97.1 | 98.1 |
| Negative confounding | | | | | | | | | | | | | | | |
| 0.05 | 0.201 | 0.232 | 0.186 | 93.1 | 20.6 | 0.201 | 0.196 | 0.165 | 92.5 | 24.9 | 0.201 | 0.191 | 0.161 | 92.5 | 25.7 |
| 0.10 | 0.200 | 0.092 | 0.084 | 92.7 | 68.1 | 0.200 | 0.088 | 0.081 | 93.3 | 71.2 | 0.199 | 0.087 | 0.080 | 93.4 | 71.7 |
| 0.20 | 0.200 | 0.044 | 0.041 | 93.0 | 99.7 | 0.200 | 0.043 | 0.040 | 92.8 | 99.8 | 0.200 | 0.043 | 0.040 | 92.8 | 99.8 |
| Summary statistic methodᵃ | | | | | | | | | | | | | | | |
| Positive confounding | | | | | | | | | | | | | | | |
| 0.05 | 0.148 | 0.168 | 0.162 | 93.1 | 14.8 | 0.191 | 0.191 | 0.187 | 94.8 | 17.6 | 0.199 | 0.193 | 0.190 | 95.0 | 18.0 |
| 0.10 | 0.183 | 0.094 | 0.091 | 94.2 | 52.1 | 0.199 | 0.096 | 0.095 | 94.8 | 55.6 | 0.201 | 0.096 | 0.095 | 95.0 | 56.0 |
| 0.20 | 0.194 | 0.048 | 0.047 | 94.9 | 97.8 | 0.198 | 0.048 | 0.047 | 95.0 | 98.0 | 0.199 | 0.048 | 0.047 | 94.9 | 98.1 |
| Negative confounding | | | | | | | | | | | | | | | |
| 0.05 | 0.147 | 0.139 | 0.133 | 93.0 | 20.6 | 0.194 | 0.158 | 0.154 | 94.7 | 24.9 | 0.201 | 0.161 | 0.157 | 94.8 | 25.7 |
| 0.10 | 0.181 | 0.077 | 0.075 | 94.0 | 68.1 | 0.198 | 0.078 | 0.078 | 95.3 | 71.2 | 0.200 | 0.079 | 0.078 | 95.3 | 71.7 |
| 0.20 | 0.196 | 0.040 | 0.039 | 94.3 | 99.7 | 0.200 | 0.040 | 0.039 | 94.9 | 99.8 | 0.200 | 0.040 | 0.039 | 95.0 | 99.8 |
| Likelihood‐based methodᵃ | | | | | | | | | | | | | | | |
| Positive confounding | | | | | | | | | | | | | | | |
| 0.05 | 0.203 | 0.242 | 0.195 | 94.0 | 18.3 | 0.194 | 0.182 | 0.191 | 97.3 | 17.0 | | | | | |
| 0.10 | 0.201 | 0.104 | 0.097 | 94.4 | 54.3 | 0.200 | 0.096 | 0.095 | 95.0 | 55.6 | | | | | |
| 0.20 | 0.197 | 0.050 | 0.048 | 94.5 | 97.7 | 0.198 | 0.052 | 0.048 | 93.4 | 97.8 | | | | | |
| Negative confounding | | | | | | | | | | | | | | | |
| 0.05 | 0.204 | 0.201 | 0.161 | 94.3 | 25.4 | 0.198 | 0.153 | 0.157 | 97.1 | 24.4 | | | | | |
| 0.10 | 0.198 | 0.086 | 0.080 | 94.5 | 70.1 | 0.199 | 0.079 | 0.079 | 95.2 | 71.3 | | | | | |
| 0.20 | 0.199 | 0.042 | 0.040 | 94.3 | 99.7 | 0.200 | 0.046 | 0.039 | 92.0 | 99.7 | | | | | |
Median estimates over 10000 simulations of β =0.2, standard deviation (SD) of estimates, median standard error (SE) of estimates, coverage (%) of nominal 95% confidence interval for the causal parameter and empirical power (%) based on nominal 95% confidence interval to detect a causal effect from simulation study with 15 uncorrelated instrumental variables (IVs) varying the direction of confounding (β ) and average strength of IV (α) using three summarized data methods: allele score, summary statistic and likelihood‐based methods, with weights taken from an external source corresponding to an independent sample of size 5000, 50000 and using the true (oracle) weights.
a The ‘weights’ for the summary statistic and likelihood‐based methods are used as the association estimates in equations (4) and (6). In the allele score method, the weights (w) and the association estimates in the data under analysis are both used to provide the causal estimate.
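The table above shows the summary statistic estimate shrinking towards the null as the external weights become less precise, while the allele score estimate does not. A small simulation can illustrate the mechanism, assuming the commonly used inverse‐variance weighted formula: noisy weights enter the denominator squared, which inflates it. All numerical settings below are hypothetical and chosen only for illustration, and the outcome and risk factor associations in the data under analysis are taken as noise‐free to isolate the effect of imprecise weights.

```python
import numpy as np

rng = np.random.default_rng(0)
k, beta = 15, 0.2
reps = 10000

bx_true = np.full(k, 0.05)  # true variant-risk factor associations (hypothetical)
se_w = 0.03                 # standard error of the external weights (hypothetical)
se_by = np.full(k, 0.02)    # standard errors of outcome associations (hypothetical)
by = beta * bx_true         # noise-free outcome associations, by assumption
precision = 1.0 / se_by**2

# Imprecise external weights: true associations plus independent noise.
w = bx_true + rng.normal(scale=se_w, size=(reps, k))

# Summary statistic method: the weights stand in for the association estimates.
ivw = (w * by * precision).sum(axis=1) / (w**2 * precision).sum(axis=1)

# Allele score (assumed precision-weighted ratio form): weights only define the
# score; the risk factor associations come from the data under analysis.
score = (w * by * precision).sum(axis=1) / (w * bx_true * precision).sum(axis=1)

print(np.median(ivw), np.median(score))
```

In this simplified setting the allele score ratio returns the true effect for every draw of the weights, whereas the median summary statistic estimate falls below it.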
Comparison of summarized data methods for uncorrelated instrumental variables (IVs).
| α | Imprecise weights | Precise weights | Oracle weights |
|---|---|---|---|
| Allele score method using summarized data | | | |
| Positive confounding | | | |
| 0.05 | 0.175 (0.236) | 0.182 (0.204) | 0.183 (0.199) |
| 0.10 | 0.197 (0.102) | 0.197 (0.097) | 0.197 (0.097) |
| 0.20 | 0.198 (0.049) | 0.198 (0.048) | 0.198 (0.048) |
| Negative confounding | | | |
| 0.05 | 0.226 (0.194) | 0.219 (0.168) | 0.218 (0.164) |
| 0.10 | 0.205 (0.084) | 0.204 (0.080) | 0.204 (0.080) |
| 0.20 | 0.201 (0.040) | 0.201 (0.040) | 0.201 (0.040) |
| Summary statistic method | | | |
| Positive confounding | | | |
| 0.05 | 0.149 (0.166) | 0.193 (0.189) | 0.200 (0.192) |
| 0.10 | 0.184 (0.092) | 0.199 (0.096) | 0.201 (0.096) |
| 0.20 | 0.194 (0.047) | 0.199 (0.048) | 0.199 (0.048) |
| Negative confounding | | | |
| 0.05 | 0.150 (0.136) | 0.193 (0.155) | 0.201 (0.158) |
| 0.10 | 0.183 (0.076) | 0.199 (0.079) | 0.198 (0.079) |
| 0.20 | 0.195 (0.039) | 0.200 (0.040) | 0.198 (0.040) |
| Likelihood‐based method | | | |
| Positive confounding | | | |
| 0.05 | 0.214 (0.212) | 0.200 (0.193) | |
| 0.10 | 0.202 (0.099) | 0.200 (0.096) | |
| 0.20 | 0.198 (0.049) | 0.200 (0.048) | |
| Negative confounding | | | |
| 0.05 | 0.213 (0.176) | 0.199 (0.159) | |
| 0.10 | 0.201 (0.082) | 0.200 (0.079) | |
| 0.20 | 0.199 (0.041) | 0.202 (0.040) | |
Mean estimates (mean standard errors) over 10000 simulations of β =0.2 from simulation study with 15 uncorrelated IVs varying the direction of confounding (β ) and average strength of IV (α) using three summarized data methods: allele score, summary statistic and likelihood‐based methods, with weights taken from an external source corresponding to an independent sample of size 5000, 50000 and using the true (oracle) weights.
Investigation into bias of summary statistic estimator.
| α | β = 0.2: Median | β = 0.2: Coverage | β = 0.2: Power | β = 0: Median | β = 0: Coverage | β = 0: Power |
|---|---|---|---|---|---|---|
| Crude weights | | | | | | |
| Positive confounding | | | | | | |
| 0.05 | 0.348 | 88.2 | 58.4 | 0.146 | 82.8 | 17.2 |
| 0.10 | 0.248 | 94.5 | 78.2 | 0.047 | 91.7 | 8.3 |
| 0.20 | 0.213 | 96.6 | 99.3 | 0.012 | 94.0 | 6.0 |
| Negative confounding | | | | | | |
| 0.05 | 0.056 | 77.9 | 7.3 | −0.149 | 83.0 | 17.0 |
| 0.10 | 0.152 | 88.3 | 53.0 | −0.047 | 91.9 | 8.1 |
| 0.20 | 0.188 | 91.8 | 99.5 | −0.013 | 94.2 | 5.8 |
| External weights | | | | | | |
| Positive confounding | | | | | | |
| 0.05 | 0.144 | 93.7 | 15.5 | −0.001 | 95.2 | 4.8 |
| 0.10 | 0.183 | 94.2 | 52.4 | 0.000 | 95.1 | 4.9 |
| 0.20 | 0.196 | 94.4 | 97.9 | 0.000 | 94.9 | 5.1 |
| Negative confounding | | | | | | |
| 0.05 | 0.150 | 92.8 | 19.8 | 0.002 | 94.8 | 5.2 |
| 0.10 | 0.182 | 93.7 | 67.7 | 0.000 | 95.2 | 4.8 |
| 0.20 | 0.196 | 94.1 | 99.7 | 0.000 | 95.2 | 4.8 |
Median estimates over 10000 simulations from summary statistic method with causal effect β =0.2 and β =0, coverage (%) of nominal 95% confidence interval for the causal parameter and empirical power (%) based on nominal 95% confidence interval to detect a causal effect from simulation study with 15 uncorrelated instrumental variables (IVs) varying the direction of confounding (β ) and average strength of IV (α) with crude weights and with external weights corresponding to an independent sample of size 5000.
Comparison of allele score methods for correlated instrumental variables (IVs).
| α | R² | F | Allele score using individual‐level data | Allele score using summarized data |
|---|---|---|---|---|
| Positive causal effect: | | | | |
| Positive confounding | | | | |
| 0.05 | 0.019 | 6.3 | 0.201 (0.116) [43.3] | 0.201 (0.129) [34.9] |
| 0.10 | 0.062 | 22.2 | 0.201 (0.058) [87.2] | 0.201 (0.065) [84.2] |
| 0.20 | 0.201 | 85.8 | 0.200 (0.029) [99.9] | 0.200 (0.033) [99.8] |
| Negative confounding | | | | |
| 0.05 | 0.019 | 6.3 | 0.202 (0.117) [39.4] | 0.202 (0.107) [48.0] |
| 0.10 | 0.062 | 22.2 | 0.199 (0.058) [91.8] | 0.199 (0.053) [93.2] |
| 0.20 | 0.201 | 85.8 | 0.200 (0.029) [100.0] | 0.200 (0.027) [100.0] |
| Null causal effect: | | | | |
| Positive confounding | | | | |
| 0.05 | 0.019 | 6.3 | 0.001 (0.116) [4.4] | 0.001 (0.116) [4.8] |
| 0.10 | 0.062 | 22.2 | 0.000 (0.058) [4.7] | 0.000 (0.058) [4.8] |
| 0.20 | 0.201 | 85.8 | 0.000 (0.029) [5.1] | 0.000 (0.029) [5.0] |
| Negative confounding | | | | |
| 0.05 | 0.019 | 6.3 | −0.002 (0.116) [4.3] | −0.002 (0.117) [4.8] |
| 0.10 | 0.062 | 22.2 | 0.000 (0.058) [5.0] | −0.001 (0.058) [5.0] |
| 0.20 | 0.201 | 85.8 | 0.000 (0.029) [5.1] | 0.000 (0.029) [5.2] |
Median estimates over 10000 simulations of β = 0.2 or β = 0 (median standard errors) [power (%) based on nominal 95% confidence interval] from simulation study with 15 correlated IVs varying direction of confounding (β ) and average strength of IV (α; strength is also expressed by the mean values of the R² and F statistics) using allele score methods calculated from individual‐level and summarized data, with equal weights.
Comparison of allele score methods for correlated instrumental variables (IVs).
| α | Allele score using individual‐level data | Allele score using summarized data |
|---|---|---|
| Positive causal effect: | | |
| Positive confounding | | |
| 0.05 | 0.193 (0.123) | 0.193 (0.135) |
| 0.10 | 0.199 (0.060) | 0.199 (0.067) |
| 0.20 | 0.200 (0.030) | 0.200 (0.034) |
| Negative confounding | | |
| 0.05 | 0.211 (0.124) | 0.211 (0.112) |
| 0.10 | 0.201 (0.060) | 0.201 (0.055) |
| 0.20 | 0.200 (0.030) | 0.201 (0.028) |
| Null causal effect: | | |
| Positive confounding | | |
| 0.05 | −0.007 (0.123) | −0.007 (0.121) |
| 0.10 | −0.002 (0.060) | −0.002 (0.060) |
| 0.20 | −0.001 (0.030) | −0.001 (0.030) |
| Negative confounding | | |
| 0.05 | 0.007 (0.123) | 0.007 (0.121) |
| 0.10 | 0.003 (0.060) | 0.003 (0.060) |
| 0.20 | 0.001 (0.030) | 0.001 (0.030) |
Mean estimates (mean standard errors) over 10000 simulations of β =0.2 or β =0 from simulation study with 15 correlated IVs varying direction of confounding (β ) and average strength of IV (α) using allele score methods calculated from individual‐level and summarized data, with equal weights.
Comparison of summarized data methods for correlated instrumental variables (IVs).
| α | Allele score using individual‐level data | Allele score using summarized data | Weighted generalized linear regression | Likelihood‐based method |
|---|---|---|---|---|
| Positive causal effect: | | | | |
| Positive confounding | | | | |
| 0.05 | 0.201 (0.120) [41.5] | 0.201 (0.134) [33.2] | 0.147 (0.109) [27.6] | 0.197 (0.131) [33.2] |
| 0.10 | 0.201 (0.059) [85.6] | 0.201 (0.066) [82.6] | 0.184 (0.061) [81.9] | 0.198 (0.066) [83.0] |
| 0.20 | 0.200 (0.030) [99.8] | 0.200 (0.033) [99.8] | 0.195 (0.032) [99.8] | 0.190 (0.032) [99.8] |
| Negative confounding | | | | |
| 0.05 | 0.202 (0.121) [36.7] | 0.202 (0.111) [45.4] | 0.147 (0.090) [38.6] | 0.201 (0.109) [45.1] |
| 0.10 | 0.199 (0.060) [90.2] | 0.199 (0.055) [91.6] | 0.182 (0.051) [91.3] | 0.194 (0.054) [92.0] |
| 0.20 | 0.200 (0.030) [100.0] | 0.200 (0.027) [100.0] | 0.196 (0.026) [100.0] | 0.183 (0.026) [100.0] |
| Null causal effect: | | | | |
| Positive confounding | | | | |
| 0.05 | 0.002 (0.120) [4.5] | 0.002 (0.120) [5.0] | 0.002 (0.098) [4.9] | 0.003 (0.114) [6.7] |
| 0.10 | 0.000 (0.060) [4.6] | 0.000 (0.060) [4.8] | 0.000 (0.055) [4.9] | 0.000 (0.058) [5.1] |
| 0.20 | −0.001 (0.030) [5.0] | −0.001 (0.030) [5.0] | 0.000 (0.029) [4.9] | 0.000 (0.029) [4.8] |
| Negative confounding | | | | |
| 0.05 | 0.000 (0.120) [4.5] | 0.000 (0.120) [5.0] | −0.001 (0.098) [4.6] | −0.001 (0.114) [6.4] |
| 0.10 | 0.000 (0.060) [4.6] | 0.000 (0.060) [4.8] | 0.000 (0.055) [4.8] | 0.000 (0.058) [4.9] |
| 0.20 | 0.000 (0.030) [5.1] | 0.000 (0.030) [5.3] | 0.000 (0.028) [5.2] | 0.000 (0.029) [5.0] |
Median estimates over 10000 simulations of β =0.2 or β =0 (median standard errors) [power (%) based on nominal 95% confidence interval] from simulation study with 15 correlated IVs varying direction of confounding (β ) and average strength of IV (α) using allele score method calculated from individual‐level data and allele score, weighted generalized linear regression and likelihood‐based methods all calculated from summarized data, with external (N = 5000) weights.
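The weighted generalized linear regression method in the table above builds a weighting matrix from the standard errors of the IV associations with the outcome and the correlations between variants. One reading of that description is generalized least squares through the origin, sketched below. The function name, the three hypothetical variants and the correlation matrix are assumptions for illustration and are not taken from the paper.

```python
import numpy as np

def weighted_glr_estimate(bx, by, se_by, rho):
    """Generalized least squares through the origin with Omega_ij = se_i * se_j * rho_ij."""
    bx, by, se_by = (np.asarray(a, dtype=float) for a in (bx, by, se_by))
    rho = np.asarray(rho, dtype=float)
    omega = np.outer(se_by, se_by) * rho        # weighting (covariance) matrix
    omega_inv_bx = np.linalg.solve(omega, bx)   # Omega^{-1} bx without explicit inverse
    denom = bx @ omega_inv_bx
    estimate = (omega_inv_bx @ by) / denom
    return estimate, np.sqrt(1.0 / denom)

# Hypothetical summarized data for three correlated variants.
bx = np.array([0.08, 0.05, 0.11])
by = np.array([0.020, 0.008, 0.025])
se_by = np.array([0.010, 0.012, 0.009])
rho = np.array([[1.0, 0.4, 0.2],
                [0.4, 1.0, 0.3],
                [0.2, 0.3, 1.0]])

est_corr, se_corr = weighted_glr_estimate(bx, by, se_by, rho)
est_indep, se_indep = weighted_glr_estimate(bx, by, se_by, np.eye(3))

# With an identity correlation matrix the estimate reduces to the
# inverse-variance weighted formula for uncorrelated variants.
ivw = np.sum(bx * by / se_by**2) / np.sum(bx**2 / se_by**2)
print(est_corr, se_corr, est_indep, ivw)
```

Solving the linear system rather than inverting the matrix is a numerical convenience; with strongly correlated variants the matrix can be close to singular, so the result should be checked for stability.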
Comparison of summarized data methods for correlated instrumental variables (IVs).
| α | Allele score using individual‐level data | Allele score using summarized data | Weighted generalized linear regression | Likelihood‐based method |
|---|---|---|---|---|
| Positive causal effect: | | | | |
| Positive confounding | | | | |
| 0.05 | 0.192 (0.130) | 0.192 (0.143) | 0.146 (0.112) | 0.202 (0.144) |
| 0.10 | 0.199 (0.061) | 0.199 (0.068) | 0.184 (0.063) | 0.197 (0.068) |
| 0.20 | 0.200 (0.031) | 0.200 (0.034) | 0.196 (0.033) | 0.192 (0.033) |
| Negative confounding | | | | |
| 0.05 | 0.212 (0.131) | 0.212 (0.118) | 0.148 (0.093) | 0.206 (0.122) |
| 0.10 | 0.201 (0.062) | 0.201 (0.056) | 0.182 (0.052) | 0.196 (0.056) |
| 0.20 | 0.201 (0.031) | 0.201 (0.028) | 0.196 (0.027) | 0.185 (0.026) |
| Null causal effect: | | | | |
| Positive confounding | | | | |
| 0.05 | −0.007 (0.131) | −0.007 (0.128) | 0.002 (0.101) | 0.009 (0.125) |
| 0.10 | −0.002 (0.062) | −0.002 (0.062) | 0.000 (0.057) | 0.002 (0.060) |
| 0.20 | −0.001 (0.031) | −0.001 (0.030) | −0.001 (0.029) | 0.000 (0.030) |
| Negative confounding | | | | |
| 0.05 | 0.008 (0.131) | 0.008 (0.128) | −0.001 (0.101) | 0.005 (0.126) |
| 0.10 | 0.003 (0.062) | 0.003 (0.062) | 0.001 (0.057) | 0.003 (0.060) |
| 0.20 | 0.001 (0.030) | 0.001 (0.030) | 0.000 (0.029) | 0.001 (0.029) |
Mean estimates (mean standard errors) over 10000 simulations of β = 0.2 or β = 0 from simulation study with 15 correlated IVs varying direction of confounding (β ) and average strength of IV (α), using allele score method calculated from individual‐level data, and allele score, weighted generalized linear regression and likelihood‐based methods all calculated from summarized data, with external (N = 5000) weights.
Further comparison of summarized data methods with correlated variants.
| α | Allele score using individual‐level data | Allele score using summarized data | Weighted generalized linear regression | Likelihood‐based method |
|---|---|---|---|---|
| Positive causal effect: | | | | |
| Positive confounding | | | | |
| 0.05 | 0.225 (0.112) [52.8] | 0.225 (0.126) [43.7] | 0.280 (0.117) [66.9] | 0.308 (0.128) [69.3] |
| 0.10 | 0.208 (0.058) [89.2] | 0.208 (0.065) [86.5] | 0.224 (0.063) [93.0] | 0.228 (0.064) [92.9] |
| 0.20 | 0.201 (0.030) [99.9] | 0.201 (0.033) [99.9] | 0.206 (0.032) [100.0] | 0.200 (0.032) [99.9] |
| Negative confounding | | | | |
| 0.05 | 0.175 (0.113) [33.8] | 0.175 (0.104) [41.0] | 0.122 (0.097) [26.9] | 0.146 (0.108) [30.5] |
| 0.10 | 0.192 (0.058) [89.3] | 0.192 (0.054) [90.7] | 0.177 (0.052) [88.4] | 0.184 (0.054) [92.0] |
| 0.20 | 0.198 (0.030) [100.0] | 0.199 (0.027) [100.0] | 0.195 (0.026) [100.0] | 0.184 (0.026) [100.0] |
| Null causal effect: | | | | |
| Positive confounding | | | | |
| 0.05 | 0.024 (0.112) [6.1] | 0.024 (0.114) [5.3] | 0.079 (0.106) [11.6] | 0.090 (0.115) [13.4] |
| 0.10 | 0.006 (0.059) [5.4] | 0.006 (0.059) [4.9] | 0.023 (0.056) [5.3] | 0.023 (0.058) [6.5] |
| 0.20 | 0.002 (0.030) [5.1] | 0.002 (0.030) [4.9] | 0.006 (0.029) [5.3] | 0.005 (0.029) [5.1] |
| Negative confounding | | | | |
| 0.05 | −0.027 (0.112) [6.6] | −0.027 (0.113) [5.9] | −0.080 (0.106) [12.3] | −0.077 (0.113) [11.5] |
| 0.10 | −0.007 (0.058) [5.4] | −0.007 (0.059) [5.1] | −0.023 (0.056) [7.0] | −0.019 (0.057) [5.9] |
| 0.20 | −0.001 (0.030) [5.3] | 0.001 (0.030) [5.1] | −0.006 (0.029) [5.6] | −0.005 (0.029) [4.8] |
Median estimates over 10000 simulations of β =0.2 or β =0 (median standard errors) [power (%) based on nominal 95% confidence interval] from simulation study with 15 correlated instrumental variables (IVs) varying direction of confounding (β ) and average strength of IV (α) using allele score method calculated from individual‐level data and allele score, weighted generalized linear regression and likelihood‐based methods all calculated from summarized data, with crude weights.
Comparison of summarized data methods for correlated instrumental variables (IVs) with binary outcome.
| α | Allele score using individual‐level data | Allele score using summarized data | Weighted generalized linear regression | Likelihood‐based method |
|---|---|---|---|---|
| Positive causal effect: | | | | |
| Positive confounding | | | | |
| 0.05 | 0.150 (0.210) [10.9] | 0.139 (0.191) [11.1] | 0.119 (0.172) [10.8] | 0.160 (0.201) [14.0] |
| 0.10 | 0.148 (0.102) [31.6] | 0.147 (0.100) [32.2] | 0.143 (0.095) [33.4] | 0.153 (0.101) [34.8] |
| 0.20 | 0.146 (0.049) [80.6] | 0.147 (0.049) [81.0] | 0.147 (0.048) [83.3] | 0.149 (0.049) [83.0] |
| Negative confounding | | | | |
| 0.05 | 0.159 (0.218) [11.7] | 0.149 (0.199) [12.0] | 0.131 (0.178) [11.5] | 0.174 (0.209) [14.8] |
| 0.10 | 0.160 (0.106) [31.6] | 0.159 (0.104) [35.4] | 0.155 (0.099) [36.3] | 0.165 (0.104) [37.5] |
| 0.20 | 0.158 (0.051) [80.6] | 0.159 (0.051) [84.1] | 0.159 (0.049) [86.3] | 0.162 (0.050) [86.2] |
| Null causal effect: | | | | |
| Positive confounding | | | | |
| 0.05 | 0.000 (0.219) [5.2] | 0.004 (0.199) [5.2] | 0.015 (0.179) [5.3] | 0.018 (0.208) [6.6] |
| 0.10 | 0.000 (0.108) [5.0] | 0.002 (0.106) [5.0] | 0.010 (0.101) [5.4] | 0.010 (0.105) [5.5] |
| 0.20 | 0.000 (0.054) [4.9] | 0.002 (0.054) [4.9] | 0.005 (0.052) [5.2] | 0.005 (0.053) [4.8] |
| Negative confounding | | | | |
| 0.05 | −0.003 (0.220) [4.6] | 0.002 (0.199) [4.6] | 0.013 (0.179) [4.8] | 0.017 (0.208) [6.0] |
| 0.10 | 0.000 (0.108) [4.9] | 0.003 (0.106) [4.8] | 0.009 (0.101) [4.3] | 0.009 (0.105) [5.3] |
| 0.20 | 0.000 (0.054) [4.8] | 0.001 (0.054) [4.8] | 0.004 (0.052) [4.8] | 0.004 (0.052) [4.3] |
Median estimates over 10000 simulations of β =0.2 or β =0 (median standard errors) [power (%) based on nominal 95% confidence interval] from simulation study with 15 correlated IVs varying direction of confounding (β ); and average strength of IV (α), using allele score method calculated from individual‐level data, and allele score, weighted generalized linear regression and likelihood‐based methods all calculated from summarized data, with external (N = 5000) weights.
Genetic variants located in PCSK9 gene region on chromosome 1 used in applied example from main paper: rsid, position (hg18), coding and non‐coding alleles, frequency of the coding allele, beta‐coefficient for association with LDL‐c with SE taken from GLGC, beta‐coefficient for association with CHD risk taken from CARDIoGRAM.
| rsid | Position | Coding/non‐coding allele | Coding allele frequency | Association with LDL‐c (SE) | Association with CHD risk (SE) |
|---|---|---|---|---|---|
| rs1887552 | 55 260 222 | A/T | 0.29 | 0.037 (0.006) | 0.018 (0.017) |
| rs11588151 | 55 260 236 | A/G | 0.81 | 0.059 (0.008) | 0.072 (0.024) |
| rs9436961 | 55 261 419 | T/A | 0.27 | 0.046 (0.006) | 0.019 (0.017) |
| rs2479418 | 55 267 465 | G/A | 0.49 | 0.018 (0.005) | 0.033 (0.014) |
| rs2479417 | 55 268 332 | T/C | 0.35 | 0.017 (0.006) | 0.002 (0.015) |
| rs2495497 | 55 268 583 | T/C | 0.12 | 0.035 (0.008) | 0.003 (0.023) |
| *rs11206510* | | | | | |
| rs17192725 | 55 268 719 | A/G | 0.07 | 0.048 (0.011) | 0.046 (0.039) |
| rs17111490 | 55 268 764 | T/C | 0.07 | 0.002 (0.014) | −0.042 (0.043) |
| rs2094470 | 55 269 890 | C/T | 0.10 | 0.036 (0.011) | 0.048 (0.028) |
The primary SNP (rs11206510) is displayed in italics.
LDL‐c, low‐density lipoprotein cholesterol; SE, standard error; GLGC, Global Lipids Genetics Consortium; CHD, coronary heart disease; SNP, single nucleotide polymorphism.
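As a rough illustration of how the summarized associations in this table feed into a causal estimate, the sketch below applies the commonly used inverse‐variance weighted formula to the nine variants whose values are listed above. It ignores correlation between the variants and omits the primary SNP, whose values are not shown here, so it is not expected to reproduce the paper's reported estimates. Exponentiating the result assumes the CHD associations are log odds ratios, which this record does not state explicitly.

```python
import numpy as np

# Associations copied from the nine listed rows of the table above.
rsid = ["rs1887552", "rs11588151", "rs9436961", "rs2479418", "rs2479417",
        "rs2495497", "rs17192725", "rs17111490", "rs2094470"]
bx = np.array([0.037, 0.059, 0.046, 0.018, 0.017, 0.035, 0.048, 0.002, 0.036])    # LDL-c
by = np.array([0.018, 0.072, 0.019, 0.033, 0.002, 0.003, 0.046, -0.042, 0.048])   # CHD risk
se_by = np.array([0.017, 0.024, 0.017, 0.014, 0.015, 0.023, 0.039, 0.043, 0.028])

# Inverse-variance weighted estimate, treating the variants as uncorrelated
# (an assumption that does not hold for variants in a single gene region).
precision = 1.0 / se_by**2
denom = np.sum(bx**2 * precision)
est = np.sum(bx * by * precision) / denom
se = np.sqrt(1.0 / denom)

# Ratio scale, assuming the CHD associations are log odds ratios.
ratio = np.exp(est)
ci = np.exp([est - 1.96 * se, est + 1.96 * se])
print(f"estimate {ratio:.2f}, 95% CI {ci[0]:.2f} to {ci[1]:.2f}")
```

Because the variants are correlated, the standard error from this uncorrelated formula is likely too small; the methods that incorporate correlation are the appropriate comparison for inference.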
Figure 1. Estimated genetic associations and 95% confidence intervals with low‐density lipoprotein cholesterol (LDL‐c) and with coronary heart disease risk for 10 genetic variants in the PCSK9 gene region.
Estimates and 95% confidence intervals (CI) of causal effect of low‐density lipoprotein‐cholesterol on coronary heart disease risk using genetic variants from PCSK9 gene region from various analysis methods.
| Method | Equations | Estimate | 95% CI |
|---|---|---|---|
| Estimate based on single genetic variant (rs11206510) | | 2.62 | 1.52, 4.49 |
| Summary statistic method based on all genetic variants ignoring correlation | | 2.25 | 1.65, 3.07 |
| Weighted generalized linear regression method based on all genetic variants incorporating correlation | | 2.28 | 1.53, 3.38 |
| Allele score method based on all genetic variants incorporating correlation using estimated weights | | 2.25 | 1.41, 3.59 |
| Allele score method based on all genetic variants incorporating correlation using equal weights | | 2.14 | 1.18, 3.86 |
| Likelihood‐based method based on all genetic variants incorporating correlation | See Appendix A.3 | 2.31 | 1.53, 3.50 |