| Literature DB >> 30561657 |
Jack Bowden1,2, Fabiola Del Greco M3, Cosetta Minelli4, Qingyuan Zhao5, Debbie A Lawlor1,2, Nuala A Sheehan6, John Thompson6, George Davey Smith1,2.
Abstract
BACKGROUND: Two-sample summary-data Mendelian randomization (MR) incorporating multiple genetic variants within a meta-analysis framework is a popular technique for assessing causality in epidemiology. If all genetic variants satisfy the instrumental variable (IV) and necessary modelling assumptions, then their individual ratio estimates of causal effect should be homogeneous. Observed heterogeneity signals that one or more of these assumptions could have been violated.Entities:
Keywords: Cochran’s Q statistic; Two-sample summary-data Mendelian randomization; inverse-variance weighted estimate; outlier detection
Mesh:
Year: 2019 PMID: 30561657 PMCID: PMC6659376 DOI: 10.1093/ije/dyy258
Source DB: PubMed Journal: Int J Epidemiol ISSN: 0300-5771 Impact factor: 7.196
Mean Q statistic and type I error rate (T1E) of first-order, second-order, iterative (four iterations were performed) and exact weighting
| Mean | First-order | Second-order | Modified | |||||
|---|---|---|---|---|---|---|---|---|
| Iterative | Exact | |||||||
|
|
| T1E( |
| T1E( |
| T1E( |
| T1E( |
| No heterogeneity, | ||||||||
| 100 | 23.9 | 0.044 | 22.8 | 0.022 | 23.9 | 0.044 | 23.9 | 0.044 |
| 61 | 24.1 | 0.052 | 21.9 | 0.016 | 24.1 | 0.051 | 24.1 | 0.051 |
| 40 | 23.9 | 0.049 | 20.3 | 0.006 | 23.9 | 0.048 | 23.9 | 0.048 |
| 25 | 24.0 | 0.052 | 17.7 | 0.002 | 23.9 | 0.051 | 23.9 | 0.051 |
| 10 | 24.0 | 0.052 | 12.3 | 0.000 | 23.6 | 0.047 | 23.4 | 0.042 |
| No heterogeneity, | ||||||||
| 100 | 24.2 | 0.053 | 22.9 | 0.028 | 24.0 | 0.049 | 24.0 | 0.049 |
| 61 | 24.4 | 0.058 | 21.9 | 0.017 | 24.0 | 0.051 | 24.0 | 0.051 |
| 40 | 24.7 | 0.064 | 20.3 | 0.007 | 23.9 | 0.050 | 23.9 | 0.049 |
| 25 | 25.9 | 0.092 | 17.8 | 0.002 | 24.1 | 0.052 | 23.9 | 0.048 |
| 10 | 31.4 | 0.272 | 13.4 | 0.000 | 25.6 | 0.095 | 23.7 | 0.043 |
| No heterogeneity, | ||||||||
| 100 | 24.7 | 0.065 | 22.8 | 0.027 | 23.9 | 0.052 | 23.9 | 0.051 |
| 61 | 25.6 | 0.084 | 21.8 | 0.017 | 23.9 | 0.048 | 23.9 | 0.047 |
| 40 | 27.3 | 0.132 | 20.5 | 0.009 | 24.1 | 0.053 | 24.0 | 0.050 |
| 25 | 31.7 | 0.282 | 18.2 | 0.003 | 24.4 | 0.060 | 23.9 | 0.048 |
| 10 | 53.9 | 0.792 | 15.8 | 0.004 | 27.8 | 0.166 | 23.9 | 0.051 |
Results are the average of 10 000 simulated data sets. Type I error rate (T1E(Q)) refers to the proportion of times Q is greater than the upper 95th percentile of a distribution.
Figure 1.Distribution of Q statistics (with 25 degrees of freedom) using first-order, second-order and exact weights. The causal effect β = 0.1 and the mean F-statistic equals 100 (left) and 10 (right) respectively.
Figure 2.Left: Power of Cochran’s Q statistic to detect heterogeneity as a function of the pleiotropy variance and number of SNPs (L) using first-order, second-order and exact weights. Pleiotropy is simulated under a multiplicative random-effects model. The causal effect is equal to 0.05 and the mean F-statistic is 61. Top group: L=100; middle group: L=25; bottom group: L=10. Right: Equivalent power plot except the causal effect is equal to 0.1 and the mean F-statistic is 25.
Mean causal estimate , standard error (SE) and coverage frequency (CF) of the 95% confidence interval when using first-order, second-order, iterative and exact weights
| Mean | First-order | Second-order | Modified | ||
|---|---|---|---|---|---|
| Iterative | Exact |
| |||
|
|
|
|
|
| CF2 |
| No heterogeneity, | |||||
| 100 | 0.000 (0.011); 0.952 | 0.000 (0.011); 0.951 | 0.000 (0.011); 0.952 | 0.000 (0.011); 0.961 | 0.948 |
| 61 | 0.000 (0.011); 0.947 | 0.000 (0.011); 0.947 | 0.000 (0.011); 0.948 | 0.000 (0.011); 0.956 | 0.946 |
| 40 | 0.000 (0.011); 0.954 | 0.000 (0.010); 0.952 | 0.000 (0.011); 0.955 | 0.000 (0.011); 0.957 | 0.946 |
| 25 | 0.000 (0.011); 0.947 | 0.000 (0.010); 0.941 | 0.000 (0.011); 0.949 | 0.000 (0.011); 0.942 | 0.949 |
| 10 | 0.000 (0.009); 0.952 | 0.000 (0.007); 0.928 | 0.000 (0.009); 0.958 | 0.000 (0.010); 0.836 | 0.958 |
| No heterogeneity, | |||||
| 100 | 0.049 (0.011); 0.952 | 0.049 (0.011); 0.951 | 0.049 (0.011); 0.954 | 0.050 (0.011); 0.962 | 0.952 |
| 61 | 0.049 (0.011); 0.948 | 0.047 (0.011); 0.944 | 0.049 (0.011); 0.952 | 0.050 (0.011); 0.961 | 0.953 |
| 40 | 0.048 (0.011); 0.939 | 0.045 (0.011); 0.918 | 0.048 (0.011); 0.943 | 0.050 (0.012); 0.951 | 0.946 |
| 25 | 0.046 (0.011); 0.910 | 0.041 (0.010); 0.819 | 0.046 (0.011); 0.923 | 0.050 (0.012); 0.940 | 0.954 |
| 10 | 0.033 (0.010); 0.589 | 0.027 (0.008); 0.286 | 0.034 (0.011); 0.670 | 0.051 (0.012); 0.868 | 0.957 |
| No heterogeneity, | |||||
| 100 | 0.099 (0.011); 0.945 | 0.097 (0.011); 0.945 | 0.099 (0.012); 0.950 | 0.100 (0.012); 0.963 | 0.946 |
| 61 | 0.098 (0.011); 0.932 | 0.095 (0.011); 0.920 | 0.098 (0.012); 0.944 | 0.100 (0.012); 0.956 | 0.947 |
| 40 | 0.097 (0.012); 0.911 | 0.091 (0.011); 0.859 | 0.097 (0.012); 0.933 | 0.100 (0.013); 0.954 | 0.951 |
| 25 | 0.092 (0.012); 0.844 | 0.083 (0.011); 0.649 | 0.092 (0.013); 0.896 | 0.101 (0.014); 0.947 | 0.955 |
| 10 | 0.065 (0.013); 0.348 | 0.055 (0.010); 0.094 | 0.072 (0.014); 0.518 | 0.102 (0.016); 0.895 | 0.964 |
Number of variants L = 25. CF1 = coverage of a symmetric 95% confidence interval, CF2 = coverage of inverted Q statistic confidence interval.
Mean causal estimate , standard error (SE) and coverage frequency (CF) of the 95% confidence interval when using first-order, second-order, iterative and exact weights
| Mean | First-order | Second-order | Modified | ||
|---|---|---|---|---|---|
| Iterative | Exact | ||||
|
|
|
|
|
|
|
| Heterogeneity, | |||||
| 100 | 0.000(0.016); 0.949 | 0.000 (0.015); 0.950 | 0.000 (0.016); 0.950 | 0.000 (0.016); 0.939 | 2.000 |
| 61 | 0.000 (0.016); 0.950 | 0.000 (0.015); 0.951 | 0.000 (0.016); 0.951 | 0.000 (0.016); 0.940 | 2.004 |
| 40 | 0.000 (0.016); 0.953 | 0.000 (0.014); 0.951 | 0.000 (0.016); 0.955 | 0.000 (0.016); 0.944 | 1.999 |
| 25 | 0.000 (0.015); 0.949 | 0.000 (0.013); 0.945 | 0.000 (0.015); 0.954 | 0.000 (0.017); 0.945 | 2.003 |
| 10 | 0.000 (0.013); 0.952 | 0.000 (0.009); 0.924 | 0.000 (0.013); 0.960 | 0.000 (0.037); 0.970 | 1.943 |
| Heterogeneity, | |||||
| 100 | 0.050 (0.016); 0.948 | 0.048 (0.015); 0.947 | 0.050 (0.016); 0.949 | 0.050 (0.016); 0.938 | 2.002 |
| 62 | 0.049 (0.016); 0.951 | 0.046 (0.015); 0.943 | 0.049 (0.016); 0.954 | 0.050 (0.016); 0.943 | 1.998 |
| 40 | 0.048 (0.016); 0.949 | 0.044 (0.014); 0.924 | 0.048 (0.016); 0.953 | 0.050 (0.017); 0.943 | 1.995 |
| 25 | 0.046 (0.015); 0.933 | 0.039 (0.013); 0.839 | 0.046 (0.016); 0.940 | 0.051 (0.018); 0.944 | 1.987 |
| 10 | 0.033 (0.014); 0.719 | 0.025 (0.010); 0.378 | 0.034 (0.015); 0.778 | 0.051 (0.037); 0.960 | 1.967 |
| Heterogeneity, | |||||
| 100 | 0.099 (0.016); 0.947 | 0.096 (0.016); 0.942 | 0.099 (0.016); 0.952 | 0.100 (0.016); 0.942 | 2.005 |
| 61 | 0.098 (0.016); 0.941 | 0.092 (0.016); 0.922 | 0.098 (0.017); 0.951 | 0.100 (0.017); 0.941 | 2.004 |
| 40 | 0.097 (0.016); 0.932 | 0.088 (0.015); 0.862 | 0.097 (0.017); 0.947 | 0.101 (0.017); 0.940 | 2.003 |
| 25 | 0.092 (0.016); 0.888 | 0.078 (0.015); 0.676 | 0.093 (0.018); 0.924 | 0.101 (0.019); 0.942 | 2.003 |
| 10 | 0.065 (0.016); 0.456 | 0.051 (0.012); 0.131 | 0.072 (0.018); 0.639 | 0.101 (0.042); 0.956 | 2.023 |
L = 25. equals the variance inflation factor estimate (true value = 2).
Figure 3.Top: Scatter plot of SNP–outcome associations vs SNP–exposure associations . IVW estimate shown as a black slope. Bottom-left: Q contribution plots for the same data. Bottom-right: Q contributions after removal of rs17249754.
IVW and weighted median analyses of the causal effect of SBP on CHD risk for the complete data (top) and with SNP rs17249754 removed (bottom)
| Method (weights) | Estimate (CI) | SE |
| Het. Stat ( |
|
|---|---|---|---|---|---|
| All 26 SNPs | |||||
| Causal estimate | |||||
| IVW (first—RE) |
| 0.010 | 3.01 × 10–5 |
| 2.68 |
| IVW (second—RE) |
| 0.010 | 4.54 × 10–5 |
| 2.35 |
| IVW (iterative—RE) |
| 0.010 | 2.40 × 10–5 |
| 2.51 |
| IVW (exact—RE) |
| 0.014 | 4.60 × 10–4 |
| 2.61 |
| Weighted median (first-order weights) | |||||
| Weighted median |
| 0.011 | 4.90 × 10–6 | – | – |
| SNP rs17249754 removed | |||||
| Causal estimate | |||||
| IVW (first—RE) |
| 0.008 | 2.63 × 10–8 |
| 1.46 |
| IVW (second—RE) |
| 0.008 | 4.06 × 10–8 |
| 1.27 |
| IVW (iterative—RE) |
| 0.008 | 2.90 × 10–8 |
| 1.37 |
| IVW (exact—RE) |
| 0.009 | 8.37 × 10–8 |
| 1.39 |
| Weighted median (first-order weights) | |||||
| Weighted median |
| 0.011 | 2.33 × 10–6 | – | |
is the IVW estimate. is the weighted median estimate. All IVW estimates fitted under a multiplicative random-effects model (RE), where refers to the variance inflation factor estimate. The weighted median naturally accounts for heterogeneity via a bootstrapped variance.