| Literature DB >> 32249995 |
Eric A W Slob1,2, Stephen Burgess3,4.
Abstract
The number of Mendelian randomization (MR) analyses including large numbers of genetic variants is rapidly increasing. This is due to the proliferation of genome-wide association studies, and the desire to obtain more precise estimates of causal effects. Since it is unlikely that all genetic variants will be valid instrumental variables, several robust methods have been proposed. We compare nine robust methods for MR based on summary data that can be implemented using standard statistical software. Methods were compared in three ways: by reviewing their theoretical properties, in an extensive simulation study, and in an empirical example. In the simulation study, the best method, judged by mean squared error was the contamination mixture method. This method had well-controlled Type 1 error rates with up to 50% invalid instruments across a range of scenarios. Other methods performed well according to different metrics. Outlier-robust methods had the narrowest confidence intervals in the empirical example. With isolated exceptions, all methods performed badly when over 50% of the variants were invalid instruments. Our recommendation for investigators is to perform a variety of robust methods that operate in different ways and rely on different assumptions for valid inferences to assess the reliability of MR analyses.Entities:
Keywords: Mendelian randomization; causal inference; pleiotropy; robust estimation; summary statistics
Mesh:
Year: 2020 PMID: 32249995 PMCID: PMC7317850 DOI: 10.1002/gepi.22295
Source DB: PubMed Journal: Genet Epidemiol ISSN: 0741-0395 Impact factor: 2.344
Figure 1Illustrative diagram showing the model assumed for genetic variant , with effect on the unobserved confounder , effect on exposure , and direct effect on outcome . The causal effect of the exposure on the outcome is . Dotted lines represent possible ways the instrumental variable assumptions could be violated
Summary comparison of methods
| Method | Consistency assumption | Strengths and/or weaknesses |
|---|---|---|
| Weighted median | Majority valid | Robust to outliers, sensitive to additional/removal of genetic variants, may be less efficient |
| Mode‐based estimation | Plurality valid | Robust to outliers, sensitive to bandwidth parameter and addition/removal of genetic variants, generally conservative |
| MR‐PRESSO | Outlier‐robust | Removes outliers, efficient with valid IVs, very high false positive rate with several invalid IVs |
| MR‐Robust | Outlier‐robust | Downweights outliers, efficient with valid IVs, high false‐positive rate with several invalid IVs |
| MR‐Lasso | Outlier‐robust | Removes outliers, efficient with valid IVs, high false‐positive rate with several invalid IVs |
| MR‐Egger | InSIDE | Sensitive to outliers, sensitive to violations of InSIDE assumption, InSIDE assumption often not plausible, may be less efficient |
| Contamination mixture | Plurality valid | Robust to outliers, sensitive to variance parameter and addition/removal of genetic variants, less conservative than MBE |
| MR‐Mix | Plurality valid | Robust to outliers, requires large numbers of genetic variants, very high false‐positive rate in several scenarios |
| MR‐RAPS | Pleiotropic effects (except outliers) normally distributed about zero | Downweights outliers, sensitive to violations of balanced pleiotropy assumption |
Abbreviations: InSIDE, Instrument Strength Independent of Direct Effect; IV, instrumental variable; MBE, mode‐based estimation; MR, Mendelian randomization; PRESSO, Pleiotropy Residual Sum and Outlier; RAPS, Robust Adjusted Profile Score.
Figure 2Scatter plot of genetic associations with body mass index (standard deviation units) and coronary artery disease risk (log odds ratios) for 94 variants taken from the GIANT and CARDIoGRAMplusC4D consortia, respectively
Mean, median, SD of estimates, and Type 1 error/empirical power (%) with 10 genetic variants
| Null casual effect: θ = 0 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 30% invalid | 50% invalid | 70% invalid | ||||||||||
| Method | Mean | Median |
| Type 1 error | Mean | Median |
| Type 1 error | Mean | Median |
| Type 1 error |
| Scenario 1: Balanced pleiotropy, InSIDE satisfied | ||||||||||||
| Weighted median | 0.000 | 0.000 | 0.071 | 0.139 | 0.002 | 0.001 | 0.132 | 0.276 | 0.002 | 0.000 | 0.223 | 0.481 |
| Mode‐based estimation | 0.000 | 0.000 | 0.101 | 0.111 | 0.002 | 0.000 | 0.151 | 0.268 | 0.002 | 0.001 | 0.224 | 0.619 |
| MR‐PRESSO | 0.000 | 0.000 | 0.111 | 0.122 | −0.001 | 0.000 | 0.178 | 0.154 | 0.000 | 0.001 | 0.239 | 0.174 |
| MR‐Robust | 0.000 | 0.000 | 0.029 | 0.110 | 0.001 | 0.001 | 0.127 | 0.076 | 0.002 | 0.002 | 0.224 | 0.104 |
| MR‐Lasso | 0.001 | 0.000 | 0.048 | 0.042 | 0.000 | 0.000 | 0.088 | 0.076 | 0.004 | 0.001 | 0.183 | 0.156 |
| MR‐Egger | 0.007 | 0.004 | 0.419 | 0.093 | 0.005 | 0.008 | 0.563 | 0.097 | 0.006 | 0.014 | 0.684 | 0.098 |
| Contamination mixture | 0.000 | 0.000 | 0.025 | 0.052 | 0.000 | 0.000 | 0.077 | 0.069 | 0.002 | 0.000 | 0.379 | 0.126 |
| MR‐Mix | 0.000 | 0.000 | 0.274 | 0.225 | −0.001 | 0.000 | 0.431 | 0.292 | 0.000 | 0.000 | 0.561 | 0.356 |
| MR‐RAPS | 0.000 | −0.001 | 0.106 | 0.039 | 0.001 | 0.000 | 0.172 | 0.062 | 0.001 | 0.000 | 0.226 | 0.083 |
| Scenario 2: Directional pleiotropy, InSIDE satisfied | ||||||||||||
| Weighted median | 0.013 | 0.006 | 0.060 | 0.140 | 0.036 | 0.016 | 0.108 | 0.287 | 0.084 | 0.036 | 0.175 | 0.500 |
| Mode‐based estimation | 0.007 | 0.001 | 0.081 | 0.114 | 0.020 | 0.006 | 0.122 | 0.264 | 0.059 | 0.030 | 0.180 | 0.585 |
| MR‐PRESSO | 0.028 | 0.013 | 0.079 | 0.132 | 0.069 | 0.031 | 0.133 | 0.168 | 0.122 | 0.071 | 0.182 | 0.214 |
| MR‐Robust | 0.003 | 0.002 | 0.031 | 0.106 | 0.042 | 0.023 | 0.105 | 0.084 | 0.115 | 0.094 | 0.169 | 0.152 |
| MR‐Lasso | 0.008 | 0.005 | 0.044 | 0.056 | 0.024 | 0.012 | 0.082 | 0.125 | 0.075 | 0.035 | 0.161 | 0.283 |
| MR‐Egger | 0.001 | −0.006 | 0.329 | 0.093 | 0.000 | −0.013 | 0.408 | 0.091 | −0.005 | −0.012 | 0.477 | 0.095 |
| Contamination mixture | 0.000 | 0.001 | 0.025 | 0.059 | 0.003 | 0.001 | 0.056 | 0.078 | 0.060 | 0.006 | 0.281 | 0.137 |
| MR‐Mix | 0.045 | 0.016 | 0.200 | 0.247 | 0.084 | 0.023 | 0.301 | 0.331 | 0.144 | 0.050 | 0.399 | 0.443 |
| MR‐RAPS | 0.039 | 0.030 | 0.082 | 0.053 | 0.081 | 0.071 | 0.128 | 0.095 | 0.130 | 0.119 | 0.165 | 0.152 |
| Scenario 3: Directional pleiotropy, InSIDE violated | ||||||||||||
| Weighted median | 0.022 | 0.011 | 0.071 | 0.179 | 0.073 | 0.030 | 0.137 | 0.384 | 0.135 | 0.080 | 0.188 | 0.599 |
| Mode‐based estimation | 0.013 | 0.002 | 0.090 | 0.132 | 0.044 | 0.011 | 0.148 | 0.317 | 0.094 | 0.051 | 0.192 | 0.621 |
| MR‐PRESSO | 0.047 | 0.023 | 0.095 | 0.155 | 0.113 | 0.063 | 0.153 | 0.223 | 0.179 | 0.147 | 0.185 | 0.301 |
| MR‐Robust | 0.004 | 0.002 | 0.032 | 0.106 | 0.069 | 0.040 | 0.121 | 0.109 | 0.169 | 0.152 | 0.171 | 0.216 |
| MR‐Lasso | 0.013 | 0.008 | 0.050 | 0.073 | 0.050 | 0.024 | 0.108 | 0.203 | 0.122 | 0.067 | 0.180 | 0.415 |
| MR‐Egger | 0.049 | 0.024 | 0.326 | 0.098 | 0.066 | 0.042 | 0.411 | 0.097 | 0.048 | 0.034 | 0.464 | 0.096 |
| Contamination mixture | 0.000 | 0.000 | 0.025 | 0.060 | 0.005 | 0.001 | 0.061 | 0.080 | 0.079 | 0.009 | 0.273 | 0.163 |
| MR‐Mix | 0.064 | 0.026 | 0.207 | 0.283 | 0.125 | 0.040 | 0.304 | 0.375 | 0.196 | 0.080 | 0.391 | 0.529 |
| MR‐RAPS | 0.062 | 0.050 | 0.091 | 0.085 | 0.132 | 0.118 | 0.132 | 0.182 | 0.188 | 0.180 | 0.160 | 0.262 |
Abbreviations: InSIDE, Instrument Strength Independent of Direct Effect; MR, Mendelian randomization; PRESSO, Pleiotropy Residual Sum and Outlier; RAPS, Robust Adjusted Profile Score; SD, standard deviation.
Mean, median, SD of estimates, and Type 1 error/empirical power (%) with 30 genetic variants
| Null casual effect: | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 30% invalid | 50% invalid | 70% invalid | ||||||||||
| Method | Mean | Median |
| Type 1 error | Mean | Median |
| Type 1 error | Mean | Median |
| Type 1 error |
| Scenario 1: Balanced pleiotropy, InSIDE satisfied | ||||||||||||
| Weighted median | 0.000 | 0.000 | 0.033 | 0.085 | −0.001 | 0.000 | 0.066 | 0.168 | −0.002 | −0.002 | 0.134 | 0.333 |
| Mode‐based estimation | 0.000 | 0.000 | 0.029 | 0.052 | 0.000 | 0.000 | 0.063 | 0.127 | 0.000 | −0.001 | 0.136 | 0.494 |
| MR‐PRESSO | 0.000 | 0.000 | 0.052 | 0.208 | −0.001 | 0.000 | 0.091 | 0.276 | −0.002 | 0.000 | 0.145 | 0.351 |
| MR‐Robust | 0.000 | 0.000 | 0.023 | 0.069 | 0.000 | 0.000 | 0.075 | 0.024 | −0.001 | −0.004 | 0.172 | 0.054 |
| MR‐Lasso | 0.000 | −0.001 | 0.025 | 0.038 | 0.000 | 0.000 | 0.036 | 0.061 | −0.001 | 0.000 | 0.081 | 0.111 |
| MR‐Egger | 0.004 | 0.003 | 0.319 | 0.068 | 0.006 | 0.002 | 0.400 | 0.073 | −0.010 | −0.008 | 0.464 | 0.074 |
| Contamination mixture | 0.000 | 0.000 | 0.022 | 0.062 | 0.000 | 0.000 | 0.030 | 0.078 | −0.002 | 0.001 | 0.177 | 0.127 |
| MR‐Mix | 0.000 | 0.000 | 0.141 | 0.052 | 0.000 | 0.000 | 0.215 | 0.053 | 0.002 | 0.000 | 0.321 | 0.036 |
| MR‐RAPS | −0.001 | −0.001 | 0.077 | 0.019 | 0.000 | −0.003 | 0.132 | 0.041 | −0.002 | −0.004 | 0.178 | 0.055 |
| Scenario 2: Directional pleiotropy, InSIDE satisfied | ||||||||||||
| Weighted median | 0.011 | 0.009 | 0.031 | 0.100 | 0.031 | 0.021 | 0.066 | 0.235 | 0.083 | 0.048 | 0.127 | 0.438 |
| Mode‐based estimation | 0.001 | 0.000 | 0.026 | 0.049 | 0.006 | 0.003 | 0.054 | 0.132 | 0.040 | 0.026 | 0.113 | 0.454 |
| MR‐PRESSO | 0.024 | 0.016 | 0.042 | 0.230 | 0.071 | 0.047 | 0.089 | 0.424 | 0.145 | 0.119 | 0.134 | 0.584 |
| MR‐Robust | 0.003 | 0.002 | 0.022 | 0.065 | 0.034 | 0.026 | 0.067 | 0.030 | 0.149 | 0.140 | 0.133 | 0.159 |
| MR‐Lasso | 0.004 | 0.003 | 0.023 | 0.058 | 0.014 | 0.011 | 0.039 | 0.135 | 0.061 | 0.039 | 0.097 | 0.340 |
| MR‐Egger | 0.004 | −0.004 | 0.228 | 0.073 | 0.001 | −0.005 | 0.285 | 0.074 | −0.002 | −0.008 | 0.328 | 0.071 |
| Contamination mixture | 0.001 | 0.001 | 0.020 | 0.064 | 0.001 | 0.001 | 0.028 | 0.085 | 0.015 | 0.003 | 0.141 | 0.140 |
| MR‐Mix | 0.018 | 0.006 | 0.135 | 0.078 | 0.041 | 0.010 | 0.216 | 0.107 | 0.096 | 0.010 | 0.355 | 0.119 |
| MR‐RAPS | 0.046 | 0.042 | 0.058 | 0.051 | 0.110 | 0.105 | 0.099 | 0.160 | 0.179 | 0.175 | 0.129 | 0.273 |
| Scenario 3: Directional pleiotropy, InSIDE violated | ||||||||||||
| Weighted median | 0.022 | 0.017 | 0.042 | 0.168 | 0.067 | 0.040 | 0.095 | 0.401 | 0.156 | 0.114 | 0.155 | 0.668 |
| Mode‐based estimation | 0.002 | 0.001 | 0.033 | 0.057 | 0.016 | 0.006 | 0.073 | 0.172 | 0.077 | 0.048 | 0.140 | 0.531 |
| MR‐PRESSO | 0.050 | 0.035 | 0.061 | 0.397 | 0.132 | 0.108 | 0.114 | 0.653 | 0.232 | 0.216 | 0.146 | 0.816 |
| MR‐Robust | 0.004 | 0.004 | 0.023 | 0.052 | 0.059 | 0.045 | 0.080 | 0.041 | 0.224 | 0.216 | 0.136 | 0.335 |
| MR‐Lasso | 0.008 | 0.007 | 0.025 | 0.086 | 0.033 | 0.024 | 0.054 | 0.267 | 0.123 | 0.089 | 0.130 | 0.597 |
| MR‐Egger | 0.092 | 0.074 | 0.234 | 0.105 | 0.099 | 0.090 | 0.277 | 0.091 | 0.094 | 0.089 | 0.312 | 0.084 |
| Contamination mixture | 0.000 | 0.001 | 0.020 | 0.062 | 0.002 | 0.002 | 0.029 | 0.093 | 0.026 | 0.005 | 0.156 | 0.166 |
| MR‐Mix | 0.029 | 0.010 | 0.141 | 0.095 | 0.056 | 0.010 | 0.220 | 0.139 | 0.125 | 0.020 | 0.327 | 0.154 |
| MR‐RAPS | 0.082 | 0.075 | 0.068 | 0.174 | 0.172 | 0.165 | 0.103 | 0.415 | 0.256 | 0.251 | 0.124 | 0.591 |
Abbreviations: InSIDE, Instrument Strength Independent of Direct Effect; MR, Mendelian randomization; PRESSO, Pleiotropy Residual Sum and Outlier; RAPS, Robust Adjusted Profile Score; SD, standard deviation.
Mean, median, SD of estimates, and Type 1 error/empirical power (%) with 100 genetic variants
| Null casual effect: θ = 0 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 30% invalid | 50% invalid | 70% invalid | ||||||||||
| Method | Mean | Median |
| Type 1 error | Mean | Median |
| Type 1 error | Mean | Median |
| Type 1 error |
| Scenario 1: Balanced pleiotropy, InSIDE satisfied | ||||||||||||
| Weighted median | 0.000 | 0.000 | 0.025 | 0.069 | −0.001 | 0.000 | 0.041 | 0.124 | 0.000 | 0.000 | 0.077 | 0.234 |
| Mode‐based estimation | 0.000 | 0.000 | 0.024 | 0.038 | 0.000 | 0.000 | 0.035 | 0.082 | 0.000 | 0.000 | 0.084 | 0.333 |
| MR‐PRESSO | 0.000 | 0.000 | 0.025 | 0.134 | 0.000 | 0.001 | 0.047 | 0.224 | 0.000 | −0.001 | 0.083 | 0.313 |
| MR‐Robust | 0.000 | 0.000 | 0.020 | 0.052 | 0.000 | 0.001 | 0.053 | 0.024 | 0.000 | −0.001 | 0.126 | 0.044 |
| MR‐Lasso | 0.000 | 0.000 | 0.019 | 0.042 | 0.000 | 0.000 | 0.029 | 0.072 | 0.000 | 0.000 | 0.055 | 0.120 |
| MR‐Egger | −0.001 | −0.001 | 0.195 | 0.067 | −0.001 | 0.000 | 0.252 | 0.069 | −0.003 | −0.005 | 0.296 | 0.065 |
| Contamination mixture | 0.000 | 0.000 | 0.019 | 0.064 | 0.000 | 0.000 | 0.029 | 0.088 | 0.002 | 0.000 | 0.211 | 0.136 |
| MR‐Mix | 0.000 | 0.000 | 0.075 | 0.038 | −0.001 | 0.000 | 0.072 | 0.024 | 0.000 | 0.000 | 0.058 | 0.000 |
| MR‐RAPS | 0.000 | −0.001 | 0.053 | 0.016 | −0.001 | 0.000 | 0.095 | 0.036 | 0.000 | −0.003 | 0.133 | 0.052 |
| Scenario 2: Directional pleiotropy, InSIDE satisfied | ||||||||||||
| Weighted median | 0.013 | 0.012 | 0.023 | 0.105 | 0.033 | 0.029 | 0.039 | 0.258 | 0.087 | 0.071 | 0.084 | 0.537 |
| Mode‐based estimation | 0.000 | 0.000 | 0.020 | 0.037 | 0.004 | 0.003 | 0.030 | 0.089 | 0.034 | 0.030 | 0.067 | 0.351 |
| MR‐PRESSO | 0.022 | 0.018 | 0.026 | 0.294 | 0.071 | 0.062 | 0.056 | 0.628 | 0.162 | 0.150 | 0.096 | 0.856 |
| MR‐Robust | 0.004 | 0.004 | 0.018 | 0.051 | 0.042 | 0.038 | 0.047 | 0.040 | 0.193 | 0.189 | 0.100 | 0.425 |
| MR‐Lasso | 0.004 | 0.004 | 0.017 | 0.077 | 0.020 | 0.018 | 0.029 | 0.242 | 0.076 | 0.066 | 0.067 | 0.617 |
| MR‐Egger | 0.001 | −0.003 | 0.143 | 0.062 | −0.002 | −0.005 | 0.180 | 0.059 | 0.003 | 0.001 | 0.210 | 0.058 |
| Contamination mixture | 0.000 | 0.001 | 0.017 | 0.061 | 0.001 | 0.001 | 0.025 | 0.090 | 0.018 | 0.005 | 0.160 | 0.156 |
| MR‐Mix | 0.005 | 0.000 | 0.074 | 0.034 | 0.004 | 0.000 | 0.072 | 0.035 | 0.006 | 0.000 | 0.070 | 0.007 |
| MR‐RAPS | 0.058 | 0.056 | 0.042 | 0.142 | 0.140 | 0.138 | 0.072 | 0.435 | 0.233 | 0.232 | 0.097 | 0.663 |
| Scenario 3: Directional pleiotropy, InSIDE violated | ||||||||||||
| Weighted median | 0.027 | 0.025 | 0.027 | 0.258 | 0.077 | 0.065 | 0.062 | 0.619 | 0.184 | 0.163 | 0.116 | 0.881 |
| Mode‐based estimation | 0.001 | 0.001 | 0.021 | 0.042 | 0.010 | 0.008 | 0.035 | 0.120 | 0.065 | 0.054 | 0.087 | 0.465 |
| MR‐PRESSO | 0.053 | 0.047 | 0.040 | 0.658 | 0.152 | 0.142 | 0.079 | 0.943 | 0.276 | 0.270 | 0.103 | 0.993 |
| MR‐Robust | 0.007 | 0.007 | 0.019 | 0.054 | 0.078 | 0.071 | 0.059 | 0.080 | 0.292 | 0.289 | 0.099 | 0.805 |
| MR‐Lasso | 0.010 | 0.009 | 0.018 | 0.153 | 0.049 | 0.043 | 0.041 | 0.575 | 0.165 | 0.151 | 0.097 | 0.927 |
| MR‐Egger | 0.119 | 0.114 | 0.148 | 0.162 | 0.141 | 0.138 | 0.178 | 0.159 | 0.123 | 0.124 | 0.194 | 0.114 |
| Contamination mixture | 0.001 | 0.001 | 0.017 | 0.069 | 0.003 | 0.003 | 0.026 | 0.107 | 0.021 | 0.009 | 0.135 | 0.199 |
| MR‐Mix | 0.007 | 0.000 | 0.073 | 0.037 | 0.008 | 0.000 | 0.070 | 0.034 | 0.007 | 0.000 | 0.068 | 0.006 |
| MR‐RAPS | 0.104 | 0.101 | 0.049 | 0.545 | 0.224 | 0.221 | 0.076 | 0.896 | 0.330 | 0.327 | 0.090 | 0.976 |
Abbreviations: InSIDE, Instrument Strength Independent of Direct Effect; MR, Mendelian randomization; PRESSO, Pleiotropy Residual Sum and Outlier; RAPS, Robust Adjusted Profile Score; SD, standard deviation.
Figure 3Mean squared errors for the different methods in Scenario 2 (directional pleiotropy, InSIDE satisfied) with a null causal effect for 30 variants. Note the vertical axis is on a logarithmic scale
Figure 4Mean squared errors for the different methods in Scenario 3 (directional pleiotropy, InSIDE violated) with a null causal effect for 30 variants. Note the vertical axis is on a logarithmic scale
Estimates and 95% CI for the effect of BMI on coronary artery disease risk from robust methods
| Method | Causal estimate (95% CI) | CI width |
|---|---|---|
| Weighted median | 0.376 (0.206, 0.546) | 0.340 |
| Mode‐based estimation | 0.382 (0.181, 0.583) | 0.402 |
| MR‐PRESSO | 0.410 (0.309, 0.511) | 0.202 |
| MR‐Robust | 0.425 (0.325, 0.526) | 0.201 |
| MR‐Lasso | 0.442 (0.354, 0.530) | 0.176 |
| MR‐Egger | 0.481 (0.165, 0.796) | 0.631 |
| (intercept) | −0.003 (−0.011, 0.005) | |
| Contamination mixture | 0.490 (0.372, 0.602) | 0.230 |
| MR‐Mix | 0.425 (−0.283, 1.133) | 1.416 |
| MR‐RAPS | 0.390 (0.308, 0.546) | 0.238 |
Note: Estimates represent log odds ratios for CAD risk per 1 kg/m2 increase in BMI.
Abbreviations: BMI, body mass index; CI, confidence intervals; PRESSO, Pleiotropy Residual Sum and Outlier; RAPS, Robust Adjusted Profile Score.