| Literature DB >> 28960498 |
Jessica M B Rees1, Angela M Wood1, Stephen Burgess1,2.
Abstract
Methods have been developed for Mendelian randomization that can obtain consistent causal estimates while relaxing the instrumental variable assumptions. These include multivariable Mendelian randomization, in which a genetic variant may be associated with multiple risk factors so long as any association with the outcome is via the measured risk factors (measured pleiotropy), and the MR-Egger (Mendelian randomization-Egger) method, in which a genetic variant may be directly associated with the outcome not via the risk factor of interest, so long as the direct effects of the variants on the outcome are uncorrelated with their associations with the risk factor (unmeasured pleiotropy). In this paper, we extend the MR-Egger method to a multivariable setting to correct for both measured and unmeasured pleiotropy. We show, through theoretical arguments and a simulation study, that the multivariable MR-Egger method has advantages over its univariable counterpart in terms of plausibility of the assumption needed for consistent causal estimation and power to detect a causal effect when this assumption is satisfied. The methods are compared in an applied analysis to investigate the causal effect of high-density lipoprotein cholesterol on coronary heart disease risk. The multivariable MR-Egger method will be useful to analyse high-dimensional data in situations where the risk factors are highly related and it is difficult to find genetic variants specifically associated with the risk factor of interest (multivariable by design), and as a sensitivity analysis when the genetic variants are known to have pleiotropic effects on measured risk factors.Entities:
Keywords: MR-Egger; Mendelian randomization; invalid instruments; multivariable; pleiotropy
Mesh:
Substances:
Year: 2017 PMID: 28960498 PMCID: PMC5725762 DOI: 10.1002/sim.7492
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Figure 1Causal directed acyclic graph illustrating univariable Mendelian randomization assumptions with potential violation of IV3 by a pleiotropic effect indicated by a dotted line. The genetic effect of G on X is , the direct (pleiotropic) effect of G on Y via an independent pathway is α (representing the potential violation of the IV3 assumption), and the causal effect of the risk factor X on the outcome Y is θ. U represents the set of variables that confound the association between X and Y
Figure 2Causal directed acyclic graph illustrating multivariable Mendelian randomization assumptions for a set of genetic variants G , 3 risk factors X 1, X 2, and X 3, and outcome Y. The genetic effect of G on X is , the direct (pleiotropic) effect of G on Y is , and the causal effect of the risk factor X on the outcome Y is θ . U represents the set of variables that confound the associations between X and Y
Log causal odds ratios (95% confidence intervals) for coronary heart disease per standard deviation increase in HDL‐C, with 2‐sided P‐values. Estimates of the intercept are given in univariable and multivariable MR‐Egger
|
|
| |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
|
| ||||||
| All variants | −0.130 (−0.227, −0.033) | 0.049 | 0.009 | ‐ | ‐ | ‐ |
| Reduced set of variantsa | −0.114 (−0.211, −0.017) | 0.049 | 0.022 | ‐ | ‐ | ‐ |
|
| ||||||
| All variants | −0.016 (−0.138, 0.106) | 0.062 | 0.800 | −0.007 | 0.002 | 0.004 |
| Reduced set of variantsa | 0.067 (−0.070, 0.204) | 0.069 | 0.332 | −0.012 | 0.004 | 0.001 |
|
| −0.039 (−0.123, 0.045) | 0.042 | 0.359 | ‐ | ‐ | ‐ |
|
| 0.036 (−0.063, 0.134) | 0.050 | 0.477 | −0.005 | 0.002 | 0.008 |
Abbreviations: CI, confidence interval; HDL‐C, high‐density lipoprotein cholesterol; IVW, inverse‐variance weighted; MR, Mendelian randomization; SE, standard error.
a95 variants associated with HDL‐C at a genome‐wide level of significance (P‐value<5×10−8).
Causal log odds ratios (95% confidence intervals) for coronary heart disease per standard deviation increase in HDL‐C, LDL‐C, and triglycerides from multivariable IVW and multivariable MR‐Egger. Estimates from multivariable MR‐Egger are presented from 3 models where the reference allele is the risk increasing allele for HDL‐C, LDL‐C, or triglycerides. Estimates of the intercept are given for multivariable MR‐Egger
|
|
| |||
|---|---|---|---|---|
|
|
|
|
| |
|
| −0.039 (−0.123, 0.045) | 0.375 (0.292, 0.457) | 0.173 (0.063, 0.283) | ‐ |
|
| ||||
| Orientation with respect toa: | ||||
| HDL‐C |
| 0.378 (0.297, 0.458) | 0.136 (0.024, 0.247) | −0.005 (−0.009, −0.001) |
| LDL‐C | −0.034 (−0.118, 0.049) |
| 0.194 (0.081, 0.308) | −0.003 (−0.007, 0.001) |
| TG | −0.018 (−0.102, 0.066) | 0.350 (0.267, 0.433) |
| 0.005 (0.001, 0.009) |
Abbreviations: HDL‐C, high‐density lipoprotein cholesterol; LDL‐C, low‐density lipoprotein cholesterol; MR, Mendelian randomization; TG, triglycerides.
aAlleles orientated for all genetic associations with respect to the risk increasing allele for HDL‐C, LDL‐C, or triglycerides.
Performance of multivariable IVW, univariable MR‐Egger, and multivariable MR‐Egger with respect to for a null (θ 1=0) and positive (θ 1=0.3) causal effect where are generated independently for all k. All tests were performed at the 5% level of significance
|
|
|
| ||||||
|---|---|---|---|---|---|---|---|---|
| Mean
| Power, | Mean
| Power, % | Mean
| Power, % | |||
| (mean SE) | % | (mean SE) | Intercept | Causal | (mean SE) | Intercept | Causal | |
|
| ||||||||
|
| ||||||||
| 0.000 (0.045) | 3.8 | −0.002 (0.158) | 9.1 | 4.7 | 0.000 (0.084) | 3.7 | 4.1 | |
|
| ||||||||
|
| ‐0.001 (0.100) | 4.7 | −0.001 (0.187) | 7.8 | 4.7 | 0.000 (0.165) | 4.6 | 4.6 |
|
| ||||||||
|
| 0.041 (0.100) | 6.7 | −0.003 (0.187) | 12.2 | 4.3 | −0.002 (0.165) | 5.9 | 4.5 |
|
| 0.210 (0.100) | 55.3 | 0.002 (0.187) | 49.2 | 4.6 | 0.002 (0.166) | 36.3 | 4.6 |
|
| 0.417 (0.102) | 97.4 | 0.000 (0.187) | 91.6 | 4.3 | 0.001 (0.165) | 88.0 | 4.6 |
|
| ||||||||
|
| 0.074 (0.100) | 12.3 | 0.089 (0.187) | 6.7 | 7.6 | 0.088 (0.165) | 4.3 | 8.4 |
|
| 0.240 (0.100) | 67.2 | 0.089 (0.187) | 34.1 | 7.8 | 0.088 (0.165) | 21.1 | 8.8 |
|
| 0.450 (0.101) | 98.6 | 0.088 (0.187) | 84.1 | 7.6 | 0.088 (0.165) | 78.7 | 8.7 |
|
| ||||||||
|
| ||||||||
| 0.300 (0.044) | 98.9 | 0.300 (0.157) | 9.3 | 50.1 | 0.300 (0.084) | 4.3 | 87.3 | |
|
| ||||||||
|
| 0.301 (0.100) | 84.6 | 0.303 (0.187) | 7.5 | 38.2 | 0.302 (0.166) | 4.9 | 46.4 |
|
| ||||||||
|
| 0.343 (0.100) | 91.5 | 0.300 (0.187) | 12.8 | 36.8 | 0.299 (0.165) | 6.0 | 45.8 |
|
| 0.509 (0.100) | 99.7 | 0.300 (0.188) | 50.6 | 37.3 | 0.299 (0.166) | 37.1 | 46.1 |
|
| 0.716 (0.102) | 100.0 | 0.300 (0.187) | 91.1 | 37.1 | 0.299 (0.166) | 87.9 | 46.1 |
|
| ||||||||
|
| 0.374 (0.099) | 94.3 | 0.390 (0.187) | 6.6 | 56.4 | 0.389 (0.165) | 4.6 | 65.8 |
|
| 0.539 (0.100) | 99.8 | 0.388 (0.187) | 34.4 | 55.6 | 0.387 (0.165) | 21.5 | 65.5 |
|
| 0.747 (0.101) | 100.0 | 0.383 (0.187) | 84.7 | 55.1 | 0.384 (0.165) | 78.3 | 65.2 |
Abbreviations: InSIDE, Instrument Strength Independent of Direct Effect; IVW, inverse‐variance weighted; MR, Mendelian randomization; SE, standard error.
Performance of multivariable IVW, univariable MR‐Egger, and multivariable MR‐Egger with being correlated for all k
| Multivariable IVW | Univariable MR‐Egger | Multivariable MR‐Egger | ||||||
|---|---|---|---|---|---|---|---|---|
| Mean
| Power, | Mean
| Power, % | Mean
| Power, % | |||
| (mean SE) | % | (mean SE) | Intercept | Causal | (mean SE) | Intercept | Causal | |
|
| ||||||||
|
| ||||||||
| 0.000 (0.047) | 4.0 | 0.099 (0.157) | 4.3 | 10.1 | 0.000 (0.086) | 4.4 | 4.6 | |
|
| ||||||||
|
| −0.001 (0.104) | 4.7 | 0.093 (0.187) | 4.5 | 7.4 | −0.003 (0.169) | 4.6 | 4.4 |
|
| ||||||||
|
| 0.043 (0.104) | 7.0 | 0.099 (0.187) | 5.8 | 8.0 | 0.001 (0.169) | 5.9 | 4.8 |
|
| 0.213 (0.105) | 52.7 | 0.095 (0.187) | 33.3 | 7.6 | 0.000 (0.169) | 37.2 | 4.5 |
|
| 0.426 (0.107) | 96.3 | 0.096 (0.187) | 84.5 | 7.6 | −0.001 (0.169) | 89.2 | 4.6 |
|
| ||||||||
|
| 0.062 (0.104) | 9.5 | 0.184 (0.187) | 4.6 | 17.9 | 0.078 (0.169) | 4.7 | 7.6 |
|
| 0.235 (0.104) | 62.1 | 0.187 (0.187) | 20.5 | 18.3 | 0.082 (0.169) | 22.3 | 7.5 |
|
| 0.448 (0.106) | 97.9 | 0.181 (0.187) | 73.3 | 17.8 | 0.077 (0.169) | 80.3 | 7.2 |
|
| ||||||||
|
| ||||||||
| 0.300 (0.047) | 98.7 | 0.395 (0.158) | 4.4 | 70.8 | 0.299 (0.087) | 3.9 | 86.2 | |
|
| ||||||||
|
| 0.300 (0.104) | 81.5 | 0.399 (0.187) | 4.4 | 58.0 | 0.301 (0.169) | 4.6 | 44.4 |
|
| ||||||||
|
| 0.342 (0.104) | 89.4 | 0.395 (0.187) | 6.4 | 57.4 | 0.301 (0.169) | 5.9 | 44.4 |
|
| 0.513 (0.105) | 99.4 | 0.394 (0.187) | 33.0 | 57.4 | 0.296 (0.169) | 38.0 | 43.4 |
|
| 0.729 (0.107) | 100.0 | 0.400 (0.187) | 83.5 | 58.2 | 0.304 (0.169) | 88.6 | 45.5 |
|
| ||||||||
|
| 0.365 (0.104) | 92.1 | 0.489 (0.187) | 4.2 | 74.0 | 0.382 (0.169) | 4.6 | 63.2 |
|
| 0.535 (0.104) | 99.7 | 0.486 (0.187) | 20.3 | 72.9 | 0.382 (0.169) | 21.1 | 63.2 |
|
| 0.749 (0.106) | 100.0 | 0.488 (0.187) | 72.5 | 73.4 | 0.381 (0.169) | 79.6 | 62.8 |
Abbreviations: InSIDE, Instrument Strength Independent of Direct Effect; IVW, inverse‐variance weighted; MR, Mendelian randomization; SE, standard error.
Figure 3Causal directed acyclic graph illustrating the causal relationships between the 2 risk factors X 1 and X 2, and outcome Y. The causal effect of X 1 on X 2 is γ, and the direct causal effect of the risk factor X on the outcome Y is θ . The total causal effect of X 1 on Y is θ 1+γ θ 2, consisting of the direct effect (θ 1) and the indirect effect via X 2 (γ θ 2). U represents the set of variables that confound the associations between X and Y