| Literature DB >> 33570130 |
Eleanor Sanderson1,2, Tom G Richardson1,2, Gibran Hemani1,2, George Davey Smith1,2.
Abstract
A key assumption of Mendelian randomization (MR) analysis is that there is no association between the genetic variants used as instruments and the outcome other than through the exposure of interest. One way in which this assumption can be violated is through population stratification, which can introduce confounding of the relationship between the genetic variants and the outcome and so induce an association between them. Negative control outcomes are increasingly used to detect unobserved confounding in observational epidemiological studies. Here we consider the use of negative control outcomes in MR studies to detect confounding of the genetic variants and the exposure or outcome. As a negative control outcome in an MR study, we propose the use of phenotypes which are determined before the exposure and outcome but which are likely to be subject to the same confounding as the exposure or outcome of interest. We illustrate our method with a two-sample MR analysis of a preselected set of exposures on self-reported tanning ability and hair colour. Our results show that, of the 33 exposures considered, genome-wide association studies (GWAS) of adiposity and education-related traits are likely to be subject to population stratification that is not controlled for through adjustment, and so any MR study including these traits may be subject to bias that cannot be identified through standard pleiotropy robust methods. Negative control outcomes should therefore be used regularly in MR studies to detect potential population stratification in the data used.Entities:
Keywords: Mendelian randomization; Population stratification; negative control outcomes
Mesh:
Year: 2021 PMID: 33570130 PMCID: PMC8407870 DOI: 10.1093/ije/dyaa288
Source DB: PubMed Journal: Int J Epidemiol ISSN: 0300-5771 Impact factor: 7.196
Figure 1Instrumental variable assumptions, and violation of these assumptions through population stratification or pleiotropy. (a) Instrumental variable assumptions. (b) Confounding of the genetic instrument and outcome introduced by population stratification. (c) Mechanisms through which pleiotropy can cause bias in Mendelian randomization estimates. X is the exposure of interest, Y is the outcome of interest, Gx are the genetic variants associated with X used as instruments, GY are genetic variants associated with Y, C is a confounder of the exposure outcome relationship. In (a): assumption IV1 is illustrated by the bold line from GX to X. Violations of assumptions IV2 and IV3 are given by the dashed lines from C to GX and from GX to Y, respectively. In (b): the presence of population stratification creates an association between GX and Y that does not go through X, violating one of the IV assumptions. In (c): pleiotropy will cause bias in MR estimates if either both edges marked a, or the edge marked b, are present. Pleiotropy in MR studies is explained in detail elsewhere
Figure 2Inverse variance weight (IVW) estimates from Mendelian randomization (MR) analyses on self-reported tanning ability. IVW results from MR analyses of 33 preselected traits on tanning ability. A higher score indicates less being less likely to tan and more likely to burn when exposed to strong sunlight. Full results from these analyses are given in Supplementary Table S1
Full Mendelian randomization (MR) results for exposures which show potential association with tanning ability
| Exposure | No. SNPs | Est. method | Effect | Std error | 95% confidence interval | ||
|---|---|---|---|---|---|---|---|
| Years of schooling | 68 | IVW | 0.072 | 0.029 | 0.012 | [0.016 | 0.104] |
| MR Egger | 0.056 | 0.149 | 0.710 | [-0.237 | -0.409] | ||
| Weighted median | 0.092 | 0.026 | <0.001 | [0.040 | 0.171] | ||
| Weighted mode | 0.103 | 0.049 | 0.038 | [0.007 | 0.118] | ||
| Childhood obesity | 5 | IVW | 0.028 | 0.014 | 0.040 | [0.001 | 0.031] |
| MR Egger | −0.088 | 0.100 | 0.444 | [-0.285 | −0.646] | ||
| Weighted median | 0.018 | 0.007 | 0.009 | [0.004 | 0.026] | ||
| Weighted mode | 0.015 | 0.008 | 0.109 | [0.001 | 0.017] | ||
| Body mass index | 78 | IVW | 0.049 | 0.016 | 0.002 | [0.018 | 0.083] |
| MR Egger | 0.055 | 0.039 | 0.165 | [-0.022 | 0.012] | ||
| Weighted median | 0.047 | 0.018 | 0.008 | [0.012 | 0.071] | ||
| Weighted mode | 0.038 | 0.019 | 0.050 | [0.001 | 0.039] | ||
| HDL cholesterol | 84 | IVW | −0.034 | 0.016 | 0.036 | [-0.065 | −0.162] |
| MR Egger | −0.006 | 0.030 | 0.848 | [-0.064 | −0.131] | ||
| Weighted median | −0.020 | 0.010 | 0.046 | [-0.040 | −0.100] | ||
| Weighted mode | −0.019 | 0.009 | 0.030 | [-0.037 | −0.091] | ||
| Triglycerides | 55 | IVW | 0.040 | 0.012 | 0.001 | [0.016 | 0.072] |
| MR Egger | 0.024 | 0.019 | 0.226 | [-0.014 | −0.004] | ||
| Weighted median | 0.014 | 0.011 | 0.226 | [-0.008 | −0.003] | ||
| Weighted mode | 0.010 | 0.012 | 0.406 | [-0.014 | −0.017] | ||
| Inflammatory bowel disease | 62 | IVW | 0.009 | 0.003 | 0.002 | [0.003 | 0.015] |
| MR Egger | 0.008 | 0.007 | 0.273 | [-0.006 | −0.004] | ||
| Weighted median | 0.004 | 0.003 | 0.185 | [-0.002 | 0.000] | ||
| Weighted mode | 0.003 | 0.004 | 0.459 | [-0.005 | −0.007] | ||
| Waist circumference | 45 | IVW | 0.050 | 0.023 | 0.033 | [0.004 | 0.057] |
| MR Egger | 0.098 | 0.062 | 0.121 | [-0.023 | 0.052] | ||
| Weighted median | 0.052 | 0.022 | 0.017 | [0.009 | 0.070] | ||
| Weighted mode | 0.035 | 0.022 | 0.128 | [-0.009 | 0.017] | ||
| Extreme height | 44 | IVW | −0.013 | 0.006 | 0.031 | [-0.024 | −0.061] |
| MR Egger | −0.044 | 0.027 | 0.115 | [-0.097 | −0.234] | ||
| Weighted median | −0.003 | 0.003 | 0.257 | [-0.009 | −0.021] | ||
| Weighted mode | 0.000 | 0.006 | 0.932 | [-0.011 | −0.023] | ||
| Obesity class 1 | 17 | IVW | 0.020 | 0.010 | 0.038 | [0.001 | 0.022] |
| MR Egger | −0.009 | 0.027 | 0.735 | [-0.063 | −0.133] | ||
| Weighted median | 0.014 | 0.007 | 0.042 | [0.001 | 0.015] | ||
| Weighted mode | 0.013 | 0.007 | 0.100 | [-0.002 | 0.010] | ||
| Obesity class 2 | 11 | IVW | 0.011 | 0.005 | 0.030 | [0.001 | 0.014] |
| MR Egger | 0.002 | 0.016 | 0.927 | [-0.030 | −0.058] | ||
| Weighted median | 0.012 | 0.006 | 0.026 | [0.001 | 0.015] | ||
| Weighted mode | 0.011 | 0.006 | 0.099 | [-0.001 | 0.009] | ||
| Overweight | 14 | IVW | 0.036 | 0.015 | 0.018 | [0.006 | 0.048] |
| MR Egger | −0.025 | 0.050 | 0.630 | [-0.124 | −0.267] | ||
| Weighted median | 0.028 | 0.011 | 0.011 | [0.006 | 0.040] | ||
| Weighted mode | 0.029 | 0.012 | 0.033 | [0.005 | 0.039] | ||
Results from inverse variance weight (IVW), MR Egger, weighted mode and weighted median analyses for those phenotypes which indicated a potential effect on tanning ability from an MR analysis of 33 preselected phenotypes on tanning ability.
SNPs, single nucleotide polymorphisms; Est., estimation; Std, standard; HDL, high-density lipoprotein.
Figure 3Inverse variance weight (IVW) estimates from Mendelian randomization (MR) analyses on self-reported natural hair colour. IVW results from MR analyses of 33 preselected traits on hair colour. A higher score indicates darker hair colour. Full results from these analyses are given in Supplementary Table S1
Mendelian randomization (MR) results for exposures which show potential association with hair colour
| Exposure | No. SNPs | Est. method | Effect | Std. error | 95% confidence interval | ||
|---|---|---|---|---|---|---|---|
| Years of schooling | 71 | IVW | 0.070 | 0.027 | 0.012 | [0.017 | 0.123] |
| MR Egger | 0.069 | 0.140 | 0.502 | [-0.205 | 0.342] | ||
| Weighted median | 0.074 | 0.024 | 0.002 | [0.027 | 0.120] | ||
| Weighted mode | 0.112 | 0.060 | 0.046 | [-0.005 | 0.230] | ||
| Coeliac disease | 13 | IVW | −0.004 | 0.002 | 0.012 | [-0.008 | 0.000] |
| MR Egger | −0.007 | 0.003 | 0.010 | [-0.012 | −0.002] | ||
| Weighted median | −0.007 | 0.002 | 0.005 | [-0.011 | −0.002] | ||
| Weighted mode | −0.005 | 0.002 | 0.002 | [-0.009 | −0.001] | ||
| LDL cholesterol | 79 | IVW | −0.020 | 0.009 | 0.025 | [-0.038 | −0.002] |
| MR Egger | −0.016 | 0.013 | 0.189 | [-0.042 | 0.010] | ||
| Weighted median | −0.025 | 0.008 | 0.001 | [-0.040 | −0.010] | ||
| Weighted mode | −0.020 | 0.006 | <0.001 | [-0.031 | −0.009] | ||
| Total cholesterol | 87 | IVW | −0.028 | 0.010 | 0.005 | [-0.048 | −0.009] |
| MR Egger | −0.018 | 0.016 | 0.212 | [-0.050 | 0.014] | ||
| Weighted median | −0.026 | 0.008 | 0.003 | [-0.043 | −0.009] | ||
| Weighted mode | −0.026 | 0.007 | <0.001 | [-0.040 | −0.012] | ||
| Triglycerides | 54 | IVW | −0.053 | 0.015 | 0.001 | [-0.082 | −0.024] |
| MR Egger | −0.059 | 0.025 | 0.044 | [-0.107 | −0.010] | ||
| Weighted median | −0.032 | 0.012 | 0.008 | [-0.054 | −0.009] | ||
| Weighted mode | −0.037 | 0.013 | 0.040 | [-0.063 | −0.011] | ||
| Extreme body mass index | 7 | IVW | 0.008 | 0.004 | 0.028 | [0.001 | 0.015] |
| MR Egger | −0.017 | 0.017 | 0.375 | [-0.050 | 0.017] | ||
| Weighted median | 0.006 | 0.005 | 0.173 | [-0.003 | 0.016] | ||
| Weighted mode | 0.001 | 0.006 | 0.812 | [-0.010 | 0.013] | ||
| Obesity class 2 | 11 | IVW | 0.010 | 0.004 | 0.010 | [0.002 | 0.018] |
| MR Egger | −0.011 | 0.012 | 0.353 | [-0.034 | 0.012] | ||
| Weighted median | 0.004 | 0.005 | 0.427 | [-0.006 | 0.015] | ||
| Weighted mode | 0.002 | 0.006 | 0.710 | [-0.009 | 0.014] | ||
Results from inverse variance weight (IVW), MR Egger, weighted mode and weighted median analyses for those phenotypes which indicated a potential effect on tanning ability from an MR analysis of 33 preselected phenotypes on self-reported natural hair colour. P-values in parentheses.
SNPs, single nucleotide polymorphisms; Est., estimation; Std, standard; LDL, low-density lipoprotein.