| Literature DB >> 24388127 |
Amy E Taylor1, Neil M Davies2, Jennifer J Ware3, Tyler VanderWeele4, George Davey Smith2, Marcus R Munafò5.
Abstract
Mendelian randomization methods, which use genetic variants as instrumental variables for exposures of interest to overcome problems of confounding and reverse causality, are becoming widespread for assessing causal relationships in epidemiological studies. The main purpose of this paper is to demonstrate how results can be biased if researchers select genetic variants on the basis of their association with the exposure in their own dataset, as often happens in candidate gene analyses. This can lead to estimates that indicate apparent "causal" relationships, despite there being no true effect of the exposure. In addition, we discuss the potential bias in estimates of magnitudes of effect from Mendelian randomization analyses when the measured exposure is a poor proxy for the true underlying exposure. We illustrate these points with specific reference to tobacco research.Entities:
Keywords: Causal inference; Instrumental variable; Mendelian randomization; Smoking; Tobacco
Mesh:
Year: 2013 PMID: 24388127 PMCID: PMC3989031 DOI: 10.1016/j.ehb.2013.12.002
Source DB: PubMed Journal: Econ Hum Biol ISSN: 1570-677X Impact factor: 2.184
Definitions of genetic terms for Mendelian randomization.
| Term | Definition |
|---|---|
| Allele | One form of a genetic variant |
| Canalization | Process of developmental compensation for the effects of a genetic variant which may disrupt normal development |
| Genetic variant | Part of the genetic code for which there is more than one form in the population. This can be a single nucleotide polymorphism but other forms of variation exist |
| Genome wide association study (GWAS) | Hypothesis-free study which investigates associations of a large number of genetic variants across the whole genome with a trait of interest |
| Linkage disequilibrium | Non-random association between genetic variants at different positions along the chromosome |
| Pleiotropic | Influencing more than one phenotypic trait |
| Single nucleotide polymorphism (SNP) | Variation at a single nucleotide base pair in the DNA sequence |
Results of data simulation showing unbiased estimate from two-stage least-squares regression in the presence of confounding.
| Model | Beta (95% CI) |
|---|---|
| 0.26 (0.25, 0.28) | |
| −0.29 (−0.37, −0.21) |
True value of beta from simulation is −0.3.
Regression statistics of 10 most strongly associated randomly generated instruments.
| Association of instrument with exposure | Association of exposure with outcome | ||||||
|---|---|---|---|---|---|---|---|
| Beta | SE | Beta | SE | ||||
| Instrument 1 | −0.078 | 0.021 | <0.001 | 0.436 | 0.222 | 0.05 | 13.66 |
| Instrument 2 | 0.072 | 0.021 | 0.001 | 0.902 | 0.255 | <0.001 | 11.48 |
| Instrument 3 | 0.070 | 0.021 | 0.001 | 0.304 | 0.260 | 0.24 | 10.89 |
| Instrument 4 | 0.064 | 0.021 | 0.002 | 0.123 | 0.305 | 0.69 | 9.31 |
| Instrument 5 | 0.065 | 0.021 | 0.002 | 0.488 | 0.265 | 0.07 | 9.58 |
| Instrument 6 | −0.063 | 0.021 | 0.003 | 0.757 | 0.280 | 0.007 | 8.79 |
| Instrument 7 | 0.062 | 0.021 | 0.003 | 1.024 | 0.314 | 0.001 | 8.55 |
| Instrument 8 | 0.059 | 0.021 | 0.005 | 0.04 | 0.352 | 0.91 | 7.92 |
| Instrument 9 | 0.060 | 0.021 | 0.005 | 0.97 | 0.319 | 0.002 | 7.96 |
| Instrument 10 | 0.058 | 0.021 | 0.006 | 0.934 | 0.320 | 0.003 | 7.53 |
From linear regression of exposure on instrument.
From 2-stage least squares regression of outcome on exposure using the instrument.
F-statistic for strength of instrument.
Fig. 1Directed acyclic graphs for Mendelian randomization analyses of (A) total lifetime exposure to tobacco smoke and (B) reported number of cigarettes smoked per day with outcome measures. Observed variables are denoted using squares, unobserved variables are denoted using circles, causal effects have arrows. Dashed lines represent non-causal associations. X: total lifetime exposure to tobacco smoke, X2: reported number of cigarettes smoked per day, a genetic variant (Z), outcome (Y), and an unobserved confounder (U).