| Literature DB >> 25064373 |
George Davey Smith1, Gibran Hemani2.
Abstract
Observational epidemiological studies are prone to confounding, reverse causation and various biases and have generated findings that have proved to be unreliable indicators of the causal effects of modifiable exposures on disease outcomes. Mendelian randomization (MR) is a method that utilizes genetic variants that are robustly associated with such modifiable exposures to generate more reliable evidence regarding which interventions should produce health benefits. The approach is being widely applied, and various ways to strengthen inference given the known potential limitations of MR are now available. Developments of MR, including two-sample MR, bidirectional MR, network MR, two-step MR, factorial MR and multiphenotype MR, are outlined in this review. The integration of genetic information into population-based epidemiological studies presents translational opportunities, which capitalize on the investment in genomic discovery research.Entities:
Mesh:
Year: 2014 PMID: 25064373 PMCID: PMC4170722 DOI: 10.1093/hmg/ddu328
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Figure 1.Schematic representation of MR. (A) Mendelian randomization can be used to test the hypothesis that trait A causes trait B, provided that conditions (1), (2) and (3) are met adequately, governing that ZA is a valid instrument, in that (1) it is associated with the intermediate phenotype of interest; (2) has no association with the outcome except through the intermediate phenotype, and (3) is not related to measured or unmeasured confounding factors. (B). In bi-directional MR, the causal direction between traits (A and B) (if any) can be elucidated, if valid instruments are present for each trait.
Examples of MR
| Type | Exposure/trait | Disease/outcome | Conclusion |
|---|---|---|---|
| Biomarkers | CRP | Coronary heart disease | Observational association between CRP and coronary heart disease is a result of confounding and/or reverse causation ( |
| Serum iron | Parkinson's disease | Higher serum iron levels lower the risk of Parkinson's disease ( | |
| Uric acid | Coronary heart disease | Observational association between uric acid and coronary heart disease is, in part, due to confounding by BMI ( | |
| Macrophage migration inhibitory factor (MIF) | Type 2 diabetes | Elevated MIF, amongst other factors, increases the risk of type 2 diabetes ( | |
| Interleukin 6 (IL6) | Coronary heart disease | IL6 increases the risk of coronary heart disease ( | |
| Behaviours | Smoking | Anxiety/depression | Anxiety and depression amongst smokers does not appear to be a consequence of smoking ( |
| Alcohol consumption | Blood pressure | Alcohol use increases blood pressure ( | |
| Physiological measures | BMI | Symptomatic gallstone disease | Higher BMI increases the risk of symptomatic gall stone disease ( |
| Maternal influences (corrected for genetic correlation between mother and child) | Alcohol consumption | Childhood school performance | The observational finding that moderate maternal alcohol intake is associated with more favourable school performance is due to confounding, and the casual association is in the opposite direction ( |
| Maternal BMI | Fat mass of offspring | Fat mass in children aged 9–11 is not strongly influenced by BMI of mothers during pregnancy ( |
Limitations of MR
| Limitation | Role in MR studies | Approaches to evaluating or avoiding the limitation |
|---|---|---|
| Low statistical power | MR studies are often of low power and effect estimates are imprecise because of this | Increase sample size and or combine genetic variants so they explain more of the variance of the intermediate phenotype |
| Reverse causation | A genetic variant may be causing the disease outcome which in turn causes the biomarker, or the causal direction could be in the opposite direction. 2SLS will not distinguish between these cases | Bi-directional MR can be used to distinguish between the two causal models |
| Population stratification | Spurious associations used as instruments can lead to faulty causal inference | Restrict analyses to ethnically homogeneous groups, and apply correction methods using ancestrally informative markers or principal components from genome-wide data. Perform analysis within a family study context, e.g. between siblings. |
| Reintroduced confounding though pleiotropy | A genetic variant may directly influence more than one post-transcriptional process. Known to be the case for some genetic variants | When possible utilize |
| LD induced confounding | LD is crucial in genetic association studies as it allows marker SNPs to proxy for un-genotyped causal SNPs. However, this can reintroduce confounding if LD leads to the association of SNPs related to more than one post-transcriptional process. This case will be similar to the pleiotropy situation | Studies can be carried out in populations with different LD structures. Approaches to avoiding distortion by pleiotropy will also counter problems owing to LD |
| Canalization/developmental compensation | During development, compensatory processes may be generated that counter the phenotypic perturbation consequent on the genetic variant utilized as an instrument | No general approach developed, although context-specific biological knowledge can be applied. The period of the life course when influence of genetic variation on intermediate phenotypes emerge can indicate whether canalization could, in principle, be an issue |
| Lack of genetic variants to proxy for modifiable exposure of interest | No reliable genetic variant associations for many intermediate phenotypes of interest, although an increasing number of these now identified | Continued genome-wide and sequencing-based studies |
| Complexity of associations | Without adequate biological knowledge, misleading inferences regarding intermediate phenotypes and disease may be drawn | Increased biological understanding of genotype–phenotype links |
Figure 2.Effect of lower LDL-C on risk of CHD [taken from Ference et al. (2012) (52)]. Boxes represent the proportional risk reduction (1-OR) of CHD for each exposure allele plotted against the absolute magnitude of lower LDL-C associated with that allele (measured in mg/dl). SNPs are plotted in order of increasing absolute magnitude of associations with lower LDL-C. The line (forced to pass through the origin) represents the increase in proportional risk reduction of CHD per unit lower long-term exposure to LDL-C.