| Literature DB >> 32760811 |
Stephen Burgess1,2, George Davey Smith3,4, Neil M Davies3,4, Frank Dudbridge5, Dipender Gill6, M Maria Glymour7, Fernando P Hartwig3,8, Michael V Holmes9,10, Cosetta Minelli11, Caroline L Relton3,4, Evropi Theodoratou12,13.
Abstract
This paper provides guidelines for performing Mendelian randomization investigations. It is aimed at practitioners seeking to undertake analyses and write up their findings, and at journal editors and reviewers seeking to assess Mendelian randomization manuscripts. The guidelines are divided into nine sections: motivation and scope, data sources, choice of genetic variants, variant harmonization, primary analysis, supplementary and sensitivity analyses (one section on robust statistical methods and one on other approaches), data presentation, and interpretation. These guidelines will be updated based on feedback from the community and advances in the field. Updates will be made periodically as needed, and at least every 18 months. Copyright:Entities:
Keywords: Mendelian randomization; causal inference; genetic epidemiology; guidelines
Year: 2020 PMID: 32760811 PMCID: PMC7384151 DOI: 10.12688/wellcomeopenres.15555.2
Source DB: PubMed Journal: Wellcome Open Res ISSN: 2398-502X
Figure 1. Flowchart highlighting some of the key analytic choices in performing a Mendelian randomization (MR) analysis.
Figure 2. Generic analytic pipeline for Mendelian randomization (MR).
Figure 3. Checklist of questions to consider when reviewing a Mendelian randomization investigation.
Summary of some methods proposed for Mendelian randomization: inverse-variance weighted method and robust methods.
| Method | Consistency
| Strengths and weaknesses | Reference | Software |
|---|---|---|---|---|
| Inverse-variance
| All variants valid or
| Most efficient (greatest statistical power), biased if average
|
|
|
| MR-Egger | InSIDE | Sensitive to outliers, sensitive to violations of InSIDE assumption,
|
|
|
| Weighted median | Majority valid | Robust to outliers, sensitive to addition/removal of genetic variants |
|
|
| Mode-based
| Plurality valid | Robust to outliers, sensitive to bandwidth parameter and addition/
|
|
|
| MR-PRESSO | Outlier-robust | Removes outliers, efficient with valid IVs, very high false positive
|
|
|
| MR-Robust | Outlier-robust | Downweights outliers, efficient with valid IVs, high false positive
|
|
|
| MR-Lasso | Outlier-robust | Removes outliers, efficient with valid IVs, high false positive rate
|
| |
| MR-RAPS | Balanced pleiotropy
| Downweights outliers, sensitive to violations of balanced
|
|
|
| Contamination
| Plurality valid | Robust to outliers, sensitive to variance parameter and addition/
|
|
|
| MR-Mix | Plurality valid | Robust to outliers, requires large numbers of genetic variants,
|
|
|
Each of the methods in the table can be implemented using summarized data. False positive rates refer to the simulation study by Slob and Burgess [27]. InSIDE is the Instrument Strength Independent of Direct Effect assumption.
* Implemented in MendelianRandomization package for R ( https://cran.r-project.org/web/packages/MendelianRandomization/index.html)
† Implemented in mrrobust package for Stata ( https://github.com/remlapmot/mrrobust)
‡ Implemented for R in its own software package:
- MR-PRESSO in mrpresso package ( https://github.com/rondolab/MR-PRESSO),
- MR-RAPS in mr.raps package ( https://github.com/qingyuanzhao/mr.raps),
- MR-Mix in MRMix package ( https://github.com/gqi/MRMix).
Figure 4. Directed acyclic graphs illustrating validity and invalidity of instrumental variable assumptions in different scenarios.
a) Mediator is on causal pathway from exposure to outcome. b) Mediator is on causal pathway from genetic variants to exposure. c) Genetic variants influence the exposure, which has downstream effect on a related variable which does not affect the outcome. d) Genetic variants influence a related variable, and the related variable affects the outcome and exposure of interest. e) Genetic variants influence the outcome primarily, and only influence the exposure via the outcome. We note that the related variable may be known or unknown.
In scenarios a, b, and c, as there is no alternative pathway from the genetic variants to the outcome, the instrumental variable assumptions are satisfied. In scenario d, the pathway from the genetic variants to the outcome does not pass via the exposure, and so the instrumental variable assumptions are not satisfied for the exposure (although they are satisfied for the related variable). Scenarios a, b, and c are examples of “vertical pleiotropy” that do not invalidate the instrumental variable assumptions. Scenario d reflects a situation where the causal risk factor has been incorrectly identified – it is not the exposure, but the related variable. Scenario e reflects a reverse causation situation where the genetic variant has been incorrectly identified as primarily affecting the exposure.
Figure 5. Scatter plot of genetic associations with the outcome (vertical axis) against genetic associations with the exposure (horizontal axis).
Examples illustrated are: (left) no heterogeneity in the variant-specific causal estimates (effect of LDL-cholesterol on coronary heart disease risk using 8 variants associated with LDL-cholesterol); and (right) heterogeneity in the variant-specific causal estimates (effect of C-reactive protein on coronary heart disease risk using 17 genome-wide significant predictors of C-reactive protein). As indicated by differences in estimates, not all genetic variants are valid instrumental variables for C-reactive protein, and so a causal interpretation is not appropriate. Taken from Burgess et al., 2018 [68].