| Literature DB >> 36248277 |
April E Hartley1, Grace M Power1, Eleanor Sanderson1, George Davey Smith1.
Abstract
Mendelian randomization (MR) is an increasingly popular component of an epidemiologist's toolkit, used to provide evidence of a causal effect of one trait (an exposure, eg, body mass index [BMI]) on an outcome trait or disease (eg, osteoarthritis). Identifying these effects is important for understanding disease etiology and potentially identifying targets for therapeutic intervention. MR uses genetic variants as instrumental variables for the exposure, which should not be influenced by the outcome or confounding variables, overcoming key limitations of traditional epidemiological analyses. For MR to generate a valid estimate of effect, key assumptions must be met. In recent years, there has been a rapid rise in MR methods that aim to test, or are robust to violations of, these assumptions. In this review, we provide an overview of MR for a non-expert audience, including an explanation of these key assumptions and how they are often tested, to aid a better reading and understanding of the MR literature. We highlight some of these new methods and how they can be useful for specific methodological challenges in the musculoskeletal field, including for conditions or traits that share underlying biological pathways, such as bone and joint disease.Entities:
Keywords: CAUSAL INFERENCE; EPIDEMIOLOGY; GENETIC EPIDEMIOLOGY; MENDELIAN RANDOMIZATION; STATISTICAL METHODS
Year: 2022 PMID: 36248277 PMCID: PMC9549705 DOI: 10.1002/jbm4.10675
Source DB: PubMed Journal: JBMR Plus ISSN: 2473-4039
Fig. 1The three core assumptions of Mendelian randomization. Red dashed arrows represent pathways that should not exist if the instrument is valid. Blue arrows represent conditions that need to be true for an instrument to be considered valid. The black dotted arrow represents the fact that the confounder does not need to influence the exposure to violate instrumental variable 2 (IV2): the assumption is no confounding of the genetic instrument and the outcome.
Fig. 2Directed acyclic graph highlighting vertical versus horizontal pleiotropy. (A) Vertical pleiotropy is where the instrument is only associated with the outcome (fracture) via the exposure (type 2 diabetes). This is a key requirement of Mendelian randomization (MR) and does not lead to biased effect estimates (except in the case of misspecifying the primary phenotype outlined below). (B) Horizontal pleiotropy is where the instrument is associated with the outcome via a pathway independent of the exposure. This will lead to bias in the effect estimate for type 2 diabetes on fracture. (C) Where the instrument is in close proximity to a single‐nucleotide polymorphism (SNP) strongly associated with the outcome, the genotypes at these two loci can be correlated due to linkage disequilibrium (LD). This will cause the same bias as horizontal pleiotropy in an MR study. It is important to note that although SNPs are often selected based on the lowest p value in a region, they are not necessarily causal for the exposure but are in LD with a disease‐causing variant. (D) Misspecification of the primary phenotype can cause bias in an MR analysis. This occurs when genetic variants associated with a phenotype upstream of the exposure are identified in the genomewide association study (GWAS) of the exposure due to a large sample size and thus well‐powered GWAS. For example, in an MR of type 2 diabetes on fracture, a positive effect of type 2 diabetes may be observed due to the inclusion of body mass index (BMI) SNPs as instruments for type 2 diabetes.
Summary of the Two Approaches to Mendelian Randomization
| Individual‐level data MR | Summary‐level data MR | |
|---|---|---|
|
Requirements |
Measured exposure, outcome in the same population. Genotype dosage for all instruments in the same population (or a polygenic risk score) | SNP‐exposure association results and SNP‐outcome association results from separate populations, including: Effect allele Other allele Effect allele frequency Beta for the per allele effect on the exposure or outcome (unit increase or log odds) Standard error for the beta |
| Possible analysis methods |
To test for causal effect: linear/logistic regression of SNP genotype or polygenic risk score on the outcome To quantify causal effect: Single SNP: Wald ratio estimate βoutcome/βexposure Single/multiple SNPs/polygenic risk score: Two‐stage least‐squares regression |
To test for causal effect: determine SNP‐outcome effect using summary statistics from a published GWAS To quantify causal effect: Single SNP: Wald ratio estimate Multiple SNPs: an inverse‐variance weighted meta‐analysis of the Wald ratio estimate for each SNP |
| Testing the relevance assumption | First‐stage | Mean |
| Testing the independence assumption | Associations between the instrument(s) and potential confounders can be directly tested for all known/measured confounders | N/A |
| Testing the exclusion‐restriction assumption | Sargan test for heterogeneity in individual SNP results |
Cochran's MR‐Egger intercept as a measure of the average effect of the SNP on the outcome when there is no effect of the SNP on the exposure |
| Pleiotropy‐robust methods |
MVMR MR‐GENIUS controls for some directional pleiotropy(
sisVIVE and adaptive LASSO for outlier removal(
|
Several methods, including MR‐Egger, weighted median, weighted mode, MR‐CAUSE, MR‐PRESSO, MVMR, reviewed in Sanderson et al.(9) Can be broadly categorized as outlier adjustment, outlier removal, or estimate adjustment methods |
| Benefits |
More flexibility in models (eg, can test for non‐linear effects) and covariates Ability to perform subgroup analyses (eg, sex‐stratified) |
Larger sample sizes increase power Greater range of sensitivity analyses to determine pleiotropy‐robust estimates of causal effect |
| Limitations |
Sample limited to those with measured exposure, outcome, and genotype, often restricting sample size Fewer methods to interrogate pleiotropy Weak instrument bias is toward the observational (confounded) estimate, potentially resulting in type 1 error |
Assumes the two study populations are drawn from the same underlying population in terms of ethnicity, sex distribution, etc. Weak instrument bias toward the null, resulting in type 2 error Overlap in individuals between samples can result in bias toward the observational estimate in the presence of weak instruments (type 1 error) Unable to control which covariates are adjusted for Unable to perform subgroup analyses unless summary statistics are available for specific subgroups for both exposure and outcome |
Fig. 3Results commonly presented in Mendelian randomization (MR) studies. (A) A scatterplot is often presented comparing results of the different summary‐level meta‐analyses. Here we are looking for consistency in the slope (ie, causal effect estimate) between the different approaches, which we do observe. The MR‐Egger slope is slightly weaker due to the intercept not being fixed at 0, although the intercept is close to 0 and the slope is similar in direction and magnitude to the other methods, meaning we can be more confident in the inverse‐variance weighted (IVW) causal effect estimate. (B) A single‐nucleotide polymorphism (SNP) analysis can be performed to detect heterogeneity, with each point on the forestplot representing the Wald ratio for an individual SNP. Here we do observe heterogeneity in effect estimates, which means we should be cautious of horizontal pleiotropy, but the majority of SNPs do provide causal effect estimates in the positive direction. (C) The funnel, or volcano, plot shows the Wald ratio for each SNP on the x axis and the weight on the y axis (ie, the inverse of the standard error). Asymmetry in this plot would indicate that an SNP has a large effect on the outcome, relative to its precision, indicating pleiotropy. As this plot is relatively symmetrical, we can be more confident in the IVW estimate. (D) A leave‐one‐out analysis can identify outlier SNPs with a large effect on the overall IVW estimate. If the effect estimate (indicated by the points, with horizontal bars indicating 95% confidence intervals) changes in magnitude and/or direction when an individual SNP is excluded, this is evidence that the SNP is an invalid instrument. In this case, there is no evidence to suggest any of the SNPs are outliers. (E) A radial plot of the SNP weight versus the beta times its weight can show outlier SNPs (in green), plotted alongside the IVW estimate. All plots have been generated using the TwoSampleMR( ) and RadialMR( ) packages and publicly available summary statistics( , ) available through the IEU OpenGWAS project.( )
Fig. 4Directed acyclic graph illustrating the concept of multivariable Mendelian randomization. Including additional instruments for body mass index (BMI) in the analysis accounts for the horizontal pleiotropy acting via BMI and gives an estimate of the direct effect of bone mineral density (BMD) on hip osteoarthitis (OA) and BMI on hip OA. However, if there is another pleiotropic pathway via an unknown or unmeasured variable (Exposure3), the effect estimates will still be biased. GRS = genetic risk score.
Fig. 5Directed acyclic graphs highlighting the difference between uncorrelated and correlated pleiotropy. *Weighted median and weighted mode can give an unbiased estimate in the presence of either form of pleiotropy, as long as at least 50% SNPs are non‐pleiotropic for the weighted median and the most common Wald ratio comes from the non‐pleiotropic SNPs for the weighted mode. Dashed arrows represent pathways that do not need to be present. BMI = body mass index; BMD = bone mineral density; GWS = genomewide significant; OA = osteoarthritis; SNP = single‐nucleotide polymorphism; MR = Mendelian randomization; LCV = latent causal variable; LHC = latent heritable confounder.
Fig. 6Illustration of the potential issue of collider bias in a study of disease progression/prognosis. The black box around fracture indicates that this variable has been conditioned on by restricting the prognosis (fracture‐related mortality) analysis to individuals who have experienced a fracture. In this hypothetical example, by conditioning on having a fracture, an inverse association between experiencing a fall and level of physical activity may be induced, with those experiencing a fall measuring a higher level of physical activity than fracture cases who have not experienced a fall. If physical inactivity also causes fracture‐related mortality, experiencing a fall may appear protective against mortality. This will also be the case for any instrument associated with experiencing falls. The magnitude and direction of bias will depend on the interaction between the causes of fracture on probability of experiencing a fracture.( )