| Literature DB >> 35622304 |
Maria Carolina Borges1,2, Deborah A Lawlor1,2,3, Qian Yang4,5, Eleanor Sanderson1,2, Kate Tilling1,2,3.
Abstract
With the increasing size and number of genome-wide association studies, individual single nucleotide polymorphisms are increasingly found to associate with multiple traits. Many different mechanisms could result in proposed genetic IVs for an exposure of interest being associated with multiple non-exposure traits, some of which could bias MR results. We describe and illustrate, through causal diagrams, a range of scenarios that could result in proposed IVs being related to non-exposure traits in MR studies. These associations could occur due to five scenarios: (i) confounding, (ii) vertical pleiotropy, (iii) horizontal pleiotropy, (iv) reverse causation and (v) selection bias. For each of these scenarios we outline steps that could be taken to explore the underlying mechanism and mitigate any resulting bias in the MR estimation. We recommend MR studies explore possible IV-non-exposure associations across a wider range of traits than is usually the case. We highlight the pros and cons of relying on sensitivity analyses without considering particular pleiotropic paths versus systematically exploring and controlling for potential pleiotropic or other biasing paths via known traits. We apply our recommendations to an illustrative example of the effect of maternal insomnia on offspring birthweight in UK Biobank.Entities:
Keywords: Causal diagram; Confounding; Mendelian randomization; Pleiotropy; Selection bias; UK Biobank
Mesh:
Year: 2022 PMID: 35622304 PMCID: PMC9329407 DOI: 10.1007/s10654-022-00874-5
Source DB: PubMed Journal: Eur J Epidemiol ISSN: 0393-2990 Impact factor: 12.434
Fig. 1Directed acyclic graphs illustrating scenarios when an unexpected genetic instrumental variable-non-exposure trait association could be observed. Z: genetic instrumental variable; X: exposure of interest; Y: outcome of interest; U: unmeasured confounders; W: non-exposure traits; C: confounding factors, e.g. population stratification, cryptic relatedness and assortative mating; S, selection. For simplicity, we use single nodes even when there may be multiple variables, and these scenarios do not consider time-varying exposures and critical/sensitive-period exposure effects [2, 17]. Scenarios illustrated by 1.1, 1.2, 3.1–3.3, 5.1–5.4 would be expected to bias the MR estimate of X–Y effect; 1.3, 1.4, 2.1, 2.2 and 3.4 would not; 5.2 and 5.4 would be unbiased under the null
Approaches to explore the plausibility of the scenario and methods that can produce unbiased test of the causal effect under the corresponding scenario
| Scenarios | One-sample MR with individual data | Two-sample MR with summary data |
|---|---|---|
| Population stratification | Check Z—population structure (e.g. principal components, birthplace, home location or study centre) | Check how the GWAS of X and Y dealt with population structure |
| Adjust for population structure in Z–W and compare the adjusted estimates with crude estimates | Use negative control outcomesa | |
| Cryptic relatedness | Estimate genetic similarity | Check how GWAS of X and Y dealt with cryptic relatedness |
| Remove one individual from each genetically related pair from analyses | ||
| Assortative mating | Check associations of individuals’ Z (for X) with their partner’s W and Z (for W) | Rely on previous evidence |
| Adjust for parental Z for X in two-stage least squares | ||
| Linkage disequilibrium | Use ‘robust methods’ when multiple SNPs from different regions are proposed as IVs | Use ‘robust methods’ when multiple SNPs from different regions are proposed as IVs |
| Apply colocalization methods (e.g. HEIDI [ | Apply colocalization methods (e.g. HEIDI [ | |
| Vertical pleiotropy | Univariable MR for W–Y, bidirectional MR between X and W, tests for heterogeneity between multiple Zc, IV inequalities test (for categorical exposure only)d | Univariable MR for W–Y, bidirectional MR between X and W and Steiger directionality test, tests for heterogeneity between Zc, MR-Egger intercept |
| Method: Two-stage least squares (for continuous outcomes) | Method: Inverse variance weighted | |
| Horizontal pleiotropy | Univariable MR for W–Y, bidirectional MR between X and W, tests for heterogeneity between multiple Zc, IV inequalities test (for categorical exposure only)d | Univariable MR for W–Y, bidirectional MR between X and W and Steiger directionality test, tests for heterogeneity between Zc, MR-Egger intercept |
| Methods: Multivariable MR, sisVIVE, MR-GENIUS, MR-MiSTERI | Methods: Multivariable MR, MR-Egger, weighted median, weighted mode, MR-PRESSO, MR-TRYX |
GWAS genome-wide association studies, MR Mendelian randomization, W non-exposure traits, X exposure of interest, Y outcome of interest, Z genetic instrumental variables
aNegative control outcome is assumed to share the same underlying confounding structure as the outcome of interest, but not be influenced by the exposure of interest[37]
bHEIDI assumes that there is a single causal SNP within the locus, and each other SNP shows an effect due to LD with the causal SNP[22, 23]
cWe assume that multiple genetic IVs are unrelated, and an observed heterogeneity is due to different mechanisms of multiple IV-exposure associations[38]
dIV inequalities test would be limited by low sensitivity with a small number of proposed IVs but low computational burden, or by high computational burden with a large number of proposed IVs but high sensitivity[39]
Summary of select sensitivity analyses for exploring bias due to horizontal pleiotropy in Mendelian randomization (MR)
| Name | Brief description | MR assumptions | Other issues |
|---|---|---|---|
| sisVIVE [ | It is an extension to two-stage least squares, which incorporates LASSO penalization | (1) relevance; (2) independence; (3) the alternative to exclusion restriction: at least 50% of proposed IVs are valid; (4) monotonicitya | It works for continuous outcomes only, is computationally intensive, and the current implementation do not provide 95% CIs |
| MR-GENIUS [ | It is a version of G-estimation which is robust to time-varying SNP-exposure associations, unmeasured confounding and violation of IV assumptions | (1) the alternative to relevance: proposed genetic IVs should strongly affect the variance rather than the mean of the exposure; both (2) independence and (3) exclusion restriction can be relaxed; (4) the alternative to homogeneityb: no additive interaction with unmeasured selection | Estimates on binary exposures have ambiguous units. Proposed genetic IVs often explain a small variance of the exposure |
| MR-MiSTERI [ | It is another version of G-estimation for estimating the causal effect among compliersa, which is robust to time-varying SNP-exposure associations, unmeasured confounding and violation of IV assumptions | (1) relevance, (2) independence, and (3) exclusion restriction all can be relaxed; (4) the alternative to monotonicitya: exposure-outcome effect does not vary with proposed invalid IV on additive scale; selection bias due to confounding does not vary with proposed invalid IV on multiplicative scale; residual variance for outcome is heteroscedastic and thus varies with proposed invalid IV | Its R package can only be used for continuous exposure and outcome at present |
| MR-Egger [ | It allows a non-zero intercept to test unbalanced horizontal pleiotropy | (1) relevance; (2) independence; (3) the alternative to exclusion restriction: InSIDEc; (4) homogeneityb | It is sensible to outliers and tends to suffer from low statistical power |
| Weighted median [ | It is defined as the median of a weighted empirical density function of the Wald ratio estimates | (1) relevance; (2) independence; (3) the alternative to exclusion restriction: at least 50% of weight comes from valid IVs; (4) homogeneityb | Nil |
| Weighted mode [ | It calculates the weighted mode of the Wald ratio estimates | (1) relevance; (2) independence; (3) the alternative to exclusion restriction: zero modal pleiotropy assumption d; (4) homogeneitya or monotonicity b | Researchers need to choose a bandwidth to obtain the clustering effect, and different bandwidths might provide inconsistent estimates [ |
| MR-PRESSO [ | It assesses horizontal pleiotropy based on the contribution of each SNP to heterogeneity and provides adjusted MR estimates by removing outlier SNPs | (1) relevance; (2) independence; (3) the alternative to exclusion restriction: InSIDEc and outliers (identified via MR-PRESSO global test) are due to potential horizontal pleiotropy; (4) homogeneitya or monotonicityb | After removing outlier SNPs, the standard errors would decrease. Therefore, it would be more likely to reject the null |
| MR-TRYX [ | It assesses horizontal pleiotropy based on the contribution of each SNP to heterogeneity and attempts to adjust for their horizontal pleiotropic effects using extra publicly available GWAS from MR-Base | (1) relevance; (2) independence; (3) the alternative to exclusion restriction: outliers (identified via RadialMR [ | GWAS from MR-Base may not cover the whole genome or conducted in the target population (e.g. only female participants) |
ACE average causal effect, CI confidence interval, GWAS genome-wide association studies, InSIDE instrument strength independent of direct effect, IV instrumental variable, SNPs single nucleotide polymorphisms
aMonotonicity means proposed IV cannot increase exposure level in some participants while decrease it in others. Thus, MR quantifies the magnitude of ACE among the unknown subpopulation of compliers who are monotonically affected by the proposed IV (i.e. local ACE)[2, 4]
bHomogeneity means the exposure-outcome effect is homogeneous across population and does not depend on proposed IV. Thus, MR quantifies the magnitude of ACE for the reference population (i.e. global ACE)[2, 4]
cThe strength of genetic IV – exposure association should not be correlated with the strength of the pleiotropic effects across proposed IVs[70]
dThe majority of SNPs could be invalid providing that the set of SNPs which form the largest homogeneous cluster are valid[72]
Fig. 2Associations of unweighted polygenetic risk score (PRS) for insomnia with six non-exposure traits before and after adjustment for population stratification. Supplementary Table 1 summarizes how education, frequency of alcohol intake and ever smoking are coded in this study
Fig. 3Mendelian randomization estimates for a non-exposure traits-birthweight (W–Y) effects, b non-exposure traits-insomnia (W–X) effects, and c insomnia-non-exposure traits (X–W) effects. “Usually” having insomnia is coded as 1, while “sometimes/rarely/never” having insomnia is coded as 0 (Supplementary Table 1)
Fig. 4Multivariable Mendelian randomization (MVMR) estimates for the effect of maternal insomnia on offspring birthweight. Estimates are differences in mean birthweight when comparing reporting usually experiencing insomnia to never, rarely or sometimes experiencing it with and without adjustment for potential horizontal pleiotropy via maternal age at first birth, education and ever smoking
Fig. 5Sensitivity analyses for the effect of maternal insomnia on offspring birthweight using two-sample Mendelian randomization (MR)