| Literature DB >> 35274198 |
Kaitlin H Wade1,2, James Yarmolinsky3,4, Richard M Martin3,4, Caroline L Relton3,4,5, Edward Giovannucci6, Sarah J Lewis3,4,5, Iona Y Millwood7, Marcus R Munafò4,5,8, Fleur Meddens9,10, Kimberley Burrows3,4, Joshua A Bell3,4, Neil M Davies3,4,11, Daniela Mariosa12, Noora Kanerva13, Emma E Vincent3,4,14, Karl Smith-Byrne12, Florence Guida12, Marc J Gunter12, Eleanor Sanderson3,4, Frank Dudbridge15, Stephen Burgess16,17, Marilyn C Cornelis18, Tom G Richardson3,4, Maria Carolina Borges3,4, Jack Bowden3,4,19, Gibran Hemani3,4, Yoonsu Cho3,4, Wes Spiller3,4, Rebecca C Richmond3,4, Alice R Carter3,4, Ryan Langdon3,4, Deborah A Lawlor3,4,5, Robin G Walters7, Karani Santhanakrishnan Vimaleswaran20, Annie Anderson21, Meda R Sandu3,4,22, Kate Tilling3,4,5, George Davey Smith3,4,5.
Abstract
Dietary factors are assumed to play an important role in cancer risk, apparent in consensus recommendations for cancer prevention that promote nutritional changes. However, the evidence in this field has been generated predominantly through observational studies, which may result in biased effect estimates because of confounding, exposure misclassification, and reverse causality. With major geographical differences and rapid changes in cancer incidence over time, it is crucial to establish which of the observational associations reflect causality and to identify novel risk factors as these may be modified to prevent the onset of cancer and reduce its progression. Mendelian randomization (MR) uses the special properties of germline genetic variation to strengthen causal inference regarding potentially modifiable exposures and disease risk. MR can be implemented through instrumental variable (IV) analysis and, when robustly performed, is generally less prone to confounding, reverse causation and measurement error than conventional observational methods and has different sources of bias (discussed in detail below). It is increasingly used to facilitate causal inference in epidemiology and provides an opportunity to explore the effects of nutritional exposures on cancer incidence and progression in a cost-effective and timely manner. Here, we introduce the concept of MR and discuss its current application in understanding the impact of nutritional factors (e.g., any measure of diet and nutritional intake, circulating biomarkers, patterns, preference or behaviour) on cancer aetiology and, thus, opportunities for MR to contribute to the development of nutritional recommendations and policies for cancer prevention. We provide applied examples of MR studies examining the role of nutritional factors in cancer to illustrate how this method can be used to help prioritise or deprioritise the evaluation of specific nutritional factors as intervention targets in randomised controlled trials. We describe possible biases when using MR, and methodological developments aimed at investigating and potentially overcoming these biases when present. Lastly, we consider the use of MR in identifying causally relevant nutritional risk factors for various cancers in different regions across the world, given notable geographical differences in some cancers. We also discuss how MR results could be translated into further research and policy. We conclude that findings from MR studies, which corroborate those from other well-conducted studies with different and orthogonal biases, are poised to substantially improve our understanding of nutritional influences on cancer. For such corroboration, there is a requirement for an interdisciplinary and collaborative approach to investigate risk factors for cancer incidence and progression.Entities:
Keywords: Cancer; Causality; Mendelian randomization; Nutrition
Mesh:
Year: 2022 PMID: 35274198 PMCID: PMC9010389 DOI: 10.1007/s10552-022-01562-1
Source DB: PubMed Journal: Cancer Causes Control ISSN: 0957-5243 Impact factor: 2.532
Fig. 1Framework and assumptions of Mendelian randomization (MR) analyses. In addition to a gene-environment equivalence assumption, MR relies on the following three core assumptions of formal instrumental variable analysis (in addition to those described as the “homogeneity”, “monotonicity” and “no effect modification” assumptions): (1) the “relevance” assumption—the genetic variant(s) being used as an instrument (Z) is robustly associated with the exposure (X); (2) the “independence” or “exchangeability” assumption—there are no common causes of the genetic variant(s) and outcome (e.g., population substructure, assortative mating and dynastic effects); and (3) the “exclusion restriction” assumption—there is no independent pathway between the genetic variant(s) and outcome (Y) other than through the exposure (X)—also known as horizontal pleiotropy
Overview of sensitivity analyses available to examine evidence of violations of Mendelian randomization assumptions
| Method | Purpose | What it does | Assumptions | Strengths | Limitations |
|---|---|---|---|---|---|
| MR-Egger regression and intercept test | Examines invalidation of the third MR assumption (i.e., horizontal pleiotropy). Specifically, this method tests for the presence of directional pleiotropy (MR-Egger intercept test) and the robustness of findings to directional pleiotropy (MR-Egger regression) | Performs a weighted generalized linear regression of the SNP-outcome effect estimates on the SNP-exposure effect estimates with an unconstrainted intercept term. If the InSIDE and NOME assumptions are met, the intercept term can provide a formal statistical test for directional pleiotropy and the slope generated from MR-Egger regression can provide an effect estimate that is adjusted for directional pleiotropy | InSIDE, NOME | Permits unbiased causal effects to be estimated even when all variants are invalid IVs | Sensitive to outliers; requires the InSIDE assumption to hold; low statistical power in the presence of no invalid instruments |
| Weighted median [ | Examines invalidation and robustness of findings of the third MR assumption (i.e., horizontal pleiotropy) | Individual SNP effect estimates are ordered and weighted by the inverse of their variance. Providing at least 50% of the instruments are valid, the weighted median of this distribution is taken as an unbiased estimate of the causal effect | The median estimate (weighted by precision of SNPs) is unaffected by horizontal pleiotropy | Greater statistical power than MR-Egger; does not require the InSIDE assumption | Requires at least 50% of the information from variants to come from valid IVs |
| Weighted mode [ | Examines invalidation and robustness of findings of the third MR assumption (i.e., horizontal pleiotropy) | Individual SNP effect estimates are ordered and weighted by the inverse of their variance. Providing the ZEMPA assumption is satisfied, the weighted mode generates a causal estimate using the mode of a smoothed empirical density function of the distribution of weighted SNP effect estimates | ZEMPA | Can generate unbiased causal estimates even when many SNPs in an instrument are invalid | Lower statistical power to detect causal effects than weighted median, under the condition of no invalid instruments; sensitive to bandwidth parameter |
| MR-CAUSE [ | Examines invalidation and robustness of findings of the third MR assumption (i.e., horizontal pleiotropy) | Compares the expected log pointwise posterior density (i.e., estimate of how well the posterior distribution of a model is expected to predict a new set of data) under three models: a “sharing model” (i.e., permitting horizontal pleiotropy but no causal effect between traits), a “causal model” (i.e., permitting horizontal pleiotropy and assuming a causal effect), and a “null model” (i.e., neither a causal nor shared factor) | Assumes a single unobserved shared factor between two traits of interest | Can account for both correlated and non-correlated horizontal pleiotropy. Greater statistical power than MR-Egger and weighted mode when there is a true causal effect and no correlated horizontal pleiotropy | Inferior control of false positive rate in the presence of no causal effect and 0 to 50% of variants acting through a shared factor, as compared to MR-Egger and the weighted mode. Has somewhat lower statistical power than the weighted median approach when there is a true causal effect and no correlated horizontal pleiotropy |
| Multivariable MR [ | Examines invalidation and robustness of findings of the third MR assumption (i.e., horizontal pleiotropy) | Performs a weighted generalized linear regression with adjustment for measured horizontal pleiotropy between instruments and outcomes | Requires that there are at least as many genetic instruments available as there are exposures | Can adjust estimates for the presence of measured horizontal pleiotropy | There can still be horizontal pleiotropy through variants having effects on unmeasured outcomes that are independent to the exposure of interest |
Outlier detection tests (e.g., MR-PRESSO [ | Remove or down-weight genetic variants that are outliers in an MR analysis | Differing methods | Perform better when a large proportion of variants are not horizontally pleiotropic | Explicitly remove or down-weight contributions of outliers that may be indicative of IV assumption violations; can improve statistical efficiency of models | Residual directional pleiotropy can remain after removing or down-weighting outlying SNPs; methods are underpowered when few SNPs are available; interpretation of an “outlying” variant may be ambiguous when there are few SNPs available in a multi-SNP instrument |
| Colocalization | Examines whether an association of a SNP with two or more traits represents both traits sharing a single causal variant or distinct causal variants in linkage disequilibrium | Differing methods | Some methods assume at most a single causal variant within the region for two traits examined | Can rule out findings being driven by two traits having distinct causal variants in high linkage disequilibrium | Can be underpowered for disease outcomes as compared to molecular traits |
| Steiger filtering/reverse direction MR [ | Examines whether the association of a variant with two traits (e.g., A and B) represents a proximal effect of the variant on trait A which then influences trait B or vice versa. Reverse direction MR attempts to understand direction of effect between two traits | Steiger filtering compares the proportion of the variance explained in the exposure and outcome by SNPs used as instruments to help establish directionality between associations | No horizontal pleiotropy | Can help to elucidate the direction of association between two traits | Steiger filtering is sensitive to differences in measurement error and sample size across traits examined |
InSIDE INstrument Strength Independent of Direct Effect, NOME NO Measurement Error, ZEMPA ZEro Modal Pleiotropy Assumption
Fig. 2Multivariable MR for correlated nutritional factors. Multivariable MR uses multiple genetic instruments (Z1, …, Zn) associated with multiple, potentially correlated exposures (e.g., X1, X2, and X3) to jointly estimate the independent causal effect of each of the exposures on a particular outcome (Y). It can also be used to explore mediation following two-step MR analyses to provide a better understanding of the direct, indirect and total effects of each exposure [64, 115]
Fig. 3Directed acyclic graph illustrating selection bias in a Mendelian randomization analysis of cancer prognosis. In this example, estimating the causal effect of body mass index on colorectal cancer survival, the sample is restricted to colorectal cancer cases. Conditioning analyses on colorectal cancer incidence (i.e., case status, a collider in this scenario) could generate a spurious association between two causes of colorectal cancer incidence (i.e., body mass index and cigarette smoking). This then induces an association between body mass index and colorectal cancer survival (via cigarette smoking) even in the absence of a true causal relationship between these two traits in the target population
World Cancer Research Fund (WCRF) cancer prevention recommendations
| • |
| • |
| • |
| • |
| • |
| • |
| • |
| • |
| • |
| • |
Examples of genetic variants associated with nutritional and nutritionrelated factors
| • |
| • |
| • |
| • Macronutrients (e.g., the 21 loci published by the Social Sciences Genetic Association Consortium [SSGAC] [ |
| • Iron, ferritin, transferrin and transferrin saturation |
| • Alpha- and beta-carotene, retinol |
| • Calcium |
| • Blood-based metabolites (e.g., those published by Kettunen et al. [ |
| • |
| • |
| • e.g., Body mass index (e.g., the 941 published by the Genetic Investigation of ANthropometric Traits [GIANT] consortium) [ |
| • The human gut microbiome (e.g., those published by Hughes et al. [ |