| Literature DB >> 33527565 |
Matthew J Tudball1,2, Jack Bowden1,2,3, Rachael A Hughes1,2, Amanda Ly1,2, Marcus R Munafò1,2,4, Kate Tilling1,2, Qingyuan Zhao5, George Davey Smith1,2.
Abstract
A key assumption in Mendelian randomisation is that the relationship between the genetic instruments and the outcome is fully mediated by the exposure, known as the exclusion restriction assumption. However, in epidemiological studies, the exposure is often a coarsened approximation to some latent continuous trait. For example, latent liability to schizophrenia can be thought of as underlying the binary diagnosis measure. Genetically driven variation in the outcome can exist within categories of the exposure measurement, thus violating this assumption. We propose a framework to clarify this violation, deriving a simple expression for the resulting bias and showing that it may inflate or deflate effect estimates but will not reverse their sign. We then characterise a set of assumptions and a straight-forward method for estimating the effect of SD increases in the latent exposure. Our method relies on a sensitivity parameter which can be interpreted as the genetic variance of the latent exposure. We show that this method can be applied in both the one-sample and two-sample settings. We conclude by demonstrating our method in an applied example and reanalysing two papers which are likely to suffer from this type of bias, allowing meaningful interpretation of their effect sizes.Entities:
Keywords: Mendelian randomisation analysis; biomarkers; latent variable modelling; sensitivity analysis
Mesh:
Year: 2021 PMID: 33527565 PMCID: PMC8603937 DOI: 10.1002/gepi.22376
Source DB: PubMed Journal: Genet Epidemiol ISSN: 0741-0395 Impact factor: 2.344
Figure 1Effect of schizophrenia liability on risk of ever using cannabis for several choices of sensitivity parameter . 95% confidence intervals are estimated as in section C of the appendix
Figure 2Effect of childhood body mass index on risk of several diseases for several choices of sensitivity parameter . 95% confidence intervals are estimated as in section C of the appendix
Figure 3In the Falconer framework, liability to a disease is assumed to follow a smooth (often normal) distribution. The disease occurs at the tail of the distribution, with the grey region representing expected prevalence in the population
Figure 4The framework proposed in Section 2 is summarised in a directed acyclic graph. Dotted circles represent latent variables and complete circles represent observed variables
Figure 5Comparison of estimated effect with “true” effect for various BMI thresholds. N = 70,261, , and 95% confidence intervals are generated over 1000 bootstrap resamples. “True” corresponds to the sample estimate using BMI as the exposure; “naive” corresponds to using the binary measure as the exposure ; and “latent” corresponds to the latent variable estimator of Section 3.2. BMI, body mass index
Ratio of estimated to true with link function misspecification
| Value of the skewness parameter | ||||||
|---|---|---|---|---|---|---|
| Choice of link function | 0 | 1 | 2 | 3 | 4 | 5 |
| Logistic | 1.01 | 1.02 | 1.03 | 1.05 | 1.06 | 1.07 |
| [1.01, 1.02] | [1.01, 1.03] | [1.03, 1.04] | [1.04, 1.06] | [1.05, 1.07] | [1.06, 1.07] | |
| Probit | 1.01 | 1.02 | 1.03 | 1.05 | 1.06 | 1.07 |
| [1.00, 1.02] | [1.01, 1.03] | [1.02, 1.04] | [1.04, 1.06] | [1.05, 1.07] | [1.06, 1.08] | |
| Semiparametric* | 1.00 | 1.00 | 1.00 | 1.00 | 1.01 | 1.01 |
| [0.99, 1.00] | [0.99, 1.01] | [0.99, 1.01] | [1.00, 1.01] | [1.00, 1.01] | [1.00, 1.02] | |
*Klein and Spady estimator; mean over 1000 draws; N = 2500; ; 95% Monte Carlo confidence.
Ratio of estimated to true with threshold dependence
| Choice of link function | Value of the threshold dependence parameter | ||||
|---|---|---|---|---|---|
| 0 | 0.1 | 0.25 | 0.5 | 1 | |
| Logistic | 1.01 | 1.08 | 1.17 | 1.34 | 1.71 |
| [1.01, 1.02] | [1.07, 1.08] | [1.16, 1.18] | [1.33, 1.36] | [1.69, 1.72] | |
| Probit | 1.01 | 1.07 | 1.17 | 1.33 | 1.69 |
| [1, 1.02] | [1.06, 1.08] | [1.16, 1.18] | [1.32, 1.34] | [1.68, 1.70] | |
| Semiparametric* | 1.00 | 1.05 | 1.15 | 1.30 | 1.67 |
| [0.99, 1] | [1.05, 1.06] | [1.14, 1.16] | [1.29, 1.31] | [1.66, 1.69] | |
*Klein & Spady estimator; mean over 1000 draws; N = 2500; ; 95% Monte Carlo confidence intervals.