| Literature DB >> 35639294 |
Claudia Coscia1,2, Dipender Gill3,4,5,6, Raquel Benítez1, Teresa Pérez2, Núria Malats1, Stephen Burgess7.
Abstract
Mendelian randomization (MR) uses genetic variants as instrumental variables to investigate the causal effect of a risk factor on an outcome. A collider is a variable influenced by two or more other variables. Naive calculation of MR estimates in strata of the population defined by a collider, such as a variable affected by the risk factor, can result in collider bias. We propose an approach that allows MR estimation in strata of the population while avoiding collider bias. This approach constructs a new variable, the residual collider, as the residual from regression of the collider on the genetic instrument, and then calculates causal estimates in strata defined by quantiles of the residual collider. Estimates stratified on the residual collider will typically have an equivalent interpretation to estimates stratified on the collider, but they are not subject to collider bias. We apply the approach in several simulation scenarios considering different characteristics of the collider variable and strengths of the instrument. We then apply the proposed approach to investigate the causal effect of smoking on bladder cancer in strata of the population defined by bodyweight. The new approach generated unbiased estimates in all the simulation settings. In the applied example, we observed a trend in the stratum-specific MR estimates at different bodyweight levels that suggested stronger effects of smoking on bladder cancer among individuals with lower bodyweight. The proposed approach can be used to perform MR studying heterogeneity among subgroups of the population while avoiding collider bias.Entities:
Keywords: Bladder cancer; Bodyweight; Collider bias; Mendelian randomization; Smoking; Stratification
Mesh:
Year: 2022 PMID: 35639294 PMCID: PMC9329404 DOI: 10.1007/s10654-022-00879-0
Source DB: PubMed Journal: Eur J Epidemiol ISSN: 0393-2990 Impact factor: 12.434
Fig. 1Directed Acyclic Graphs (DAGs) illustrating relationships between the variables. (A) Mendelian Randomization causal diagram with the instrumental variable assumptions. The dashed lines between G and Y and between G and U, represent violations of the IV2 and IV3 assumptions respectively. (B) DAG considering a collider variable C, being a common child of genetic instrument G and confounders U. When conditioning on C (indicated by the square box on C), G and U become correlated (dashed line between G and U) and a violation of the IV3 assumption occurs. (C) DAG considering a collider variable C, being a common child of risk factor X and outcome Y. (D) DAG illustrating the variables and parameters used for the simulation study. Dashed line from Y to C correspond to simulation scenarios B1 to B3
Fig. 2Augmented Directed Acyclic Graph (DAG) to demonstrate the validity of the IV assumptions conditional on the residual collider. Augmented DAG where G: genetic instrument, X: risk factor, X0: residual risk factor, Y: outcome, U: measured and unmeasured confounders, C: collider, C0: residual collider. Using the rules of d-separation, the instrumental variable G is independent of the outcome Y conditional on the exposure X, confounders U, and residual collider C0 (assumption IV2), and is independent of the confounders U conditional on C0 (assumption IV3), and so is still a valid instrument conditional on C0
Median of estimates and empirical Type I error rates for Scenario A1 (null causal effect 0) with positive, negative, and mixed confounding, and 0.1
| µ1 | µ2 | Positive confounding (α2 and β2 = 0.8) | Negative confounding (α2 and β2 = -0.8) | Mixed confounding (α2 = 0.8 and β2 = -0.8) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Median estimate | Type I error rate (%) | Median estimate | Type I error rate (%) | Median estimate | Type I error rate (%) | Median estimate | Type I error rate (%) | Median estimate | Type I error rate (%) | Median estimate | Type I error rate (%) | ||
| No adjust for collider | Adjust Y/G for collider | No adjust for collider | Adjust Y/G for collider | No adjust for collider | Adjust Y/G for collider | ||||||||
| −1 | −1 | 0.01 | 7% | −0.27 | 70% | 0.01 | 5% | 0.09 | 10% | 0.00 | 5% | 0.28 | 69% |
| −0.5 | 0.00 | 6% | −0.28 | 66% | −0.01 | 3% | −0.12 | 15% | 0.00 | 6% | 0.29 | 69% | |
| 0 | 0.00 | 5% | −0.24 | 50% | −0.01 | 4% | −0.26 | 53% | 0.01 | 5% | 0.25 | 54% | |
| 0.5 | 0.01 | 6% | −0.10 | 15% | 0.01 | 6% | −0.28 | 69% | 0.00 | 6% | 0.12 | 15% | |
| 1 | 0.00 | 5% | 0.08 | 8% | 0.01 | 3% | −0.27 | 68% | 0.00 | 4% | −0.08 | 11% | |
| −0.5 | −1 | 0.01 | 6% | −0.16 | 30% | 0.00 | 6% | 0.15 | 21% | 0.00 | 5% | 0.17 | 34% |
| −0.5 | 0.00 | 6% | −0.18 | 32% | 0.00 | 5% | 0.04 | 7% | 0.01 | 4% | 0.18 | 30% | |
| 0 | 0.00 | 5% | −0.11 | 16% | 0.00 | 3% | −0.12 | 14% | 0.00 | 3% | 0.11 | 14% | |
| 0.5 | −0.01 | 5% | 0.02 | 5% | 0.00 | 6% | −0.17 | 30% | 0.00 | 4% | −0.03 | 7% | |
| 1 | 0.01 | 5% | 0.15 | 25% | 0.00 | 6% | −0.18 | 34% | 0.01 | 4% | −0.15 | 21% | |
| 0 | −1 | −0.01 | 7% | 0.00 | 6% | 0.00 | 6% | 0.00 | 6% | 0.00 | 6% | 0.00 | 6% |
| −0.5 | 0.00 | 7% | 0.00 | 6% | 0.00 | 6% | 0.00 | 6% | 0.00 | 5% | −0.01 | 5% | |
| 0 | 0.00 | 6% | 0.01 | 6% | 0.01 | 6% | 0.01 | 6% | 0.00 | 6% | 0.00 | 6% | |
| 0.5 | −0.01 | 6% | 0.00 | 6% | 0.00 | 5% | 0.00 | 6% | 0.00 | 7% | 0.00 | 7% | |
| 1 | 0.00 | 5% | 0.01 | 6% | 0.00 | 4% | 0.00 | 4% | −0.01 | 6% | −0.01 | 5% | |
| 0.5 | −1 | 0.01 | 4% | 0.16 | 22% | 0.01 | 6% | −0.17 | 35% | 0.00 | 4% | −0.15 | 23% |
| −0.5 | −0.01 | 5% | 0.02 | 5% | −0.01 | 7% | −0.19 | 36% | 0.01 | 6% | −0.03 | 7% | |
| 0 | 0.00 | 5% | −0.11 | 14% | 0.00 | 4% | −0.11 | 15% | −0.01 | 4% | 0.10 | 15% | |
| 0.5 | 0.01 | 4% | −0.17 | 28% | 0.00 | 5% | 0.03 | 7% | 0.00 | 5% | 0.18 | 34% | |
| 1 | 0.01 | 6% | −0.17 | 31% | 0.01 | 5% | 0.15 | 23% | 0.00 | 5% | 0.17 | 33% | |
| 1 | −1 | 0.01 | 5% | 0.08 | 8% | 0.01 | 5% | −0.27 | 70% | 0.01 | 3% | −0.07 | 9% |
| −0.5 | 0.01 | 4% | −0.10 | 13% | 0.01 | 5% | −0.27 | 64% | −0.01 | 5% | 0.11 | 18% | |
| 0 | 0.01 | 6% | −0.24 | 52% | 0.00 | 5% | −0.24 | 50% | −0.01 | 3% | 0.24 | 48% | |
| 0.5 | 0.01 | 5% | −0.27 | 66% | 0.01 | 4% | −0.11 | 15% | 0.00 | 4% | 0.28 | 66% | |
| 1 | 0.01 | 4% | −0.26 | 68% | 0.00 | 4% | 0.08 | 10% | 0.02 | 5% | 0.29 | 75% | |
Empirical Type I error rate represents the proportion of simulated datasets where the null hypothesis is not rejected.
Median of causal estimates in different quartiles, and proportion of datasets in which the homogeneity test was rejected for Scenario A2 (fixed causal effect of 0.5) with positive confounding and 0.1
| µ1 | µ2 | Positive confounding (α2 and β2 = 0.8) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Stratifying on collider, C | Stratifying on residual collider, C0 | ||||||||||
| Proportion homogeneity rejected (%) | Median estimates Q1 | Median estimates Q2 | Median estimates Q3 | Median estimates Q4 | Proportion homogeneity rejected (%) | Median estimates Q1 | Median estimates Q2 | Median estimates Q3 | Median estimates Q4 | ||
| −1 | −1 | 8% | 0.11 | −0.01 | 0.01 | 0.10 | 8% | 0.49 | 0.48 | 0.49 | 0.50 |
| −0.5 | 6% | 0.09 | −0.05 | −0.05 | 0.09 | 5% | 0.53 | 0.48 | 0.51 | 0.53 | |
| 0 | 7% | 0.07 | 0.01 | −0.04 | 0.07 | 6% | 0.50 | 0.52 | 0.50 | 0.49 | |
| 0.5 | 7% | 0.19 | 0.10 | 0.09 | 0.17 | 6% | 0.49 | 0.50 | 0.49 | 0.48 | |
| 1 | 4% | 0.41 | 0.37 | 0.40 | 0.40 | 4% | 0.50 | 0.48 | 0.52 | 0.49 | |
| −0.5 | −1 | 7% | 0.30 | 0.21 | 0.23 | 0.26 | 5% | 0.53 | 0.53 | 0.52 | 0.48 |
| −0.5 | 5% | 0.24 | 0.16 | 0.18 | 0.25 | 5% | 0.48 | 0.47 | 0.51 | 0.48 | |
| 0 | 4% | 0.29 | 0.23 | 0.21 | 0.29 | 4% | 0.50 | 0.47 | 0.45 | 0.49 | |
| 0.5 | 6% | 0.47 | 0.42 | 0.47 | 0.46 | 6% | 0.50 | 0.50 | 0.50 | 0.50 | |
| 1 | 3% | 0.59 | 0.65 | 0.64 | 0.60 | 5% | 0.48 | 0.50 | 0.51 | 0.50 | |
| 0 | −1 | 6% | 0.50 | 0.50 | 0.48 | 0.48 | 6% | 0.51 | 0.50 | 0.48 | 0.47 |
| −0.5 | 4% | 0.48 | 0.48 | 0.52 | 0.50 | 4% | 0.47 | 0.47 | 0.52 | 0.50 | |
| 0 | 4% | 0.49 | 0.48 | 0.51 | 0.52 | 4% | 0.49 | 0.49 | 0.51 | 0.51 | |
| 0.5 | 5% | 0.52 | 0.49 | 0.49 | 0.53 | 6% | 0.54 | 0.49 | 0.48 | 0.53 | |
| 1 | 4% | 0.48 | 0.48 | 0.51 | 0.49 | 4% | 0.48 | 0.47 | 0.52 | 0.50 | |
| 0.5 | −1 | 4% | 0.62 | 0.63 | 0.66 | 0.63 | 4% | 0.51 | 0.49 | 0.52 | 0.52 |
| −0.5 | 5% | 0.45 | 0.44 | 0.44 | 0.47 | 4% | 0.49 | 0.50 | 0.49 | 0.51 | |
| 0 | 4% | 0.32 | 0.23 | 0.23 | 0.29 | 4% | 0.51 | 0.51 | 0.47 | 0.49 | |
| 0.5 | 5% | 0.25 | 0.18 | 0.19 | 0.25 | 3% | 0.51 | 0.49 | 0.52 | 0.50 | |
| 1 | 5% | 0.26 | 0.23 | 0.20 | 0.28 | 5% | 0.46 | 0.51 | 0.49 | 0.50 | |
| 1 | −1 | 5% | 0.41 | 0.37 | 0.34 | 0.40 | 5% | 0.49 | 0.50 | 0.46 | 0.49 |
| −0.5 | 5% | 0.16 | 0.11 | 0.09 | 0.21 | 4% | 0.49 | 0.49 | 0.50 | 0.51 | |
| 0 | 6% | 0.12 | −0.03 | 0.00 | 0.11 | 6% | 0.50 | 0.50 | 0.53 | 0.51 | |
| 0.5 | 6% | 0.04 | −0.03 | −0.02 | 0.07 | 4% | 0.47 | 0.51 | 0.51 | 0.50 | |
| 1 | 6% | 0.14 | 0.00 | 0.03 | 0.10 | 6% | 0.54 | 0.49 | 0.50 | 0.49 | |
Proportion homogeneity rejected represents the proportion of simulated datasets where the null hypothesis of homogeneity is rejected
Median of causal estimates in different quartiles, and proportion of datasets in which the homogeneity test was rejected for Scenario A3 (varying causal effect) with positive confounding and 0.1
| Positive confounding (α2 and β2 = 0.8) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Stratifying on collider, C | Stratifying on residual collider, C0 | ||||||||||
| µ1 | µ2 | Proportion homogeneity rejected (%) | Median estimates Q1 | Median estimates Q2 | Median estimates Q3 | Median estimates Q4 | Proportion homogeneity rejected (%) | Median estimates Q1 | Median estimates Q2 | Median estimates Q3 | Median estimates Q4 |
| −1 | −1 | 48 | −0.39 | −0.07 | 0.14 | 0.58 | 95 | −0.46 | 0.19 | 0.64 | 1.27 |
| −0.5 | 30 | −0.33 | −0.06 | 0.00 | 0.44 | 88 | −0.36 | 0.24 | 0.59 | 1.15 | |
| 0 | 19 | −0.25 | −0.08 | 0.00 | 0.38 | 78 | −0.25 | 0.22 | 0.55 | 1.09 | |
| 0.5 | 15 | −0.11 | 0.05 | 0.14 | 0.46 | 61 | −0.19 | 0.24 | 0.56 | 0.99 | |
| 1 | 16 | 0.09 | 0.32 | 0.44 | 0.63 | 40 | −0.09 | 0.28 | 0.54 | 0.88 | |
| −0.5 | −1 | 36 | −0.11 | 0.15 | 0.32 | 0.71 | 68 | −0.07 | 0.34 | 0.64 | 1.08 |
| −0.5 | 19 | −0.07 | 0.14 | 0.30 | 0.58 | 48 | −0.01 | 0.38 | 0.64 | 0.94 | |
| 0 | 14 | 0.08 | 0.23 | 0.36 | 0.57 | 25 | 0.11 | 0.41 | 0.59 | 0.87 | |
| 0.5 | 11 | 0.22 | 0.46 | 0.55 | 0.76 | 16 | 0.16 | 0.43 | 0.56 | 0.83 | |
| 1 | 16 | 0.35 | 0.58 | 0.77 | 0.98 | 16 | 0.19 | 0.40 | 0.57 | 0.81 | |
| 0 | −1 | 24 | 0.24 | 0.49 | 0.66 | 0.95 | 24 | 0.24 | 0.51 | 0.70 | 0.94 |
| −0.5 | 14 | 0.33 | 0.52 | 0.67 | 0.90 | 15 | 0.33 | 0.50 | 0.66 | 0.89 | |
| 0 | 13 | 0.34 | 0.52 | 0.66 | 0.86 | 13 | 0.35 | 0.53 | 0.66 | 0.87 | |
| 0.5 | 13 | 0.33 | 0.51 | 0.69 | 0.88 | 13 | 0.33 | 0.52 | 0.69 | 0.89 | |
| 1 | 25 | 0.24 | 0.51 | 0.69 | 0.98 | 26 | 0.25 | 0.52 | 0.70 | 0.99 | |
| 0.5 | −1 | 18 | 0.45 | 0.69 | 0.87 | 1.07 | 18 | 0.38 | 0.59 | 0.77 | 1.01 |
| −0.5 | 15 | 0.34 | 0.50 | 0.64 | 0.88 | 18 | 0.37 | 0.61 | 0.79 | 1.03 | |
| 0 | 14 | 0.17 | 0.30 | 0.41 | 0.68 | 26 | 0.30 | 0.61 | 0.77 | 1.07 | |
| 0.5 | 19 | 0.05 | 0.22 | 0.37 | 0.72 | 45 | 0.20 | 0.57 | 0.84 | 1.17 | |
| 1 | 34 | 0.00 | 0.25 | 0.41 | 0.81 | 63 | 0.13 | 0.56 | 0.83 | 1.27 | |
| 1 | −1 | 16 | 0.25 | 0.46 | 0.55 | 0.88 | 40 | 0.26 | 0.68 | 0.90 | 1.33 |
| −0.5 | 12 | 0.03 | 0.16 | 0.30 | 0.58 | 53 | 0.19 | 0.65 | 0.97 | 1.37 | |
| 0 | 18 | −0.10 | −0.02 | 0.11 | 0.49 | 70 | 0.13 | 0.63 | 0.97 | 1.46 | |
| 0.5 | 23 | −0.16 | −0.02 | 0.11 | 0.60 | 83 | 0.04 | 0.59 | 0.98 | 1.54 | |
| 1 | 35 | −0.23 | 0.01 | 0.20 | 0.71 | 94 | −0.07 | 0.55 | 1.04 | 1.65 | |
Proportion homogeneity rejected represents the proportion of simulated datasets where the null hypothesis of homogeneity is rejected
Applied example using UK Biobank to investigate the effect of smoking status on bladder cancer risk in different bodyweight strata
| Bodyweight Q1 | Bodyweight Q2 | Bodyweight Q3 | Bodyweight Q4 | Heterogeneity test p-value | Trend test | |
|---|---|---|---|---|---|---|
| Stratifying on bodyweight | 1.59 [1.08; 2.33] | 1.58 [1.16; 2.14] | 1.13 [0.87; 1.45] | 1.11 [0.88; 1.41] | 0.151 | 0.051 |
| Stratifying on residual bodyweight | 1.61 [1.09; 2.37] | 1.73 [1.28; 2.34] | 1.25 [0.97; 1.62] | 1.10 [0.87; 1.39] | 0.084 | 0.019 |
Bodyweight Q1, Q2, Q3, Q4, represent the four quartiles for both collider and residual collider in which the causal effect of smoking on bladder cancer risk is estimated
Odds ratios (OR) and 95% confidence intervals (95% CI) for bladder cancer are represent estimates per one unit increase in the log odds of being a current smoker