| Literature DB >> 21216802 |
Tom M Palmer1, Debbie A Lawlor, Roger M Harbord, Nuala A Sheehan, Jon H Tobias, Nicholas J Timpson, George Davey Smith, Jonathan A C Sterne.
Abstract
Mendelian randomisation analyses use genetic variants as instrumental variables (IVs) to estimate causal effects of modifiable risk factors on disease outcomes. Genetic variants typically explain a small proportion of the variability in risk factors; hence Mendelian randomisation analyses can require large sample sizes. However, an increasing number of genetic variants have been found to be robustly associated with disease-related outcomes in genome-wide association studies. Use of multiple instruments can improve the precision of IV estimates, and also permit examination of underlying IV assumptions. We discuss the use of multiple genetic variants in Mendelian randomisation analyses with continuous outcome variables where all relationships are assumed to be linear. We describe possible violations of IV assumptions, and how multiple instrument analyses can be used to identify them. We present an example using four adiposity-associated genetic variants as IVs for the causal effect of fat mass on bone density, using data on 5509 children enrolled in the ALSPAC birth cohort study. We also use simulation studies to examine the effect of different sets of IVs on precision and bias. When each instrument independently explains variability in the risk factor, use of multiple instruments increases the precision of IV estimates. However, inclusion of weak instruments could increase finite sample bias. Missing data on multiple genetic variants can diminish the available sample size, compared with single instrument analyses. In simulations with additive genotype-risk factor effects, IV estimates using a weighted allele score had similar properties to estimates using multiple instruments. Under the correct conditions, multiple instrument analyses are a promising approach for Mendelian randomisation studies. Further research is required into multiple imputation methods to address missing data issues in IV estimation.Entities:
Mesh:
Year: 2011 PMID: 21216802 PMCID: PMC3917707 DOI: 10.1177/0962280210394459
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 3.021
Figure 1.DAG for a Mendelian randomisation analysis using four genetic variants as instrumental variables for the effect of fat mass on bone mineral density.
Study participant characteristics, total eligible children N = 5509
| Mean (SD), geometric mean (95% CI) or | HWE | ||
|---|---|---|---|
| Gender: | 5509 (100%) | 2713 (49.3%) | |
| Age: Mean (SD) years | 5509 (100%) | 9.88 (0.32) | |
| BMD: geometric mean (95% CI) g/cm[ | 5509 (100%) | 0.902 (0.900, 0.903) | |
| Fat mass: geometric mean (95% CI) g | 5509 (100%) | 7209 (7100, 7320) | |
| Height: mean (SD) cm | 5509 (100%) | 139.6 (6.3) | |
| 5091 (92%) | TT = 0: 868 (37%) | 0.51 | |
| TA = 1: 2413 (47%) | |||
| AA = 2: 810 (16%) | |||
| 5412 (98%) | TT = 0: 3115 (58%) | 0.04 | |
| TC = 1: 2017 (37%) | |||
| CC = 2: 280 (5%) | |||
| 5323 (97%) | CC = 0: 3705 (70%) | 0.57 | |
| CT = 1: 1465 (28%) | |||
| TT = 2: 153 (3%) | |||
| 5303 (96%) | AA = 0: 1731 (33%) | 0.84 | |
| AG = 1: 2604 (49%) | |||
| GG = 2: 968 (18%) |
HWE: Hardy–Weinberg Equilibrium.
Associations of genotypes with potential confounding factors
| Number of risk alleles | |||||
|---|---|---|---|---|---|
| Genetic variant Covariate (unit) ( | 0 | 1 | 2 | ||
| Continuous confounding factors | Mean (95% CI) | Mean (95% CI) | Mean (95% CI) | Regression coefficient | |
| Height (cm) (5091) | 139.5 (139.2, 139.7) | 139.6 (139.3, 139.8) | 139.8 (139.4, 140.3) | 0.18 (−0.07, 0.42), | |
| Lean mass (g) (2515) | 24 426 (24 218, 24 634) | 24 620 (24 439, 24 800) | 24 593 (24 287, 24 899) | 104 (−74, 283), | |
| Height (cm) (5412) | 139.7 (139.4, 139.9) | 139.5 (139.2, 139.8) | 140.1 (139.4, 140.9) | 0.01 (−0.28, 0.29), | |
| Lean mass (g) (2685) | 24 548 (24 387, 24 708) | 24 636 (24 438, 24 834) | 24 910 (24 362, 25 458) | 128 (−78, 334), | |
| Height (cm) (5323) | 139.7 (139.5, 139.9) | 139.5 (139.1, 139.8) | 139.3 (138.3, 140.3) | −0.24 (−0.56, 0.08), | |
| Lean mass (g) (2640) | 24 770 (24 622, 24 917) | 24 286 (24 053, 24 519) | 24 017 (23 293, 24 740) | −447 (−679, −215), | |
| Height (cm) (5303) | 139.5 (139.3, 139.8) | 139.6 (139.4, 139.9) | 139.7 (139.3, 140.1) | 0.10 (−0.14, 0.34), | |
| Lean mass (g) (2625) | 24 596 (24 382, 24 810) | 24 655 (24 479, 24 832) | 24 525 (24 234, 24 816) | −21 (−198, 155), | |
| Categorical confounding factors | Odds ratio | ||||
| MEA (2421) | 139/857 (16%) | 189/1161 (16%) | 69/403 (17%) | 1.03 (0.88, 1.20), | |
| HHSC (2329) | Chi-squared | ||||
| MEA (2591) | 255/1492 (17%) | 155/971 (16%) | 25/128 (20%) | 0.99 (0.83, 1.18), | |
| HHSC (2485) | Chi-squared | ||||
| MEA (2543) | 314/1765 (18%) | 107/705 (15%) | 4/73 (5%) | 0.74 (0.60, 0.92), | |
| HHSC (2438) | Chi-squared | ||||
| MEA (2532) | 151/838 (18%) | 203/1236 (16%) | 69/458 (13%) | 0.90 (0.77, 1.04), | |
| HHSC (2432) | Chi-squared | ||||
MEA: Mother’s highest educational achievement is a binary variable derived from the groups 0 = CSE, O-level, Vocational and 1 = A-level and degree.
HHSC: Head of household social class coded as categorical variable I, II, III non-manual, III manual, IV and V.
Assuming an additive genetic model.
OLS and IV estimates of the effect of fat mass on bone mineral density (BMD) based on complete case analysis, N = 4796[a]
| Method | First stage regression coefficient (95% CI) | First stage | First stage | Ratio of geometric mean BMD[ | SE of estimate (log scale) | Hausman test | Sargan test P-value |
|---|---|---|---|---|---|---|---|
| OLS | NA | NA | NA | 1.22 (1.19, 1.26), | 0.014 | NA | NA |
| IV: SNP(s) used as IV | |||||||
| | 0.11 (0.08, 0.15) | 0.0082 | 39.83 | 1.44 (1.05, 1.97), | 0.16 | 0.300 | NA |
| | 0.09 (0.05, 0.13) | 0.0037 | 17.85 | 2.33 (1.34, 4.05), | 0.28 | 0.006 | NA |
| | −0.06 (−0.11, −0.02) | 0.0016 | 7.47 | 2.27 (0.98, 5.28), | 0.43 | 0.089 | NA |
| | 0.05 (0.01, 0.09) | 0.0016 | 7.57 | 0.98 (0.47, 2.03), | 0.37 | 0.540 | NA |
| | NA | 0.0119 | 29.92 | 1.67 (1.27, 2.19), | 0.14 | 0.020 | 0.11 |
| | NA | 0.0136 | 21.95 | 1.73 (1.34, 2.24), | 0.13 | 0.010 | 0.22 |
| | NA | 0.0153 | 18.59 | 1.63 (1.28, 2.06), | 0.12 | 0.013 | 0.16 |
| Unweighted allele score (4 SNPs) | 0.06 (0.04, 0.08) | 0.0069 | 33.15 | 1.40 (0.99, 1.98), | 0.18 | 0.430 | NA |
| Weighted allele score (4 SNPs) | 0.19 (0.15, 0.24) | 0.0153 | 74.35 | 1.63 (1.29, 2.07), | 0.12 | 0.012 | NA |
Analyses adjusted for height and height squared.
For a 1 unit increase in z-score of age and gender standardised fat mass.
Simulation 1 (non-weak instruments): results (Monte Carlo standard error reported in brackets beside each estimate)
| Model | Average bias | MSE | Average SE | Coverage | Average | Average F | Average absolute TSLS/OLS bias ratio |
|---|---|---|---|---|---|---|---|
| 1. OLS | 0.8194 (0.00005) | 0.6714 (0.00009) | 0.0054 (7 E–7) | 0 | NA | NA | NA |
| 2. TSLS | −0.0019 (0.0004) | 0.0016 (0.00002) | 0.03991 (0.00003) | 0.9523 (0.0021) | 0.1163 (0.0001) | 581.41 (0.504) | 0.0022 (0.0005) |
| 3. TSLS | −0.00004 (0.0003) | 0.0010 (0.00002) | 0.03215 (0.00002) | 0.9467 (0.0022) | 0.1898 (0.0001) | 474.09 (0.333) | 0.0001 (0.0004) |
| 4. TSLS | 0.00084 (0.0003) | 0.0009 (0.00001) | 0.0301 (0.00002) | 0.9487 (0.0022) | 0.2212 (0.0001) | 368.41 (0.243) | 0.0012 (0.0004) |
| 5. TSLS allele score | −0.00098 (0.0003) | 0.0010 (0.00002) | 0.0316 (0.00002) | 0.9486 (0.0022) | 0.1981 (0.0001) | 990.22 (0.685) | 0.0010 (0.0004) |
| 6. TSLS weighted allele score | 0.00084 (0.0003) | 0.0009 (0.00001) | 0.0301 (0.00002) | 0.9492 (0.0022) | 0.2212 (0.0001) | 1105.43 (0.730) | 0.0012 (0.0004) |
MSE: mean squared error, SE: standard error, TSLS: two-stage least squares, OLS: ordinary least squares.
Figure 2.Simulation 1 (non-weak instruments): power curves.
Simulation 2 (non-weak and weak instruments): results (Monte Carlo standard error in brackets beside each estimate)
| Model | Average bias | MSE | Average SE | Coverage | Average | Average F | Av. absolute TSLS/OLS bias ratio |
|---|---|---|---|---|---|---|---|
| 1. OLS | 0.990 (0.00001) | 0.980 (0.00003) | 0.0014 (1.9 E-7) | 0 (0) | NA | NA | NA |
| 2. TSLS | −0.047 (0.0025) | 0.067 (0.003) | 0.237 (0.0015) | 0.93 (0.0025) | 0.005 (0.00002) | 24.92 (0.099) | 0.047 (0.003) |
| 3. TSLS | 0.001 (0.0017) | 0.028 (0.0006) | 0.164 (0.0006) | 0.92 (0.0027) | 0.008 (0.00003) | 20.99 (0.065) | 0.001 (0.002) |
| 4. TSLS | 0.040 (0.0013) | 0.020 (0.0003) | 0.137 (0.0004) | 0.89 (0.0031) | 0.011 (0.00003) | 13.50 (0.036) | 0.041 (0.001) |
| 5. TSLS allele score | −0.026 (0.0018) | 0.032 (0.0007) | 0.172 (0.0006) | 0.94 (0.0024) | 0.008 (0.00003) | 40.99 (0.128) | 0.027 (0.002) |
| 6. TSLS weighted allele score | 0.001 (0.0017) | 0.028 (0.0006) | 0.164 (0.0006) | 0.92 (0.0027) | 0.008 (0.00003) | 41.99 (0.129) | 0.001 (0.002) |
| 7. TSLS allele score | −0.024 (0.0016) | 0.027 (0.0006) | 0.160 (0.0005) | 0.94 (0.0024) | 0.009 (0.00003) | 45.91 (0.136) | 0.024 (0.002) |
| 8. TSLS weighted allele score | 0.040 (0.0013) | 0.020 (0.0003) | 0.137 (0.0004) | 0.89 (0.0031) | 0.011 (0.00003) | 54.01 (0.145) | 0.041 (0.001) |
MSE: mean squared error, SE: standard error, TSLS: two-stage least squares, OLS: ordinary least squares.
Figure 3.Simulation 2 (non-weak and weak instruments): power curves.
IV estimates of the effect of fat mass on bone mineral density (BMD) using all available data[a]
| SNPs used as instrumental variable | First stage regression coefficient (95% CI) | First stage | First stage | Ratio of geometric mean BMD[ | SE of estimate (log scale) | Hausman test | Sargan test | |
|---|---|---|---|---|---|---|---|---|
| OLS | 5509 | NA | NA | NA | 1.22 (1.18, 1.25), | 0.014 | NA | NA |
| IV: SNP(s) used as IV | ||||||||
| | 5091 | 0.12 (0.08, 0.15) | 0.0088 | 45.35 | 1.41 (1.05, 1.89), | 0.15 | 0.320 | NA |
| | 5412 | 0.09 (0.05, 0.13) | 0.0037 | 19.95 | 2.42 (1.42, 4.12), | 0.27 | 0.002 | NA |
| | 5323 | −0.06 (−0.11, −0.02) | 0.0013 | 6.99 | 2.17 (0.92, 5.12), | 0.44 | 0.130 | NA |
| | 5303 | 0.05 (0.01, 0.08) | 0.0013 | 6.90 | 0.92 (0.42, 2.01), | 0.40 | 0.463 | NA |
| | 5007 | NA | 0.0125 | 31.61 | 1.60 (1.24, 2.07), | 0.13 | 0.029 | 0.221 |
| | 4881 | NA | 0.0138 | 22.75 | 1.69 (1.32, 2.17), | 0.13 | 0.006 | 0.227 |
Analyses adjusted for height and height squared.
For a 1 unit increase in z-score of age and gender standardised fat mass.