| Literature DB >> 26568383 |
Louise A C Millard1,2, Neil M Davies1, Nic J Timpson1, Kate Tilling1, Peter A Flach1,2, George Davey Smith1.
Abstract
Observational cohort studies can provide rich datasets with a diverse range of phenotypic variables. However, hypothesis-driven epidemiological analyses by definition only test particular hypotheses chosen by researchers. Furthermore, observational analyses may not provide robust evidence of causality, as they are susceptible to confounding, reverse causation and measurement error. Using body mass index (BMI) as an exemplar, we demonstrate a novel extension to the phenome-wide association study (pheWAS) approach, using automated screening with genotypic instruments to screen for causal associations amongst any number of phenotypic outcomes. We used a sample of 8,121 children from the ALSPAC dataset, and tested the linear association of a BMI-associated allele score with 172 phenotypic outcomes (with variable sample sizes). We also performed an instrumental variable analysis to estimate the causal effect of BMI on each phenotype. We found 21 of the 172 outcomes were associated with the allele score at an unadjusted p < 0.05 threshold, and use Bonferroni corrections, permutation testing and estimates of the false discovery rate to consider the strength of results given the number of tests performed. The most strongly associated outcomes included leptin, lipid profile, and blood pressure. We also found novel evidence of effects of BMI on a global self-worth score.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26568383 PMCID: PMC4644974 DOI: 10.1038/srep16645
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Associations of BMI allele score with BMI across childhood.
| Number with measured BMI | Mean age at BMI measurement | SD increase of log BMI during childhood per 1 SD increase of BMI allele score with 32 SNP variants | SD increase of log BMI during childhood per 1 SD increase of FTO allele dosages | SD increase of log BMI during childhood per 1 SD increase of BMI allele score with 31 SNP variants (FTO removed) | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SD change | 95% CI | F-statistic | SD change | 95% CI | F-statistic | SD change | 95% CI | F-statistic | ||
| 6,601 | 7.48 wks | 0.016 | −0.01, 0.04 | 1.65 | −0.018 | −0.04, 0.01 | 2.12 | 0.025 | 0.00, 0.05 | 4.22 |
| 6,282 | 40.50 wks | 0.032 | 0.01, 0.06 | 6.36 | −0.029 | −0.05, 0.00 | 5.19 | 0.047 | 0.02, 0.07 | 14.10 |
| 5,797 | 1.69 yrs | 0.036 | 0.01, 0.06 | 7.42 | −0.041 | −0.07, −0.02 | 9.66 | 0.056 | 0.03, 0.08 | 18.78 |
| 5,430 | 3.71 yrs | 0.068 | 0.04, 0.09 | 25.29 | −0.004 | −0.03, 0.02 | 0.10 | 0.075 | 0.05, 0.10 | 31.39 |
| 6,076 | 7.57 yrs | 0.142 | 0.12, 0.17 | 125.00 | 0.049 | 0.02, 0.07 | 14.88 | 0.133 | 0.11, 0.16 | 109.75 |
| 5,087 | 8.68 yrs | 0.163 | 0.14, 0.19 | 140.66 | 0.074 | 0.05, 0.10 | 28.18 | 0.146 | 0.12, 0.17 | 112.70 |
| 5,623 | 10.68 yrs | 0.175 | 0.15, 0.20 | 175.27 | 0.090 | 0.06, 0.12 | 45.96 | 0.150 | 0.12, 0.18 | 128.80 |
| 5,116 | 12.80 yrs | 0.170 | 0.14, 0.20 | 151.21 | 0.082 | 0.05, 0.11 | 35.28 | 0.150 | 0.12, 0.18 | 116.58 |
| 4,746 | 13.83 yrs | 0.167 | 0.14, 0.19 | 134.07 | 0.074 | 0.05, 0.10 | 26.78 | 0.150 | 0.12, 0.18 | 107.05 |
| 4,174 | 15.45 yrs | 0.158 | 0.13, 0.19 | 106.35 | 0.071 | 0.04, 0.10 | 21.60 | 0.142 | 0.11, 0.17 | 84.49 |
| 2,665 | 17.04 yrs | 0.176 | 0.14, 0.21 | 83.94 | 0.086 | 0.05, 0.12 | 20.14 | 0.153 | 0.12, 0.19 | 63.13 |
Abbreviations: BMI, body mass index; CI, confidence interval; SD, standard deviation; wks, weeks; yrs, years. All BMI variables are log transformed. BMI calculated as weight (kilograms) divided by height (metres squared). Based on the original data of 8,121 participants (variable sample size per BMI measurement).
aF-statistic: Measure of strength of association between exposure and outcome.
Association of BMI allele score and observational BMI at age 8 with potential confounders of BMI.
| Potential confounder | Sample size | SD change of BMI allele score for a 1 SD change of confounder | Sample size | SD change of log BMI age 8 for a 1 SD change of confounder | ||||
|---|---|---|---|---|---|---|---|---|
| SD change | 95% CI | P value | SD change | 95% CI | P value | |||
| Maternal education | ||||||||
| Less than O-level | 7,306 | Reference | 0.871 | 4,735 | Reference | 0.001 | ||
| O-level | −0.02 | −0.08, 0.04 | −0.05 | −0.13, 0.03 | ||||
| A-level | 0.03 | −0.04, 0.09 | −0.05 | −0.13, 0.03 | ||||
| Degree or above | −0.03 | −0.11, 0.04 | −0.16 | −0.25, −0.07 | ||||
| Sex | ||||||||
| Female | 8,121 | Reference | 5,087 | Reference | ||||
| Male | 0.00 | −0.05, 0.04 | 0.828 | 0.13 | 0.07, 0.18 | <0.001 | ||
| Household social class | ||||||||
| Class I (professional) | 6,929 | Reference | 0.776 | 4,554 | Reference | 0.213 | ||
| Class II | 0.01 | −0.06, 0.08 | 0.10 | 0.02, 0.18 | ||||
| Class III (non-manual) | −0.01 | −0.08, 0.07 | 0.09 | 0.00, 0.19 | ||||
| Class III (manual) | −0.01 | −0.11, 0.08 | 0.10 | −0.02, 0.21 | ||||
| Class IV/V (manual) | 0.02 | −0.11, 0.14 | 0.07 | −0.09, 0.23 | ||||
| Parity | ||||||||
| 0 | 7,408 | Reference | 0.850 | 4,741 | Reference | 0.789 | ||
| 1 | −0.03 | −0.08, 0.02 | −0.03 | −0.10, 0.03 | ||||
| 2 | 0.01 | −0.05, 0.07 | 0.01 | −0.07, 0.08 | ||||
| Mother smoking in pregnancy | ||||||||
| No | 7,484 | Reference | 4,788 | Reference | ||||
| Yes | 0.02 | −0.03, 0.08 | 0.444 | 0.18 | 0.11, 0.25 | <0.001 | ||
Abbreviations: SD, standard deviation; CI, confidence interval; BMI, body mass index.
aMaternal education: O-level (ordinary level) exams are taken in different subjects usually at age 15–16 at the completion of legally required school attendance, equivalent to today’s General Certificate of Secondary Education); A-level (Advanced-level) exams are taken in different subjects usually at age 18).
bHousehold social class: The mother recorded the occupation of both herself and her partner in a questionnaire at 32 weeks gestation, which were used to allocate them to social class groups using the 1991 Office of Population, Censuses and Surveys classification; the highest class of the mother and her partner was used in analysis.
cParity: Obtained from obstetric records. Mothers with parity of two or more were grouped into a single category.
dP values for linear trend given if more than two ordinal categories.
Ranking by association strength (P value) of the stage 1 tests with p < 0.05: Outcome associations with BMI allele score for original dataset.
| Rank | Outcome variable (original data with variable N) | SD change of inverse normal transformed outcome for a 1 SD change of BMI allele score | ||||
|---|---|---|---|---|---|---|
| Sample size | SD change | 95% CI | P value (adjusted P value) | Permutation P value | ||
| 1 | Leptin, 9 | 4,249 | 0.138 | 0.11, 0.17 | <0.001 | – |
| 2 | CRP, 9 | 4,250 | 0.083 | 0.05, 0.11 | <0.001 | – |
| 3 | Age menarche | 2,946 | −0.083 | −0.12, −0.05 | <0.001 | – |
| 4 | HDL, 9 | 4,250 | −0.067 | −0.10, −0.04 | <0.001 (0.002) | 0.008 |
| 5 | SBP, 7 | 6,013 | 0.049 | 0.02, 0.07 | <0.001 | – |
| 6 | IL6, 9 | 4,240 | 0.053 | 0.02, 0.08 | 0.001, (0.091) | 0.010 |
| 7 | Enjoyment of School Score, 4 | 5,807 | 0.041 | 0.02, 0.07 | 0.002, (0.255) | 0.022 |
| 8 | Self Esteem: Scholastic Competence, 8 | 5,222 | 0.042 | 0.02, 0.07 | 0.002, (0.323) | 0.025 |
| 9 | Apolipoprotein B, 9 | 4,250 | 0.043 | 0.01, 0.07 | 0.005, (0.788) | 0.028 |
| 10 | Triglycerides, 9 | 4,250 | 0.042 | 0.01, 0.07 | 0.006, (0.970) | 0.037 |
| 11 | VLDL, 9 | 4,250 | 0.042 | 0.01, 0.07 | 0.006, (1) | 0.048 |
| 12 | Apolipoprotein al, 9 | 4,250 | −0.038 | −0.07, −0.01 | 0.012, (1) | 0.046 |
| 13 | Insulin, 15 | 2,859 | 0.047 | 0.01, 0.08 | 0.012 | – |
| 14 | Attention/activity symptoms score, 11 | 4,541 | 0.037 | 0.01, 0.07 | 0.013, (1) | 0.055 |
| 15 | SDQ emotional symptoms score, 6 | 5,748 | −0.030 | −0.06, 0.00 | 0.022, (1) | 0.063 |
| 16 | Hygiene Score, 4 | 6,231 | 0.028 | 0.00, 0.05 | 0.024, (1) | 0.070 |
| 17 | Self Esteem: Global Self Worth Score, 8 | 5,214 | 0.031 | 0.00, 0.06 | 0.025, (1) | 0.079 |
| 18 | FVC: lung function, 8 | 5,276 | 0.030 | 0.00, 0.06 | 0.030 | − |
| 19 | Burden of compulsions/obsessions score, 7 | 5,684 | 0.028 | 0.00, 0.05 | 0.031, (1) | 0.080 |
| 20 | Particular fears score, 7 | 5,734 | −0.028 | −0.05, 0.00 | 0.033, (1) | 0.086 |
| 21 | Glucose, 15 | 2,862 | 0.041 | 0.11, 0.17 | 0.038 | – |
Abbreviations: BMI, body mass index; CI, confidence interval; SD, standard deviation; VLDL, very low density lipoprotein; IL6, interleukin 6; SBP, systolic blood pressure; HDL, high density lipoprotein; VLDL, very low density lipoprotein; SDQ, Strengths and Difficulties Questionnaires; LF, lung function; FVC, forced vital capacity. Full names of variables are given in Supplementary Table 3. All outcomes are transformed to normal distributions using a rank-based inverse normal transformation. Exposure and outcome variables are standardised. Outcome as dependent variable, BMI allele score as independent variable.
aAdjusted P values are adjusted for the 160 tests performed using the Bonferroni correction: pcorrected = poriginal·160. Adjusted P values greater than 1 are rounded to 1. Outcomes in validation set are excluded from Bonferroni correction.
bPermutation P values are generated with permutation testing. Null hypothesis: The outcome variable at rank i in this table would be ranked lower than i if no association was found with the allelic score. We exclude outcomes in validation set such that such that, for instance i = 1 for HDL because all three outcomes ranked higher than HDL are in the validation set).
cVariables that are in the ‘validation set’, that were chosen for inclusion using apriori knowledge of their association with BMI. Adjusted P value are not given for these are they are not part of the main outcome dataset.
Figure 1QQ-plot of the associations between the BMI allelic score and the 172 outcomes.
Association of log BMI age 8 with outcomes, of the stage 1 tests. Using the original dataset with variable number of individuals for each outcome. Tests performed with the Stata regress command and robust option. Top result leptin is not shown as P value too small. Corrected P = 0.00023 line: The Bonferroni corrected P = 0.05, accounting for the 160 tests (excluding validation set) performed. P expected = actual line: The expected trajectory, assuming the P values are uniformly distributed.
Ranked outcome associations with BMI allele score for original dataset, for results with P < 0.05.
| Rank | Outcome variable (original data with variable N) | Sample size | IV estimate of SD change of outcome for a 1 SD change of log BMI age 8 | SD change of outcome for a 1 SD change of log BMI (observational, age 8) | P value | ||||
|---|---|---|---|---|---|---|---|---|---|
| Test statistic | 95% CI | P value (adjusted P value | Test statistic | 95% CI | P value (adjusted P value | ||||
| Linear regression of continuous, normally distributed outcomes. Test statistic is the mean difference (SD) per 1 SD greater log BMI age 8 or percentage difference per 1 SD greater log BMI age 8 for log transformed outcomes | |||||||||
| 1 | Leptin, 9 | 3,381 | 0.922 | 0.77, 1.07 | <0.001 | 0.688 | 0.66, 0.71 | <0.001 | 0.002 |
| 2 | CRP, 9 | 3,382 | 0.432 | 0.25, 0.62 | <0.001 | 0.367 | 0.33, 0.40 | <0.001 | 0.484 |
| 3 | Age menarche | 2,186 | −0.485 | −0.70, −0.27 | <0.001 | −0.298 | −0.34, −0.26 | <0.001 | 0.080 |
| 4 | HDL, 9 | 3,382 | −0.466 | −0.66, −0.27 | <0.001 (<0.001) | −0.303 | −0.34, −0.27 | <0.001 (<0.001) | 0.096 |
| 5 | SBP, 7 | 4,641 | 0.305 | 0.13, 0.48 | 0.001 | 0.312 | 0.28, 0.34 | <0.001 | 0.940 |
| 6 | IL6, 9 | 3,372 | 0.234 | 0.04, 0.43 | 0.018 (2.852) | 0.197 | 0.17, 0.23 | <0.001 (<0.001) | 0.709 |
| 9 | Apolipoprotein B, 9 | 3,382 | 0.363 | 0.16, 0.57 | <0.001 (0.080) | 0.163 | 0.13, 0.20 | <0.001 (<0.001) | 0.048 |
| 10 | Triglycerides, 9 | 3,382 | 0.310 | 0.11, 0.51 | 0.002 (0.341) | 0.226 | 0.19, 0.26 | <0.001 (<0.001) | 0.392 |
| 11 | VLDL age 9 | 3,382 | 0.310 | 0.11, 0.51 | 0.002 (0.343) | 0.226 | 0.19, 0.26 | <0.001 (<0.001) | 0.393 |
| 12 | Apolipoprotein al, 9 | 3,382 | −0.243 | −0.44, −0.05 | 0.016 (1) | −0.196 | −0.23, −0.16 | <0.001 (<0.001) | 0.636 |
| 13 | Insulin, 15 | 2,285 | 0.299 | 0.06, 0.53 | 0.013 | 0.202 | 0.16, 0.24 | <0.001 | 0.434 |
| 16 | Hygiene Score, 4 | 4,335 | 0.209 | 0.01, 0.41 | 0.042 (1) | 0.052 | 0.02, 0.08 | 0.001 (0.086) | 0.118 |
| 18 | FVC: lung function, 8 | 4,869 | 0.162 | 0.00, 0.32 | 0.052 | 0.147 | 0.12, 0.18 | <0.001 | 0.855 |
| 21 | Glucose, 15 | 2,288 | 0.334 | 0.02, 0.65 | 0.039 | 0.006 | −0.04, 0.05 | 0.819 | 0.012 |
| Logistic regression of binary outcomes. Test statistic is the odds ratio between outcome groups | |||||||||
| 7 | Enjoyment of School Score, 4 | 5,808 | 1.344 | 0.98, 1.84 | 0.066 (1) | 1.075 | 1.01, 1.14 | 0.025 (1) | 0.700 |
| 8 | Self Esteem: Scholastic Competence, 8 | 5,223 | 1.548 | 1.11, 2.16 | 0.010 (1) | 1.004 | 0.95, 1.06 | 0.896 (1) | 0.036 |
| 14 | Attention/activity symptoms score, 11 | 4,542 | 1.399 | 0.96, 2.03 | 0.079 (1) | 1.022 | 0.95, 1.10 | 0.570 (1) | 0.166 |
| 15 | SDQ emotional symptoms score, 6 | 5,749 | 0.856 | 0.62, 1.19 | 0.357 (1) | 0.947 | 0.88, 1.01 | 0.095 (1) | 0.543 |
| 17 | Self Esteem: Global Self Worth Score, 8 | 5,215 | 1.536 | 1.10, 2.14 | 0.011 (1) | 1.002 | 0.95, 1.06 | 0.932 (1) | 0.046 |
| 19 | Burden of compulsions/obsessions score, 7 | 5,685 | 2.258 | 0.74, 6.92 | 0.154 (1) | 0.970 | 0.67, 1.27 | 0.842 (1) | 0.507 |
| 20 | Particular fears score, 7 | 5,735 | 0.893 | 0.65, 1.23 | 0.487 (1) | 0.992 | 0.93, 1.05 | 0.792 (1) | 0.519 |
Abbreviations: BMI, body mass index; CI, confidence interval; SD, standard deviation; IV, instrumental variable; VLDL, very low density lipoprotein; IL6, interleukin 6; SBP, systolic blood pressure; HDL, high density lipoprotein; SDQ, Strengths and Difficulties Questionnaires; FEF, forced expiratory flow; LF, lung function; FVC, forced vital capacity. Full names of variables are given in Supplementary Table 3. Figures 2 and 3 shows these results graphically. The ranks are based on tests of association of the BMI allele score with the outcome directly, for the 22 associations with a p < 0.05. We then perform IV analysis of these results, and it is the IV estimate that is given in this table. Exposure and outcome variables are standardized. Outcome as dependent variable, BMI allele score as independent variable.
aContinuous outcomes: Using Stata ivreg2 command (robust option) and BMI allele score as instrumental variable for log BMI age 8. First stage predicting log BMI at age 8 with the BMI allele score, and the second stage performs an unadjusted association of these log BMI age 8 predictions with the outcome. Binary outcomes: Using regress command (robust option) for first stage, and logistic command (vce(robust) option) for the second stage, to associate outcome with the predicted values of log BMI age 8 from the first stage.
bAdjusted P values: Adjusted for the 160 tests performed using the Bonferroni correction: pcorrected = poriginal·160. Adjusted P values greater than 1 are rounded to 1. Outcomes in validation set are excluded from Bonferroni correction.
cLog transformed outcomes, such that distributions approximately normal.
dCategories for binary variables given in Supplementary Table 7.
eP value of Durbin-Wu-Hausman test, comparing the effect estimates using the allelic score with the effect estimates using observational log BMI age 8. This uses a test for endogeneity (endog argument of ivreg2 Stata command).
fVariables that are in the ‘validation set’, that were chosen for inclusion using a priori knowledge of their association with BMI.
Figure 2A comparison of the observational and instrumental variable estimates for continuous outcomes.
The standard deviation change of outcome for a 1 SD increase of log BMI aged 8. IV estimate of effect using two-stage least squares regression of log BMI at age 8 as the exposure, with robust option. Observational estimates are the SD change of the outcome for a 1 SD increase in log BMI at age 8. Graphical illustration of the results in Table 4.
Figure 3A comparison of the observational and instrumental variable estimates for binary outcomes.
Odds ratio between groups of outcomes, for a 1 SD change of log BMI aged 8. IV estimate of effect using two-stage least squares regression of log BMI at age 8 as the exposure, with robust option. Observational estimates are the odds ratio between outcome groups. Graphical illustration of the results in Table 4. Categories for binary variables given in Supplementary Table 7.
Figure 4Testing invalidity of IV assumptions: associations of two instrumental variables using distinct SNP subsets, for continuous outcomes.
Comparison between the SNP subsets: (1) 31 SNPs (excluding FTO SNP) and (2) the FTO SNP only. IV estimate of effect using two-stage least squared regression of log BMI at age 8 as the exposure. Graphical illustration of the results in Table 5.
Figure 5Testing invalidity of IV assumptions: associations of two instrumental variables using distinct SNP subsets, of binary outcomes.
Odds ratio between outcome groups, for a 1 SD change of log BMI aged 8. Comparison between the SNP subsets: (1) 31 SNPs (excluding FTO SNP) and (2) the FTO SNP only. IV estimate of effect using two-stage least squared regression of log BMI at age 8 as the exposure. Graphical illustration of the results in Table 5. Categories for binary variables are given in Supplementary Table 7.
Testing for invalidity of IV assumptions: associations of two instrumental variables for log BMI at age 8; using 31 SNPs (excluding FTO SNP) and only the FTO SNP respectively.
| Outcome variable (original data with variable N) | IV estimate using 31 SNPs excluding FTO SNP | IV estimate using FTO SNP | Hansen P value | ||||
|---|---|---|---|---|---|---|---|
| Test statistic | 95% CI | P value | Test statistic | 95% CI | P value | ||
| Linear regression of continuous normally distributed outcomes. Test statistic is the mean difference (SD) per 1 SD greater log BMI age 8 or percentage difference per 1 SD greater log BMI age 8 | |||||||
| Leptin, 9 | 0.851 | 0.68, 1.02 | <0.001 | 1.193 | 0.85, 1.54 | <0.001 | 0.051 |
| CRP, 9 | 0.447 | 0.24, 0.66 | <0.001 | 0.375 | 0.01, 0.74 | 0.042 | 0.735 |
| Age menarche | −0.443 | −0.70, −0.19 | 0.001 | −0.631 | −1.05, −0.21 | 0.003 | 0.449 |
| HDL, 9 | −0.517 | −0.75, −0.29 | <0.001 | −0.271 | −0.64, 0.10 | 0.153 | 0.275 |
| SBP, 7 | 0.290 | 0.10, 0.48 | 0.002 | 0.385 | −0.05, 0.83 | 0.086 | 0.694 |
| IL6, 9 | 0.300 | 0.08, 0.52 | 0.008 | −0.027 | −0.41, 0.36 | 0.891 | 0.136 |
| Apolipoprotein B, 9 | 0.281 | 0.05, 0.51 | 0.018 | 0.678 | 0.24, 1.12 | 0.003 | 0.098 |
| Triglycerides, 9 | 0.262 | 0.04, 0.49 | 0.023 | 0.496 | 0.09, 0.90 | 0.016 | 0.307 |
| VLDL age 9 | 0.261 | 0.04, 0.49 | 0.023 | 0.497 | 0.09, 0.90 | 0.016 | 0.303 |
| Apolipoprotein al, 9 | −0.275 | −0.51, −0.04 | 0.021 | −0.122 | −0.50, 0.26 | 0.533 | 0.509 |
| Insulin, 15 | 0.214 | −0.05, 0.48 | 0.116 | 0.634 | 0.08, 1.19 | 0.026 | 0.163 |
| Hygiene Score, 4 | 0.241 | 0.02, 0.46 | 0.032 | 0.045 | −0.44, 0.53 | 0.858 | 0.477 |
| FVC: lung function, 8 | 0.226 | 0.04, 0.41 | 0.016 | −0.116 | −0.49, 0.25 | 0.540 | 0.096 |
| Glucose, 15 | 0.423 | 0.06, 0.78 | 0.021 | −0.018 | −0.56, 0.52 | 0.949 | 0.161 |
| Logistic regression of binary outcomes. Test statistic is the odds ratio between groups for a 1 SD increase in log BMI age 8 | |||||||
| Enjoyment of School Score, 4 | 1.349 | 0.95, 1.92 | 0.096 | 1.319 | 0.66, 2.64 | 0.435 | 0.897 |
| Self Esteem: Scholastic Competence, 8 | 1.665 | 1.15, 2.41 | 0.007 | 1.122 | 0.54, 2.34 | 0.759 | 0.517 |
| Attention/activity symptoms score, 11 | 1.534 | 1.01, 2.32 | 0.043 | 0.916 | 0.40, 2.07 | 0.832 | 0.882 |
| SDQ emotional symptoms score, 6 | 0.816 | 0.56, 1.18 | 0.277 | 1.072 | 0.52, 2.22 | 0.852 | 0.815 |
| Self Esteem: Global Self Worth Score, 8 | 1.723 | 1.19, 2.50 | 0.004 | 0.926 | 0.45, 1.93 | 0.837 | 0.357 |
| Burden of compulsions/obsessions score, 7 | 2.241 | 0.60, 8.44 | 0.233 | 2.255 | 0.14, 36.76 | 0.568 | 0.734 |
| Particular fears score, 7 | 0.864 | 0.61, 1.23 | 0.418 | 1.041 | 0.52, 2.10 | 0.910 | 0.901 |
Abbreviations: BMI, body mass index; CI, confidence interval; SD, standard deviation; IV, instrumental variable; VLDL, very low density lipoprotein; IL6, interleukin 6; SBP, systolic blood pressure; HDL, high density lipoprotein; SDQ, Strengths and Difficulties Questionnaires; LF, lung function; FVC, forced vital capacity. IV estimate calculated with ivregress and robust option (for robust standard errors). Categories for binary variables are given in Supplementary Table 7. Full names of variables are given in Supplementary Table 3. Figures 4 and 5 show these results graphically.
aFTO SNP is rs1558902.
bVariables that are in the ‘validation set’, that were chosen for inclusion using a priori knowledge of their association with BMI.
cP values are not adjusted for the multiple tests performed.
dLog transformed outcomes, such that distributions approximately normal.
eHansen P value comparing effect estimates using 31 SNP score and FTO.
Overidentification tests of IV using CUE.
| Outcome variable (original data with variable N) | CUE | |||
|---|---|---|---|---|
| Test statistic | 95% CI | P value | Hansen P value | |
| Linear regression of continuous normally distributed outcomes. Test statistic is the mean difference (SD) per 1 SD greater log BMI age 8 or percentage difference per 1 SD greater log BMI age 8 | ||||
| Leptin, 9 | 0.934 | 0.74, 1.13 | < 0.001 | 0.019 |
| CRP, 9 | 0.410 | 0.22, 0.60 | < 0.001 | 0.606 |
| Age menarche | −0.508 | −0.82, −0.20 | 0.001 | 0.077 |
| HDL, 9 | −0.447 | −0.65, −0.24 | < 0.001 | 0.388 |
| SBP, 7 | 0.314 | 0.12, 0.50 | 0.001 | 0.456 |
| IL6, 9 | 0.250 | 0.07, 0.43 | 0.006 | 0.885 |
| Apolipoprotein B, 9 | 0.339 | 0.04, 0.64 | 0.028 | 0.010 |
| Triglycerides, 9 | 0.245 | 0.04, 0.45 | 0.020 | 0.455 |
| VLDL age 9 | 0.245 | 0.04, 0.45 | 0.020 | 0.454 |
| Apolipoprotein al, 9 | −0.302 | −0.53, −0.07 | 0.010 | 0.005 |
| Insulin, 15 | 0.781 | 0.28, 1.28 | 0.002 | 0.013 |
| Hygiene Score, 4 | 0.299 | 0.08, 0.52 | 0.007 | 0.265 |
| FVC: lung function, 8 | 0.111 | −0.05, 0.27 | 0.180 | 0.670 |
| Glucose, 15 | 0.312 | −0.01, 0.63 | 0.059 | 0.877 |
| Linear regression of binary outcomes. Test statistic is the change in probability that outcome has value 0 for a 1 SD increase in log BMI age 8 | ||||
| Enjoyment of School Score, 4 | 0.008 | −0.24, 0.25 | 0.949 | 0.309 |
| Self Esteem: Scholastic Competence, 8 | 0.075 | −0.10, 0.25 | 0.408 | 0.134 |
| Attention/activity symptoms score, 11 | 0.174 | −0.08, 0.43 | 0.184 | 0.702 |
| SDQ emotional symptoms score, 6 | 0.066 | −0.20, 0.33 | 0.621 | 0.046 |
| Self Esteem: Global Self Worth Score, 8 | 0.087 | −0.11, 0.28 | 0.374 | 0.097 |
| Burden of compulsions/obsessions score, 7 | 0.035 | −0.06, 0.13 | 0.475 | 0.587 |
| Particular fears score, 7 | −0.071 | −0.28, 0.14 | 0.500 | 0.283 |
Abbreviations: BMI, body mass index; CI, confidence interval; SD, standard deviation; IV, instrumental variable; VLDL, very low density lipoprotein; IL6, interleukin 6; SBP, systolic blood pressure; HDL, high density lipoprotein; SDQ, Strengths and Difficulties Questionnaires; LF, lung function; FVC, forced vital capacity. Categories for binary variables given in Supplementary Table 7. Full names of variables are given in Supplementary Table 3. Tests use all 32 SNPs separately in the model.
aCUE: continuously updating estimator, with robust standard errors (robust option).
bP values are not adjusted for the multiple tests performed.
cLog transformed outcomes, such that distributions approximately normal.
Figure 6Distribution of the percentage of missing data, in our 8,121 sample, across the 172 outcomes.
Figure 7Graphs illustrating two possible causal pathways to explain associations of the BMI allele score with the outcomes.
Abbreviations: BMI, body mass index. Left graph represents the intended pathway we have investigated, where the BMI allele score is an IV for BMI and the variants affect the outcome solely through observed BMI in childhood. The right graph represents the alternative causal pathway, where the allelic score affects BMI indirectly through the outcome variable. It is possible that these two pathways both occur for a given outcome, such that the graph would become cyclical.