| Literature DB >> 31552104 |
Sheila J Barton1,2, Phillip E Melton3,4, Philip Titcombe1,2, Robert Murray2, Sebastian Rauschert5, Karen A Lillycrop2,6, Rae-Chi Huang5, Joanna D Holbrook2, Keith M Godfrey1,2,7.
Abstract
Background: Association studies of epigenome-wide DNA methylation and disease can inform biological mechanisms. DNA methylation is often measured in peripheral blood, with heterogeneous cell types with different methylation profiles. Influences such as adiposity-associated inflammation can change cell-type proportions, altering measured blood methylation levels. To determine whether associations between loci-specific methylation and outcomes result from cellular heterogeneity, many studies adjust for estimated blood cell proportions, but high correlations between methylation and cell-type proportions could violate the statistical assumption of no multicollinearity. We examined these assumptions in a population-based study.Entities:
Keywords: Illumina 450K; epigenomics; houseman cell-type adjustments; multicollinearity; reversal of direction of association; statistical assumptions
Year: 2019 PMID: 31552104 PMCID: PMC6746958 DOI: 10.3389/fgene.2019.00816
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Regression results for Loge BMI with DNA methylation % as predictor adjusted for age and sex: i) without Houseman cell-type adjustment; ii) with six Houseman cell-type adjustments; iii) with Houseman cell-type adjustments but excluding granulocytes; iv) with the first two principal components of six Houseman cell-type adjustments; v) cell-type adjustments residuals, each cell type individually; vi) cell-type adjustment residuals all cell types together. se, standard error.
| Row | CpG | CpG1 | CpG2 | CpG3 | CpG4 | CpG5 | CpG6 | CpG7 | CpG8 | CpG9 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 780 | 758 | 723 | 812 | 790 | 778 | 740 | 801 | 760 | |||
| 0 | 0.004 | −0.004 | −0.004 | −0.003 | −0.004 | −0.004 | −0.004 | −0.004 | loge BMI | ||
| 0.001 | 0.0029 | 0.0019 | 0.001 | 0.001 | 0.0012 | 0.001 | 0.0012 | 0.0012 | |||
| 0.915 | 0.98 | 0.06 | 0.003 | 0.012 | 0.003 | 0.008 | 0.001 | 0.006 | (Age and sex adjusted only) | ||
| 0.005 | 0.003 | 0.001 | 0.004 | 0.006 | 0.005 | 0.007 | 0.005 | 0.006 | loge BMI | ||
| 0.0017 | 0.0014 | 0.0024 | 0.0023 | 0.0022 | 0.0022 | 0.0024 | 0.0025 | 0.0024 | |||
| 0.01 | 0.041 | 0.608 | 0.061 | 0.007 | 0.028 | 0.005 | 0.063 | 0.009 | (Age, sex, and Houseman cell-type adjustments) | ||
| 0.005 | 0.003 | 0.001 | 0.004 | 0.006 | 0.005 | 0.007 | 0.005 | 0.006 | loge BMI | ||
| 0.0017 | 0.0010 | 0.0024 | 0.0023 | 0.0022 | 0.0022 | 0.0024 | 0.0025 | 0.0025 | |||
| 0.010 | 0.041 | 0.610 | 0.064 | 0.007 | 0.031 | 0.004 | 0.062 | 0.009 | (Age, sex, and Houseman cell-type adjustments without granulocytes) | ||
| 0.005 | 0.003 | 0.001 | 0.004 | 0.006 | 0.005 | 0.007 | 0.005 | 0.006 | loge BMI | ||
| 0.0017 | 0.0014 | 0.0024 | 0.0023 | 0.0022 | 0.0022 | 0.0024 | 0.0025 | 0.0024 | |||
| 0.01 | 0.041 | 0.608 | 0.061 | 0.007 | 0.028 | 0.005 | 0.063 | 0.009 | (Age, sex, and principal components 1 and 2 of Houseman cell-type adjustments) | ||
| 0.096 | 0.062 | 0.014 | 0.043 | 0.067 | 0.049 | 0.064 | 0.048 | 0.089 | loge BMI | ||
| 0.0436 | 0.0361 | 0.0598 | 0.0536 | 0.0524 | 0.0508 | 0.0553 | 0.0584 | 0.0564 | |||
| 0.028 | 0.087 | 0.820 | 0.427 | 0.199 | 0.331 | 0.246 | 0.415 | 0.115 | (Age, sex, and Houseman cell-type residuals one at a time) | ||
| 0.121 | 0.084 | 0.045 | 0.109 | 0.147 | 0.120 | 0.173 | 0.132 | 0.172 | loge BMI | ||
| 0.0446 | 0.0368 | 0.0619 | 0.0592 | 0.0578 | 0.0568 | 0.0620 | 0.0664 | 0.0640 | |||
| 0.007 | 0.022 | 0.468 | 0.065 | 0.011 | 0.035 | 0.005 | 0.047 | 0.007 | (Age, sex, and Houseman cell-type residuals all at the same time) |
Pearson correlation between CpG4 and estimated cell proportions.
| CpG4 | CD8 T cells | CD4 T cells | NK cells | B cells | Monocytes | Granulocytes |
|---|---|---|---|---|---|---|
| Pearson correlation | .611** | .553** | .371** | .215** | –.402** | –.783** |
| Significance (two-tailed) | <2.2 × 10−16 | <2.2 × 10−16 | <2.2 × 10−16 | 8.19 × 10−13 | <2.2 × 10−16 | <2.2 × 10−16 |
| 1081 | 1081 | 1081 | 1081 | 1081 | 1081 |
Variance inflation factors (VIFs), 1/VIF, and percentage variance explained by the other independent variables, for the model including CpG4. The left-hand side of shows VIFs, 1/VIF, and percentage variance explained by the other independent variables for the model including six Houseman cell types; the right-hand side of shows results for the model including five Houseman cell types but omitting granulocytes.
| loge BMI | VIFs | 1/VIF | % Variance explained | VIFs without granulocytes | 1/VIF | % Variance explained |
|---|---|---|---|---|---|---|
| 1.20 | 0.831 | 16.87 | 1.19 | 0.84 | 16.11 | |
| 1.03 | 0.975 | 2.53 | 1.02 | 0.98 | 1.64 | |
| 3.17 | 0.316 | 68.43 | 3.13 | 0.32 | 68.06 | |
| 24.95 | 0.040 | 95.99 | 1.83 | 0.55 | 45.35 | |
| 61.21 | 0.016 | 98.37 | 2.09 | 0.48 | 52.06 | |
| 30.26 | 0.033 | 96.70 | 2.02 | 0.49 | 50.57 | |
| 14.67 | 0.068 | 93.18 | 1.33 | 0.75 | 25.08 | |
| 11.32 | 0.088 | 91.17 | 1.50 | 0.67 | 33.19 | |
| 113.71 | 0.009 | 99.12 | NA | NA | NA |
Pearson correlation between CpGs and first two principal components (PCs) of six Houseman cell-type adjustments.
| PC1 | PC2 | ||
|---|---|---|---|
| Pearson correlation | −.470** | .258** | |
| Significance (two-tailed) | < 2.22 × 10−13 | < 2.22 × 10−13 | |
| 781 | 781 | ||
| Pearson correlation | −.329** | .297** | |
| Significance (two-tailed) | < 2.22 × 10−13 | < 2.22 × 10−13 | |
| 759 | 759 | ||
| Pearson correlation | −.493** | .311** | |
| Significance (two-tailed) | < 2.22 × 10−13 | < 2.22 × 10−13 | |
| 724 | 724 | ||
| Pearson correlation | −.671** | .447** | |
| Significance (two-tailed) | < 2.22 × 10−13 | < 2.22 × 10−13 | |
| 813 | 813 | ||
| Pearson correlation | −.653** | .464** | |
| Significance (two-tailed) | < 2.22 × 10−13 | < 2.22 × 10−13 | |
| 791 | 791 | ||
| Pearson correlation | −.700** | .456** | |
| Significance (two-tailed) | < 2.22 × 10−13 | < 2.22 × 10−13 | |
| 778 | 778 | ||
| Pearson correlation | −.645** | .496** | |
| Significance (two-tailed) | < 2.22 × 10−13 | < 2.22 × 10−13 | |
| 741 | 741 | ||
| Pearson correlation | −.751** | .415** | |
| Significance (two-tailed) | < 2.22 × 10−13 | < 2.22 × 10−13 | |
| 802 | 802 | ||
| Pearson correlation | −.727** | .436** | |
| Significance (two-tailed) | < 2.22 × 10−13 | < 2.22 × 10−13 | |
| 761 | 761 |
Figure 1Scatterplot of CpG4% methylation against loge BMI at age 17 in the Raine cohort.