B M Quraishi, H Zhang, T M Everson, M Ray, G A Lockett, J W Holloway, S R Tetali, S H Arshad, A Kaushal, F I Rezwan, W Karmaus.
Abstract
BACKGROUND: The prevalence of eczema is increasing in industrialized nations. Limited evidence has shown an association of DNA methylation (DNA-M) with eczema. We explored this association at the epigenome scale to better understand the role of DNA-M. Data from the first generation (F1) of the Isle of Wight (IoW) birth cohort participants and from the second generation (F2) were examined in our study. Epigenome-scale DNA methylation of F1 at age 18 years and of F2 in cord blood was measured using the Illumina Infinium HumanMethylation450 BeadChip. A total of 307,357 cytosine-phosphate-guanine sites (CpGs) in the F1 generation were screened via recursive random forest (RF) for their potential association with eczema at age 18. Functional enrichment and pathway analysis of the resulting genes were carried out using the DAVID gene functional classification tool. Log-linear models were fitted in F1 to corroborate the identified CpGs. Findings in F1 were further tested for replication in F2.
Keywords: Allergic disease; CpG; DNA methylation; Eczema; Epigenetics; Epigenome-scale; F1 and F2 generations; Random forest
Year: 2015 PMID: 26199674 PMCID: PMC4508804 DOI: 10.1186/s13148-015-0108-y
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Eczema status in male and female cohort participants in the F1 and F2 generations (chi-square tests)
| F1 generation | | Females (n = 244) | Males (n = 122) | Chi-square P value |
|---|---|---|---|---|
| Eczema status | Yes | 37 (15.2 %) | 9 (7.3 %) | 0.051 |
| | No | 207 (84.8 %) | 113 (92.6 %) | |

| F2 generation | | Boys (n = 60) | Girls (n = 56) | Chi-square P value |
|---|---|---|---|---|
| Eczema status, age 3 months | Yes | 9 (15.0 %) | 2 (3.6 %) | 0.048 |
| | No | 44 (73.3 %) | 53 (94.6 %) | |
| | Missing | 7 (11.7 %) | 1 (1.8 %) | |
| Eczema status, age 6 months | Yes | 13 (21.7 %) | 6 (10.7 %) | 0.162 |
| | No | 39 (65.0 %) | 43 (76.8 %) | |
| | Missing | 8 (13.3 %) | 7 (12.5 %) | |
| Eczema status, age 12 months | Yes | 9 (15.0 %) | 5 (8.9 %) | 0.521 |
| | No | 37 (61.7 %) | 36 (64.3 %) | |
| | Missing | 14 (23.3 %) | 15 (26.8 %) | |
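
The chi-square comparisons in this table can be checked from the counts alone. The sketch below is a minimal pure-Python version of a 2 × 2 chi-square test. It assumes Yates' continuity correction and the exclusion of missing values, neither of which is stated in the source; the function name `chi_square_2x2` is illustrative. Under those assumptions the F1 counts and the F2 age-3-months counts give P values of about 0.051 and 0.048.

```python
import math


def chi_square_2x2(table, yates=True):
    """Chi-square test for a 2 x 2 table given as [[a, b], [c, d]].

    Returns (statistic, p_value) with 1 degree of freedom.
    Yates' continuity correction is an assumption, not stated in the source.
    """
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)

    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            deviation = abs(observed - expected)
            if yates:
                # Shrink each deviation by 0.5, never below zero.
                deviation = max(0.0, deviation - 0.5)
            statistic += deviation ** 2 / expected

    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value


# Rows are groups, columns are eczema yes / no (missing values excluded).
f1 = [[37, 207], [9, 113]]        # females, males
f2_3_months = [[9, 44], [2, 53]]  # boys, girls

for label, counts in (("F1", f1), ("F2, 3 months", f2_3_months)):
    stat, p = chi_square_2x2(counts)
    print(label, round(stat, 2), round(p, 3))
```

Passing `yates=False` gives the uncorrected Pearson statistic, which yields smaller P values for the same counts.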
The performance of recursive RF at each iteration
| Iteration | Number of CpGs | Overall misclassification (OOB-ER) | Eczema misclassification | Non-eczema misclassification |
|---|---|---|---|---|
| 1 | 307,357 | 18.6 % | 95.7 % | 7.5 % |
| 2 | 153,678 | 15.3 % | 82.6 % | 5.6 % |
| 3 | 76,838 | 18.6 % | 87.0 % | 8.8 % |
| 4 | 38,419 | 16.1 % | 65.2 % | 9.1 % |
| 5 | 19,208 | 17.8 % | 80.4 % | 8.8 % |
| 6 | 9,604 | 14.2 % | 78.3 % | 5.0 % |
| 7 | 4,802 | 12.3 % | 58.7 % | 5.6 % |
| 8 | 2,401 | 10.7 % | 52.2 % | 4.7 % |
| 9 | 1,200 | 7.9 % | 37.0 % | 3.8 % |
| 10 | 599 | 6.8 % | 26.1 % | 4.1 % |
| 11 | 298 | 6.6 % | 30.4 % | 3.1 % |
| 12ᵃ | 148 | 5.2 % | 17.4 % | 3.4 % |
| 13 | 74 | 6.3 % | 19.6 % | 4.4 % |
| 14 | 37 | 9.3 % | 21.7 % | 7.5 % |
| 15 | 18 | 8.5 % | 26.1 % | 5.9 % |
| 16 | 9 | 10.7 % | 19.6 % | 9.4 % |
| 17 | 3 | 16.9 % | 28.2 % | 15.3 % |
OOB-ER: out-of-bag error rate
ᵃ The 12th iteration had the lowest misclassification error rate
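
The three error columns are linked: the overall error is the average of the two class-specific errors, weighted by class size. The sketch below checks this for three iterations. It assumes the forest was fitted on all 366 F1 participants from the eczema status table (46 with eczema, 320 without), which the source does not state explicitly; the function name `overall_error` is illustrative. Under that assumption the reported overall rates of 18.6 %, 5.2 %, and 16.9 % are reproduced.

```python
# Class sizes taken from the F1 eczema counts (females + males).
# Using all 366 participants is an assumption.
N_ECZEMA = 37 + 9
N_NO_ECZEMA = 207 + 113


def overall_error(eczema_error, non_eczema_error,
                  n_eczema=N_ECZEMA, n_no_eczema=N_NO_ECZEMA):
    """Weighted average of the class-specific misclassification rates."""
    misclassified = eczema_error * n_eczema + non_eczema_error * n_no_eczema
    return misclassified / (n_eczema + n_no_eczema)


# (eczema error, non-eczema error) as reported for iterations 1, 12 and 17.
reported = {1: (0.957, 0.075), 12: (0.174, 0.034), 17: (0.282, 0.153)}

for iteration, (eczema_err, non_eczema_err) in reported.items():
    rate = 100 * overall_error(eczema_err, non_eczema_err)
    print(iteration, round(rate, 1))
```

Because only about one participant in eight has eczema, the overall rate stays low even when most eczema cases are misclassified, as in iteration 1.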
Fig. 1 Misclassification error rates at each iteration of the recursive RF. OOB: out-of-bag error rate (overall error); YES: eczema; NO: non-eczema
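
The recursive screening can be sketched as a loop: fit a forest, record its error, keep the most important features, and repeat. The version below is a hedged outline, not the authors' implementation. The forest is abstracted as a caller-supplied `fit` function returning an error rate and per-feature importances. Keeping exactly half of the features per round and stopping below three features are assumptions that only approximate the counts in the table. All names, including `recursive_screen` and `toy_fit`, are hypothetical.

```python
def recursive_screen(features, fit, keep_fraction=0.5, min_features=3):
    """Repeatedly fit, rank by importance, and keep the top fraction.

    fit(features) must return (error_rate, {feature: importance}).
    Returns the feature set with the lowest error plus the full history.
    """
    history = []
    current = list(features)
    while True:
        error_rate, importance = fit(current)
        history.append((list(current), error_rate))
        n_keep = int(len(current) * keep_fraction)
        if n_keep < min_features:
            break
        # sorted() is stable, so ties keep their original order.
        ranked = sorted(current, key=lambda f: importance[f], reverse=True)
        current = ranked[:n_keep]
    best_features, best_error = min(history, key=lambda item: item[1])
    return best_features, best_error, history


def toy_fit(current):
    """Stand-in for a random forest: features 0-3 are informative (toy data)."""
    informative = {0, 1, 2, 3}
    noise = sum(1 for f in current if f not in informative)
    missing = sum(1 for f in informative if f not in current)
    error_rate = 0.01 * noise + 0.1 * missing
    importance = {f: (1.0 if f in informative else 0.0) for f in current}
    return error_rate, importance


best, err, history = recursive_screen(range(16), toy_fit)
print(best, err, [len(feats) for feats, _ in history])
```

Selecting the round with the lowest error, rather than the last round, mirrors the choice of the 12th iteration in the table.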
Terms significantly enriched in functional annotation and pathway analysis and genes present in the pathways potentially associated with eczema (FDR-adjusted P value; FDR = 0.05)
| Term | FDR-adjusted P value |
|---|---|
| Polymorphism | 4.7 × 10^−145 |
| Sequence variant | 2.3 × 10^−111 |
| Alternative splicing | 6.8 × 10^−74 |
| Splice variant | 1.6 × 10^−46 |
| Phosphoprotein | 6.2 × 10^−25 |
| Protocadherin gamma | 1.8 × 10^−16 |
| Disease mutation | 4.9 × 10^−16 |
| Domain: cadherin 6 | 1.5 × 10^−10 |
| Cadherin, N-terminal | 3.6 × 10^−10 |
| Pathways in cancer | 8.5 × 10^−8 |
| Membrane | 1.1 × 10^−7 |
| Regulation of actin cytoskeleton | 1.8 × 10^−7 |
| Long-term depression | 9.0 × 10^−7 |
| Calcium ion binding | 1.1 × 10^−6 |
| Plasma membrane | 2.2 × 10^−6 |
| Glycoprotein | 2.4 × 10^−6 |
| Gap junctionᵃ | 2.6 × 10^−6 |
| Cell-cell adhesion | 2.7 × 10^−6 |
| Homophilic cell adhesion | 6.1 × 10^−6 |
| Chemokine signaling pathway | 1.0 × 10^−5 |
| Focal adhesion | 1.3 × 10^−5 |
| Axon guidance | 1.3 × 10^−5 |
| Tight junctionᵃ | 1.6 × 10^−5 |
| Biological adhesion | 1.7 × 10^−5 |
| Cell adhesion | 2.2 × 10^−5 |
| Coiled coil | 2.6 × 10^−5 |
| Melanogenesisᵃ | 7.1 × 10^−5 |
| Vascular smooth muscle contraction | 1.1 × 10^−4 |
| Chromosomal rearrangement | 2.6 × 10^−4 |
| Cardiac muscle contraction | 2.7 × 10^−4 |
| Intracellular signaling cascade | 4.5 × 10^−4 |
| Cell membrane | 4.7 × 10^−4 |
| Cell fraction | 4.8 × 10^−4 |
| Prostate cancer | 6.2 × 10^−4 |
| Ion binding | 6.8 × 10^−4 |
| Acetylation | 7.6 × 10^−4 |
| Signal | 8.3 × 10^−4 |
| Transmembrane | 1.0 × 10^−3 |
| Mutagenesis site | 1.1 × 10^−3 |
| Cation binding | 1.2 × 10^−3 |
| Lysine degradation | 1.4 × 10^−3 |
| Leukocyte transendothelial migration | 1.5 × 10^−3 |
| Lysosome | 1.5 × 10^−3 |
| Transcription factor binding | 3.9 × 10^−3 |
| Melanomaᵃ | 4.6 × 10^−3 |
| Tumor suppressor | 5.0 × 10^−3 |
| Nucleotide binding | 5.0 × 10^−3 |
| Endocytosis | 7.0 × 10^−3 |
| Apoptosisᵃ | 7.3 × 10^−3 |
| Small cell lung cancer | 7.3 × 10^−3 |
| Nucleus | 1.1 × 10^−2 |
| Cell projection | 1.7 × 10^−2 |
| Positive regulation of cellular biosynthetic process | 4.4 × 10^−2 |
| Transcription co-activator activity | 4.9 × 10^−2 |

ᵃ Pathways involved in eczema, with their genes
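
For readers unfamiliar with FDR adjustment, the sketch below shows the Benjamini-Hochberg procedure, one common way to obtain FDR-adjusted P values. It is an illustration only: the source does not say how DAVID computed the values in this table, and DAVID's own calculation may differ. The function name and the four input P values are hypothetical toy values, not data from the study.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(p_values)
    # Indices of the P values from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, p_values[index] * m / rank)
        adjusted[index] = running_min
    return adjusted


toy_p_values = [0.01, 0.04, 0.03, 0.005]  # hypothetical raw P values
adjusted = benjamini_hochberg(toy_p_values)

# Terms are retained when the adjusted value is at or below the FDR threshold.
retained = [p for p in adjusted if p <= 0.05]
print(adjusted, len(retained))
```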
The 41 CpGs that had the same direction of effect with eczema in both F1 and F2 generations based on log-linear models
| CpG | F1 risk ratio | F1 95 % CI | F2 risk ratio | F2 95 % CI | Gene |
|---|---|---|---|---|---|
| cg00193668 | 17.29 | 2.90, 102.87 | 4.86 | 0.89, 26.4 | |
| cg04850479ᵃ | 15.19 | 3.07, 75.17 | 6.82 | 1.52, 30.6 | |
| cg02641560 | 14.50 | 3.39, 62.65 | 1.33 | 0.13, 12.8 | |
| cg05839818 | 13.02 | 2.34, 72.26 | 1.3 | 0.13, 12.1 | |
| cg05411056 | 9.73 | 2.64, 35.81 | 5.61 | 1.44, 21.85 | |
| cg02077766 | 9.60 | 2.14, 43.07 | 1.29 | 0.38, 4.33 | |
| cg00667315 | 7.66 | 1.88, 31.21 | 1.25 | 0.19, 8.0 | |
| cg00900242 | 6.86 | 1.26, 37.20 | 6.04 | 0.75, 48.6 | |
| cg02583247 | 6.61 | 2.05, 21.33 | 1.27 | 0.26, 6.10 | |
| cg01802073 | 6.10 | 1.40, 26.43 | 1.43 | 0.24, 8.61 | |
| cg14839837 | 5.90 | 1.63, 21.39 | 2.94 | 0.73, 11.7 | |
| cg00354884 | 5.77 | 1.95, 17.03 | 1.8 | 0.59, 6.03 | |
| cg00158434 | 5.43 | 1.75, 16.78 | 2.47 | 0.52, 11.5 | |
| cg03049303 | 4.73 | 1.44, 15.57 | 4.61 | 0.77, 27.4 | |
| cg24303123 | 4.68 | 1.73, 12.65 | 1.49 | 0.50, 4.46 | |
| cg11570082 | 4.46 | 1.85, 10.71 | 2.56 | 0.58, 11.2 | |
| cg02237186 | 4.26 | 1.24, 14.63 | 2.89 | 0.16, 51.1 | |
| cg02654265 | 3.92 | 1.56, 9.87 | 0.29 | 0.05, 1.52 | |
| cg00369908 | 3.65 | 1.34, 9.92 | 4.05 | 0.75, 21.6 | |
| cg00722180 | 3.64 | 1.22, 0.85 | 2.84 | 0.63, 12.7 | |
| cg02433979 | 2.91 | 1.35, 6.27 | 1.17 | 0.37, 3.68 | |
| cg00035220 | 2.62 | 1.19, 5.72 | 1.18 | 0.34, 4.03 | |
| cg00252472 | 2.62 | 1.27, 5.40 | 1.22 | 0.44, 3.38 | |
| cg00306063 | 2.59 | 1.12, 5.97 | 2.12 | 0.48, 9.19 | |
| cg00742851 | 2.23 | 1.16, 4.28 | 1.26 | 0.45, 3.48 | |
| cg02203881 | 2.07 | 1.07, 4.00 | 1.67 | 0.46, 6.02 | |
| cg00576402 | 0.57 | 0.35, 0.92 | 0.76 | 0.28, 2.07 | |
| cg01560119 | 0.41 | 0.21, 0.80 | 0.79 | 0.37, 1.68 | |
| cg01651499 | 0.37 | 0.16, 0.85 | 0.41 | 0.12, 1.34 | |
| cg02098905 | 0.35 | 0.16, 0.76 | 0.41 | 0.14, 1.12 | |
| cg04797820 | 0.33 | 0.17, 0.64 | 0.93 | 0.31, 2.76 | |
| cg00247571 | 0.31 | 0.13, 0.75 | 0.89 | 0.26, 2.50 | |
| cg00071869 | 0.30 | 0.13, 0.70 | 0.77 | 0.12, 4.88 | |
| cg00797821 | 0.29 | 0.10, 0.82 | 0.36 | 0.06, 2.12 | |
| cg01158447 | 0.24 | 0.09, 0.60 | 0.35 | 0.11, 1.14 | |
| cg00077547 | 0.21 | 0.06, 0.70 | 0.91 | 0.25, 3.24 | |
| cg04980849 | 0.21 | 0.07, 0.60 | 0.57 | 0.16, 1.96 | |
| cg00050654 | 0.19 | 0.07, 0.51 | 0.71 | 0.20, 2.43 | |
| cg20077343 | 0.19 | 0.06, 0.58 | 0.25 | 0.03, 1.83 | |
| cg17602756 | 0.14 | 0.03, 0.64 | 0.26 | 0.05, 1.39 | |
| cg01427769ᵃ | 0.13 | 0.03, 0.46 | 0.09 | 0.02, 0.36 | |

ᵃ CpG sites significantly associated with eczema in both generations. For cg04850479, the P values are 0.0006 in the F1 generation and 0.0121 in the F2 generation; for cg01427769, they are 0.0015 and 0.0007, respectively
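
A risk ratio and its confidence interval together imply an approximate P value. The sketch below assumes the intervals are Wald-type (symmetric on the log scale, with a normal quantile of about 1.96), which the source does not state; the function name `wald_p_from_ci` is illustrative. For cg04850479 in F2 (risk ratio 6.82, 95 % CI 1.52 to 30.6) it gives a P value of roughly 0.012, in line with the 0.0121 in the footnote. Agreement for other sites is only approximate because the published bounds are rounded.

```python
import math

Z_95 = 1.959964  # two-sided 95 % normal quantile


def wald_p_from_ci(risk_ratio, lower, upper):
    """Back-calculate SE, z and a two-sided P value from a 95 % CI.

    Assumes a Wald interval on the log scale (an assumption, see lead-in).
    """
    standard_error = (math.log(upper) - math.log(lower)) / (2 * Z_95)
    z = math.log(risk_ratio) / standard_error
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return standard_error, z, p_value


# cg04850479, F2 generation, values as given in the table.
se, z, p = wald_p_from_ci(6.82, 1.52, 30.6)
print(round(se, 3), round(z, 2), round(p, 4))

# On the log scale the point estimate sits midway between the bounds,
# so the geometric mean of the bounds should be close to the risk ratio.
print(round(math.sqrt(3.07 * 75.17), 2))  # cg04850479, F1 (reported 15.19)
```

The same midpoint check is a quick way to spot table entries whose bounds do not bracket the reported risk ratio.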
Fig. 2 The risk ratios of 83 eczema-associated CpGs sorted by chromosome from 1 to 21. The numbers in the textbox are chromosome indices, which are represented by different colors in the bar graphs. The horizontal red line represents a risk ratio of one
Fig. 3 Flow chart of statistical analyses and the number of CpG sites after each analysis in the F1 and F2 generations. RRF: recursive random forest