| Literature DB >> 26292806 |
Todd M Everson1, Genevieve Lyons2, Hongmei Zhang3, Nelís Soto-Ramírez3, Gabrielle A Lockett4, Veeresh K Patil5, Simon K Merid6, Cilla Söderhäll7, Erik Melén6,8, John W Holloway4,9, S Hasan Arshad5,9, Wilfried Karmaus3.
Abstract
BACKGROUND: The prevalence of allergic diseases are increasing worldwide, emphasizing the need to elucidate their pathogeneses. The aims of this study were to use a two-stage design to identify DNA methylation levels at cytosine-phosphate-guanine (CpG) sites across the genome associated with atopy and high serum immunoglobulin E (IgE), then to replicate our findings in an independent cohort.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26292806 PMCID: PMC4545869 DOI: 10.1186/s13073-015-0213-8
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Fig. 1Recursive RF feature selection process. The feature selection process started with a large dataset: all CpGs that survived data cleaning and preprocessing, and were not potentially affected by probe SNPs. The cycle in black (conducting the Random Forest, collecting evaluation measures, assessing stop criteria, and reducing the data) repeated until the atopy-specific misclassification rate showed a marked increase, indicating that some excluded sites were important in classifying atopic participants. Thus, once an increase in atopy-specific misclassification was observed, the cycle stopped and sites from the previous iteration were selected for follow-up testing. OOB-ER out-of-bag error rate, RF Random Forest, VIM variable importance measure
Prevalence of atopy via positive skin prick tests and high serum IgE among females and males
| Epigenetic subsample | Full cohort | Subsample vs full cohort | |||||
|---|---|---|---|---|---|---|---|
| Outcome variables | Females | Males |
| Females | Males |
| |
| (n = 245) | (n = 122) |
| (n = 786) | (n = 750) |
| ||
| Atopy | At least one positive | 76 (31.4 %) | 49 (40.8 %) | 0.097 | 159 (35.6 %) | 194 (47.7 %) | Females: 0.30 |
| All negative | 166 (68.6 %) | 71 (59.2 %) | 287 (64.4 %) | 213 (52.3 %) | Males: 0.22 | ||
| IgE | IgE ≥ 200 kU/L | 24 (23.8 %) | 18 (41.9 %) | 0.047 | 66 (25.9 %) | 81 (32.5 %) | Females: 0.78 |
| IgE < 200 kU/L | 77 (76.2 %) | 25 (58.1 %) | 189 (74.1 %) | 168 (67.5 %) | Males: 0.31 | ||
Misclassification rates throughout the recursive RF process
| Iteration | Number of variables | OOB-ER overall misclassification (%) | Non-atopic misclassification (%) | Atopic misclassification (%) |
|---|---|---|---|---|
| 1 | 254,460 | 38.43 | 19.87 | 78.95 |
| 2 | 127,230 | 35.12 | 17.46 | 73.68 |
| 3 | 63,615 | 33.05 | 19.27 | 63.15 |
| 4 | 31,807 | 27.68 | 10.24 | 65.78 |
| 5 | 15,903 | 24.38 | 9.03 | 57.89 |
| 6 | 7,951 | 16.94 | 4.21 | 44.73 |
| 7 | 3,975 | 14.87 | 5.42 | 35.52 |
| 8 | 1,987 | 11.15 | 4.21 | 26.31 |
| 9 | 993 | 11.57 | 4.81 | 26.31 |
| 10 | 496 | 9.09 | 5.42 | 17.10 |
| 11 | 248 | 8.67 | 5.42 | 15.78 |
| 12 | 124 | 9.09 | 5.42 | 17.10 |
| 13 | 62 | 9.50 | 7.22 | 14.47 |
| 14 | 31 | 11.98 | 9.63 | 17.10 |
| 15 | 15 | 15.70 | 13.85 | 19.73 |
OOB-ER out-of-bag error rate
Genetic annotations for 62 sites selected by recursive Random Forest
| CpG Site | Chr | Coordinate | Associated genes | Gene region | CpG island |
|---|---|---|---|---|---|
| cg00854799 | 1 | 2336398 |
| 3′UTR | North Shelf |
| cg09249800 | 1 | 6341287 |
| Body | Island |
| cg06824199 | 1 | 47157809 |
| Body | – |
| cg17594242 | 1 | 115654782 |
| – | – |
| cg09332506 | 1 | 160309220 |
| Body | North Shelf |
| cg01847596 | 2 | 95660093 |
| – | North Shelf |
| cg07880854 | 2 | 112895559 |
| TSS1500 | North Shore |
| cg13168187 | 2 | 159523681 |
| Body | – |
| cg01203365 | 2 | 217291500 |
| Body | – |
| cg27468224 | 4 | 55031503 |
| – | – |
| cg11372831 | 4 | 57303157 |
| Body; TSS1500 | South Shore |
| cg03553407 | 4 | 148863880 |
| Body | – |
| cg00528600 | 5 | 61699751 |
| TSS200 | Island |
| cg04085542 | 5 | 93414338 |
| Body; 5′UTR | – |
| cg05560165 | 5 | 133450315 |
| TSS1500 | Island |
| cg14322298 | 6 | 10585683 |
| Body; TSS1500 | – |
| cg03131171 | 6 | 37616686 |
| Body | North Shore |
| cg00155310 | 6 | 50814011 |
| 3′UTR | South Shore |
| cg02201050 | 7 | 22759083 |
| – | – |
| cg02366798 | 7 | 27237154 |
| 3′UTR | North Shore |
| cg09570585 | 7 | 138916241 |
| 1stExon | Island |
| cg05652668 | 7 | 139044807 |
| 1stExon; 5′′UTR | Island |
| cg07970948 | 7 | 149543165 |
| Body | – |
| cg24836822 | 7 | 150648840 |
| Body | Island |
| cg05104993 | 8 | 11973223 |
| TSS200 | North Shore |
| cg06816054 | 8 | 27695695 |
| TSS1500 | South Shore |
| cg04775941 | 8 | 141474793 |
| – | Island |
| cg13713293 | 9 | 841636 |
| TSS200 | Island |
| cg25854298 | 10 | 73936754 |
| Body | – |
| cg03468115 | 10 | 76852752 |
| – | – |
| cg23527183 | 10 | 95253833 |
| – | North Shelf |
| cg08397758 | 10 | 100174853 |
| 1stExon | – |
| cg06851336 | 10 | 104678166 |
| 1stExon; 5′UTR | Island |
| cg24077454 | 10 | 119134782 |
| 1stExon; 5′UTR | Island |
| cg14574726 | 11 | 809735 |
| TSS200 | Island |
| cg12819873 | 11 | 57157632 |
| 5′UTR | – |
| cg13233042 | 11 | 63432489 |
| Body | – |
| cg04162999 | 11 | 64120313 |
| Body | – |
| cg10016610 | 11 | 124735994 |
| Body | Island |
| cg07908654 | 13 | 41631052 |
| – | North Shelf |
| cg09635874 | 13 | 98952518 |
| Body | – |
| cg11182893 | 13 | 114842103 |
| Body | South Shore |
| cg14478663 | 14 | 51643693 |
| – | – |
| cg15281774 | 15 | 73661908 |
| TSS1500 | Island |
| cg01777765 | 16 | 1823191 |
| TSS200; TSS1500 | Island |
| cg05048002 | 16 | 30077837 |
| 5'UTR | Island |
| cg04342090 | 16 | 30670571 |
| – | Island |
| cg02775369 | 16 | 56316221 |
| Body | – |
| cg01190915 | 16 | 56642761 |
| Body | South Shore |
| cg04983687 | 16 | 88558223 |
| Body | Island |
| cg27202913 | 16 | 89258862 |
| Body | Island |
| cg01097406 | 16 | 89675127 |
| – | – |
| cg04798929 | 17 | 8287246 |
| TSS1500 | Island |
| cg17549513 | 17 | 9694789 |
| TSS200 | – |
| cg07765167 | 17 | 36451845 |
| TSS1500 | North Shore |
| cg27469152 | 17 | 56282313 |
| 3′UTR | – |
| cg17041511 | 17 | 61509620 |
| – | North Shelf |
| cg12819826 | 19 | 10216676 |
| TSS1500; TSS1500 | North Shore |
| cg12578575 | 19 | 54135140 |
| TSS200 | – |
| cg11569718 | 19 | 58905979 |
| Body | North Shore |
| cg13197551 | 20 | 60709957 |
| 3′UTR | – |
| cg17971837 | 22 | 37215996 |
| TSS1500 | South Shelf |
Abbreviations: Chr Chromosome number, CpG cytosine–phosphate–guanineIgE Immunoglobulin E, IOW Isle of Wight cohort, TSS Transcription Start Site, UTR untranslated region
Coordinate: Location of the CpG site within each chromosome, via genome build 37
Stage 1 – Assessment of the influence of cell type on CpG selection in stage 1 analyses (n = 245)
| CpG Site | Crude β1 | Crude | Adjusted β1 | Adjusted | %Diffβ |
|---|---|---|---|---|---|
| cg00155310 | 2.24 | 0.00066 | 2.24 | 0.0011 | −0.26 |
| cg00528600 | 1.66 | 0.0038 | 1.68 | 0.0043 | 1.11 |
| cg00854799 | 1.23 | 0.0062 | 1.25 | 0.0067 | 1.29 |
| cg01097406 | 0.25 | 0.0050 | 0.27 | 0.0042 | 5.08 |
| cg01190915 | 2.43 | 0.00020 | 2.61 | 0.00011 | 7.49 |
| cg01203365 | −1.25 | 0.0051 | −1.19 | 0.0087 | −4.59 |
| cg01777765 | −1.51 | 0.0030 | −1.69 | 0.0015 | 11.76 |
| cg01847596 | −2.62 | 0.0015 | −2.65 | 0.0019 | 1.49 |
| cg02201050 | 2.08 | 0.0012 | 2.10 | 0.0017 | 0.87 |
| cg02366798 | 0.84 | 0.0097 | 0.85 | 0.014 | 0.46 |
| cg02775369 | −1.55 | 0.00072 | −1.53 | 0.0011 | −1.37 |
| cg03131171 | −1.21 | 0.0064 | −1.28 | 0.0049 | 5.47 |
| cg03468115 | 1.68 | 0.0017 | 1.58 | 0.0052 | −5.88 |
| cg03553407 | 1.54 | 0.00037 | 1.50 | 0.00060 | −2.44 |
| cg04085542 | 1.00 | 0.00033 | 0.95 | 0.00094 | −5.15 |
| cg04162999 | 0.92 | 0.030 | 1.05 | 0.016 | 14.72 |
| cg04342090 | −1.55 | 0.0086 | −1.46 | 0.014 | −5.58 |
| cg04775941 | −1.67 | 0.020 | −1.54 | 0.034 | −7.5 |
| cg04798929 | 2.20 | 0.0024 | 2.14 | 0.0034 | −2.98 |
| cg04983687 | −1.18 | 0.000010 | −1.43 | 0.0000017 | 21.45 |
| cg05048002 | −1.91 | 0.00074 | −1.75 | 0.0036 | −8.1 |
| cg05104993 | 2.89 | 0.0018 | 3.18 | 0.0012 | 10.1 |
| cg05560165 | 1.37 | 0.0011 | 1.55 | 0.00062 | 13.84 |
| cg05652668 | −1.15 | 0.00068 | −1.27 | 0.00031 | 10.57 |
| cg06816054 | −1.37 | 0.0081 | −1.49 | 0.0068 | 9.25 |
| cg06824199 | −2.11 | 0.000029 | −2.45 | 0.000011 | 16.35 |
| cg06851336 | −1.78 | 0.00099 | −1.76 | 0.0011 | −0.86 |
| cg07765167 | −2.47 | 0.000065 | −2.52 | 0.000095 | 1.94 |
| cg07880854 | 1.21 | 0.014 | 1.43 | 0.0070 | 18.16 |
| cg07908654 | −1.67 | 0.000013 | −1.80 | 0.0000099 | 7.96 |
| cg07970948 | −1.26 | 0.000027 | −1.42 | 0.000014 | 12.22 |
| cg08397758 | 0.84 | 0.043 | 1.90 | 0.0032 | 125.56 |
| cg09249800 | −1.14 | 0.000016 | −1.27 | 0.0000094 | 11.06 |
| cg09332506 | −1.97 | 0.0000097 | −2.10 | 0.0000078 | 6.38 |
| cg09570585 | 0.85 | 0.064 | 0.82 | 0.082 | −3.88 |
| cg09635874 | 1.96 | 0.00037 | 1.90 | 0.0010 | −3.11 |
| cg10016610 | −0.57 | 0.098 | −0.63 | 0.085 | 9.8 |
| cg11182893 | 1.81 | 0.00032 | 1.64 | 0.0015 | −9.47 |
| cg11372831 | 2.62 | 0.00018 | 2.63 | 0.00029 | 0.2 |
| cg11569718 | −1.54 | 0.011 | −1.44 | 0.021 | −6.64 |
| cg12578575 | −0.58 | 0.012 | −0.59 | 0.014 | 1.49 |
| cg12819826 | −1.90 | 0.000087 | −1.93 | 0.00029 | 1.67 |
| cg12819873 | −2.36 | 0.0000045 | −2.47 | 0.0000051 | 4.97 |
| cg13168187 | 1.65 | 0.0024 | 2.79 | 0.000043 | 69.09 |
| cg13197551 | −1.84 | 0.000062 | −1.77 | 0.00016 | −3.49 |
| cg13233042 | −1.64 | 0.000032 | −1.74 | 0.000031 | 6.21 |
| cg13713293 | −1.43 | 0.00039 | −1.40 | 0.00070 | −1.67 |
| cg14322298 | −1.53 | 0.0031 | −1.57 | 0.0052 | 2.77 |
| cg14478663 | 1.81 | 0.00020 | 1.91 | 0.00017 | 5.64 |
| cg14574726 | 1.75 | 0.0058 | 1.91 | 0.0037 | 9.54 |
| cg15281774 | −1.91 | 0.0014 | −2.13 | 0.00076 | 11.37 |
| cg17041511 | −1.97 | 0.000021 | −2.25 | 0.000027 | 14.69 |
| cg17549513 | 1.62 | 0.0094 | 1.82 | 0.0051 | 12.16 |
| cg17594242 | 1.72 | 0.000035 | 1.71 | 0.000071 | −0.93 |
| cg17971837 | −3.38 | 0.000061 | −3.64 | 0.000069 | 7.85 |
| cg23527183 | −0.83 | 0.0048 | −0.92 | 0.0034 | 10.38 |
| cg24077454 | 1.33 | 0.0056 | 1.27 | 0.012 | −4.29 |
| cg24836822 | −1.09 | 0.000020 | −1.17 | 0.000014 | 8.19 |
| cg25854298 | −1.40 | 0.000042 | −1.51 | 0.000037 | 7.39 |
| cg27202913 | −0.67 | 0.010 | −0.69 | 0.0094 | 2.01 |
| cg27468224 | −1.83 | 0.00013 | −1.85 | 0.00019 | 1.29 |
| cg27469152 | −1.99 | 0.00021 | −2.21 | 0.000094 | 11.21 |
Results of 62 logistic regressions between methylation M-values and atopy for each selected CpG. We present crude associations as well as associations adjusted for predicted cell proportions of CD8+ T cells, CD4+ T cells, natural killer cells, B-cells, monocytes, and granulocytes. β1 represents the value of the regression coefficient for the CpG site in that statistical model. The percent change in β-values (%Diffβ) was calculated as [(crude β1 – adjusted β1)/crude β1] and was used to evaluate whether cell type influenced the selection of each CpG site
Fig. 2Distribution of methylation levels within the male and female samples, by atopy status. Boxplots showing the distribution of methylation levels within the male (bright red and bright blue) and female (dark red and dark blue) samples, stratified by atopy status (red = atopic; blue = non-atopic) among the 17 CpG sites significantly associated with atopy and present in both the male and female samples. The 30 sites that were not significant in stage 2 analyses are included in Additional file 1
Comparison of stage 2 (n = 367) and replication (n = 464) results
| Genetic annotations* | IOW: atopy statusa | IOW: high IgEb | BAMSE: atopy statusc | ||||||
|---|---|---|---|---|---|---|---|---|---|
| CpG Site | Chr | Gene name | CpG location | Adj. OR |
| Adj. OR |
| Adj. OR |
|
|
|
|
|
| 0.239 |
| 0.158 |
| 0.269 |
|
| cg09249800 | 1 |
| Body | 0.239 |
| 0.141 |
| – |
|
|
|
|
|
| 0.065 |
| 0.045 |
| 0.140 |
|
|
|
|
|
| 0.145 |
| 0.103 |
| 0.206 |
|
|
|
|
|
| 0.107 |
| 0.073 |
| 0.118 |
|
|
|
|
|
| 0.091 |
| 0.025 |
| 0.099 |
|
|
|
|
|
| 0.123 |
| 0.073 |
| 0.262 |
|
|
|
|
|
| 0.192 |
| 0.105 |
| 0.194 |
|
|
|
|
|
| 0.160 |
| 0.094 |
| 0.312 |
|
| cg07765167 | 17 |
| TSS1500 | 0.087 |
| 0.043 |
| – |
|
|
|
|
|
| 0.123 |
| 0.126 | 3.62E−03 | 0.196 |
|
|
|
|
|
| 0.143 |
| 0.075 | 8.14E−04 | 0.133 |
|
| cg24836822 | 7 |
| Body | 0.311 |
| 0.225 |
| – |
|
|
|
|
|
| 0.243 |
| 0.109 |
| 0.313 |
|
|
|
|
|
| 0.221 |
| 0.144 | 1.24E−03 | 0.149 |
|
| cg09635874 | 13 |
| Body | 8.084 |
| 2.591 | 2.44E−01 | 1.149 | 8.59E−01 |
| cg04085542 | 5 |
| Body; 5′UTR | 2.663 |
| 1.250 | 5.48E−01 | 0.936 | 7.93E−01 |
| cg03553407 | 4 |
| Body | 4.330 |
| 2.411 | 1.92E−01 | 2.010 | 2.16E−01 |
| cg12819826 | 19 |
| TSS1500 | 0.213 |
| 0.229 | 4.02E−02 | 1.788 | 2.87E−01 |
| cg00854799 | 1 |
| 3′UTR | 4.209 |
| 2.930 | 8.27E−02 | 1.464 | 5.18E−01 |
| cg05652668 | 7 |
| 1st exon; 5′UTR | 0.280 |
| 0.517 | 2.39E−01 | 0.820 | 7.35E−01 |
|
|
|
|
| 0.122 |
| 0.102 | 4.11E−02 | 0.188 |
|
Associations for methylation M-values with atopy and high IgE from the IOW epigenetic sample, as well as with atopy in the BAMSE replication sample. IOW analyses were adjusted for predicted cell proportions of CD8+ T cells, CD4+ T cells, natural killer cells, B cells, monocytes, granulocytes, and sex. BAMSE replication analyses were adjusted for the same predicted cell proportions, sex, batch (bisulfite treatment date), and asthma treatment
Abbreviations: Adj. OR adjusted odds ratio, BAMSE Children, Allergy, Milieu, Stockholm, Epidemiology cohort, Chr Chromosome number, CpG cytosine–phosphate–guanine, IgE Immunoglobulin E, IOW Isle of Wight cohort, TSS Transcription Start Site, UTR untranslated region
*CpGs and their annotations highlighted in bold were the sites replicated in the BAMSE cohort
aAtopy defined as at least one positive skin prick test; significant sites determined via α = 8.06E−4 (0.05/62 tests)
bHigh IgE defined as serum IgE ≥ 200 kU/L; significant sites determined via α = 8.06E−4 (0.05/62 tests)
cAtopy defined as serum IgE antibody ≥ 0.35 kUA/L, to any allergen; significant sites determined via α = 0.05
Functional annotation for genes associated with the 13 sites that were successfully replicated
| Category | Term | Genes |
|
|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| UP_SEQ_FEATURE | Sequence variant |
| 0.1365 |
| GOTERM_MF_FAT | GO:0005509 Calcium ion binding |
| 0.1807 |
| SP_PIR_KEYWORDS | Nitration |
| 0.1539 |
| SP_PIR_KEYWORDS | Alternative splicing |
| 0.3382 |
| GOTERM_MF_FAT | GO:0046872 Metal ion binding |
| 0.3749 |
| GOTERM_BP_FAT | GO:0048193 Golgi vesicle transport |
| 0.8360 |
| GOTERM_MF_FAT | GO:0043169 Cation binding |
| 0.2826 |
| GOTERM_MF_FAT | GO:0043167 Ion binding |
| 0.2291 |
| SP_PIR_KEYWORDS | Calcium |
| 0.3469 |
| SP_PIR_KEYWORDS | Phosphoprotein |
| 0.2989 |
| SP_PIR_KEYWORDS | Cytoplasmic vesicle |
| 0.5085 |
| UP_SEQ_FEATURE | Splice variant |
| 0.9270 |
| UP_SEQ_FEATURE | Domain:EF-hand 1 |
| 0.8342 |
| UP_SEQ_FEATURE | Domain:EF-hand 2 |
| 0.7430 |
| INTERPRO | IPR018249: EF-HAND 2 |
| 0.9655 |
| GOTERM_BP_FAT | GO:0006350 Transcription |
| 0.9478 |
| INTERPRO | IPR018247: EF-HAND 1 |
| 0.8227 |
Statistically significant DAVID functional annotation and pathway results are in bold
P-values corrected for multiple testing via Benjamini–Hochberg method