| Literature DB >> 29529061 |
Hemant Gujar1, Jane W Liang1, Nicholas C Wong2, Khyobeni Mozhui1,3.
Abstract
The Illumina Infinium MethylationEPIC provides an efficient platform for profiling DNA methylation in humans at over 850,000 CpGs. Model organisms such as mice do not currently benefit from an equivalent array. Here we used this array to measure DNA methylation in mice. We defined probes targeting conserved regions and performed differential methylation analysis and compared between the array-based assay and affinity-based DNA sequencing of methyl-CpGs (MBD-seq) and reduced representation bisulfite sequencing. Mouse samples consisted of 11 liver DNA from two strains, C57BL/6J (B6) and DBA/2J (D2), that varied widely in age. Linear regression was applied to detect differential methylation. In total, 13,665 probes (1.6% of total probes) aligned to conserved CpGs. Beta-values (β-value) for these probes showed a distribution similar to that in humans. Overall, there was high concordance in methylation signal between the EPIC array and MBD-seq (Pearson correlation r = 0.70, p-value < 0.0001). However, the EPIC probes had higher quantitative sensitivity at CpGs that are hypo- (β-value < 0.3) or hypermethylated (β-value > 0.7). In terms of differential methylation, no EPIC probe detected a significant difference between age groups at a Benjamini-Hochberg threshold of 10%, and the MBD-seq performed better at detecting age-dependent change in methylation. However, the top most significant probe for age (cg13269407; uncorrected p-value = 1.8 x 10-5) is part of the clock CpGs used to estimate the human epigenetic age. For strain, 219 EPIC probes detected significant differential methylation (FDR cutoff 10%) with ~80% CpGs associated with higher methylation in D2. This higher methylation profile in D2 compared to B6 was also replicated by the MBD-seq data. To summarize, we found only a small subset of EPIC probes that target conserved sites. However, for this small subset the array provides a reliable assay of DNA methylation and can be effectively used to measure differential methylation in mice.Entities:
Mesh:
Year: 2018 PMID: 29529061 PMCID: PMC5846735 DOI: 10.1371/journal.pone.0193496
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sample details and average methylation signal intensity.
| Full set | Conserved set | |||||||
|---|---|---|---|---|---|---|---|---|
| Sample | Age | Age (months) | Strain | Sex | Mean | Median | Mean | Median |
| Mouse1 | young | 4 | D2 | F | 505 | 394 | 3206 | 1898 |
| Mouse2 | young | 4 | D2 | F | 926 | 524 | 10989 | 10278 |
| Mouse7 | young | 4 | B6 | F | 877 | 538 | 9866 | 8702 |
| Mouse8 | young | 4 | B6 | F | 766 | 397 | 10386 | 9975 |
| Mouse3 | mid | 12 | D2 | F | 852 | 483 | 10615 | 9880 |
| Mouse4 | mid | 12 | D2 | F | 818 | 430 | 10866 | 10542 |
| Mouse5 | mid | 12 | D2 | M | 845 | 456 | 11545 | 10982 |
| Mouse9 | mid | 12 | B6 | M | 852 | 444 | 11433 | 11187 |
| Mouse6 | old | 24 | D2 | F | 737 | 379 | 10206 | 9611 |
| Mouse10 | old | 24 | B6 | F | 845 | 448 | 10767 | 10436 |
| Mouse11 | old | 24 | B6 | F | 886 | 490 | 11302 | 10741 |
| Human1 | 7568 | 7218 | 8710 | 8616 | ||||
| Human2 | 10668 | 10288 | 11761 | 11599 | ||||
D2: DBA/2J; B6: C57BL/6J
Genomic features of CpGs and enrichment in conserved sites.
| Full set | Conserved set | ||||
|---|---|---|---|---|---|
| Feature | Counts | Percent | Counts | Percent | Enrichment p |
| TSS1500 | 107193 | 12 | 1195 | 9 | ns |
| TSS200 | 65152 | 8 | 1940 | 14 | <1.0E-15 |
| 5'UTR | 73070 | 8 | 1269 | 9 | 1.8E-04 |
| 1stExon | 26433 | 3 | 2028 | 15 | <1.0E-15 |
| Exon | 5680 | 1 | 282 | 2 | <1.0E-15 |
| 3'UTR | 21594 | 2 | 340 | 2 | ns |
| Body | 318165 | 37 | 4918 | 36 | ns |
| Non-Genic | 249549 | 29 | 1693 | 12 | ns |
| Islands | 161598 | 19 | 6270 | 46 | <1.0E-15 |
| Shores | 154735 | 18 | 2267 | 17 | ns |
| Shelves | 61811 | 7 | 664 | 5 | ns |
| Open Sea | 488692 | 56 | 4464 | 33 | ns |
CpG position relative to gene features based on annotations from Illumina (UCSC_RefGene_Group). TSS1500 and TSS200 are CpGs at –200 or 200–1500 upstream of are transcription start sites; Non-genic are CpG with no annotated gene features.
Shores = 0–2 kb from islands; shelves = 2–4 kb from islands
Enrichment of gene features and CpG regions in the conserved set compared to the full set based on hypergeometric test
Counts of Illumina human MethylationEPIC probes by β-value and concordance with MBD-seq at corresponding CpG regions.
| Counts of CpG regions by rms value | ||||
|---|---|---|---|---|
| CpG Category | Probe counts | rms < 0.3 | 0.3 ≤ rms ≤ 0.7 | rms > 0.7 |
| Hypo (β < 0.3) | 7548 | 6198 | 1000 | 350 |
| Hemi (0.3 ≤ β ≤ 0.7) | 3159 | 973 | 1827 | 359 |
| Hyper(β > 0.7) | 2956 | 171 | 1599 | 1186 |
Conserved probes on the HumanMethylationEPIC arrays were grouped by β-value. These are counts in each category.
CpG For each category of probes, the corresponding CpG regions were counted and grouped by CpG density normalized relative methylation score (rms) to determine concordance between the array and MBD-seq
Age-dependent differentially methylated CpGs/regions detected by conserved Illumina human MethylationEPIC probes and by MBD-seq.
| EPIC | MBD-seq | ||||||
|---|---|---|---|---|---|---|---|
| ProbeID | Gene | Region | Position (mm10) | Coef. | Age (P) | logFC | Age (P) |
| cg08949408 | Body; Island | chr2:13.01 | 0.32 | 0.001 | 3.3 | 1.3E-10 | |
| cg10444382 | Body; Island | chr10:84.76 | 0.24 | 9.4E-04 | 2.9 | 2.5E-08 | |
| cg22384902 | TSS1500; island | chr6:28.83 | 0.22 | 0.009 | 2.0 | 2.0E-06 | |
| cg06945399 | TSS200; Island | chr6:28.83 | 0.18 | 0.057 | 1.5 | 2.2E-05 | |
| cg23398076 | Body; Shelf | chr11:19.02 | 0.13 | 0.007 | 1.5 | 2.4E-05 | |
| cg05393688 | Body; Shore | chr14:76.51 | 0.17 | 0.005 | 1.5 | 2.8E-05 | |
| cg20563498 | Body; Shelf | chr7:97.32 | -0.02 | 0.27 | 1.1 | 3.2E-05 | |
These are age-dependent differentially methylated CpG regions discovered in the MBD-seq at an FDR of 10%; replicated for the corresponding CpG in the EPIC microarray at an uncorrected p-value cutoff of 0.05. Coef. is the linear regression coefficient (i.e., change in methylation β-value from young to old). LogFC is log2 fold change in methylation from young to old.
CpG location in relation to gene features and CpG region based in probe annotations for the human methylation microarray; gene feature annotations are the same for the corresponding regions in the mouse genome.
Chromosome and Megabase coordinate based on mm10 mouse reference genome
Strain-dependent differentially methylated CpGs/regions detected by both Illumina human MethylationEPIC probes and by MBD-seq.
| EPIC | MBD-seq | ||||||
|---|---|---|---|---|---|---|---|
| ProbeID | Gene | Region | Position (mm10) | Coef. | Strain (P) | logFC | Strain (P) |
| Differentially methylated CpGs detected by EPIC probe at FDR 10%; replicated by MBD-seq | |||||||
| cg21064315 | 3'UTR; Shore | chr4:118.36 | -0.82 | 5.5E-09 | -2.0 | 1.7E-04 | |
| cg14945867 | 1stExon; Island | chr14:46.79 | 0.27 | 1.3E-06 | 6.0 | 1.2E-09 | |
| cg04546815 | Body | chr4:98.78 | 0.34 | 1.6E-06 | 1.7 | 4.4E-04 | |
| cg10277781 | 1stExon; Island | chr14:46.79 | 0.35 | 1.9E-06 | 6.0 | 1.2E-09 | |
| cg00049718 | 5'UTR | chr3:103.02 | 0.40 | 2.6E-06 | 6.8 | 9.8E-15 | |
| cg07211292 | 3'UTR; Island | chr2:153.08 | -0.46 | 5.0E-06 | -1.5 | 1.4E-05 | |
| cg24255125 | Body; Island | chr9:42.52 | -0.36 | 7.8E-06 | -3.0 | 5.9E-09 | |
| cg03517030 | 1stExon; Island | chr2:90.85 | 0.35 | 1.6E-05 | 6.7 | 2.3E-14 | |
| cg05781968 | Body; Island | chr14:28.51 | 0.31 | 4.4E-05 | 2.3 | 1.0E-05 | |
| cg04154281 | Body; Shore | chr11:102.31 | 0.17 | 6.5E-05 | 0.7 | 0.03 | |
| cg06861375 | Body; Island | chr3:98.43 | 0.36 | 6.7E-05 | 4.5 | 2.8E-04 | |
| cg24959134 | - | chr10:92.44 | -0.33 | 9.4E-05 | -2.4 | 0.01 | |
| cg06552810 | - | chr2:106.19 | 0.26 | 1.1E-04 | 2.9 | 0.002 | |
| cg01663821 | Shore | chr3:98.94 | 0.19 | 1.3E-04 | 0.9 | 0.02 | |
| cg00597112 | - | chr11:109.01 | 0.21 | 1.4E-04 | 0.5 | 0.002 | |
| cg26857408 | Body; Shore | chr11:102.31 | 0.24 | 2.1E-04 | 0.7 | 0.03 | |
| cg15172734 | 5'UTR; Shore | chr14:26.53 | -0.11 | 3.4E-04 | -2.5 | 0.01 | |
| cg09990537 | 5'UTR; Shore | chr14:28.51 | 0.17 | 3.4E-04 | 1.0 | 0.004 | |
| cg12849734 | - | chr2:157.71 | 0.14 | 4.4E-04 | 0.9 | 0.01 | |
| cg21746387 | TSS1500; Shore | chr10:127.51 | -0.17 | 5.5E-04 | -3.4 | 0.001 | |
| cg11382417 | - | chr2:96.32 | -0.21 | 6.0E-04 | -4.7 | 1.3E-07 | |
| cg02865068 | Shore | chr2:105.66 | 0.11 | 9.7E-04 | 2.9 | 0.04 | |
| cg14275842 | Body; Island | chr11:70.62 | 0.18 | 0.001 | 1.0 | 0.005 | |
| cg02159996 | 5'UTR | chr4:33.13 | 0.13 | 0.001 | 1.2 | 2.5E-04 | |
| cg00920372 | - | chr19:45.33 | -0.08 | 0.001 | -1.5 | 8.8E-04 | |
| cg03422015 | Body | chr6:119.69 | 0.04 | 0.001 | 1.1 | 0.02 | |
| cg04340318 | - | chr4:86.04 | 0.16 | 0.001 | 2.3 | 0.001 | |
| cg14465355 | Body; Shore | chr12:110.64 | 0.06 | 0.001 | 0.6 | 0.02 | |
| cg15002641 | Body | chr1:133.39 | -0.10 | 0.001 | -1.0 | 0.02 | |
| Differentially methylated regions detected by MBD-seq at FDR 10%; replicated by EPIC | |||||||
| cg05362127 | TSS200; Island | chr14:28.51 | 0.33 | 0.002 | 2.3 | 9.4E-06 | |
| cg24142850 | - | chr8:92.55 | -0.09 | 0.005 | -2.9 | 9.4E-05 | |
| cg15585318 | Body; Island | chr14:28.51 | 0.22 | 0.006 | 1.8 | 2.1E-06 | |
| cg09595163 | Body; Island | chr14:28.51 | 0.18 | 0.006 | 2.3 | 1.2E-05 | |
| cg13868216 | Body; Island | chr15:79.26 | 0.11 | 0.01 | 1.6 | 1.8E-04 | |
| cg09972454 | Body; Shore | chr4:147.94 | -0.06 | 0.01 | -2.9 | 1.5E-06 | |
| cg18120446 | Island | chr5:41.75 | 0.01 | 0.02 | -2.2 | 2.7E-08 | |
These are strain-dependent differentially methylated CpGs (EPIC microarray) and CpG regions (MBD-seq) based on a “false discovery threshold” (FDR) cutoff of 10% and replication at an uncorrected p-value threshold of 0.05.
Coef. is the linear regression coefficient (i.e., difference in methylation relative to C57BL/6J; negative is lower methylation in DBA/2J; and positive is higher methylation in DBA/2J compared to C57BL/6J). LogFC is log2 fold difference in methylation (i.e., difference in methylation relative to DBA/2J; negative is lower methylation in DBA/2J; and positive is higher methylation in DBA/2J compared to C57BL/6J).
CpG location in relation to gene features and CpG region based in probe annotations for the human methylation microarray. For most conserved regions, mouse annotations are analogous to humans.
Chromosome and Megabase coordinate based on mm10 mouse reference genome