| Literature DB >> 21278160 |
Hehuang Xie1, Min Wang, Alexandre de Andrade, Maria de F Bonaldo, Vasil Galat, Kelly Arndt, Veena Rajaram, Stewart Goldman, Tadanori Tomita, Marcelo B Soares.
Abstract
Genomic DNA methylation contributes substantively to transcriptional regulations that underlie mammalian development and cellular differentiation. Much effort has been made to decipher the molecular mechanisms governing the establishment and maintenance of DNA methylation patterns. However, little is known about genome-wide variation of DNA methylation patterns. In this study, we introduced the concept of methylation entropy, a measure of the randomness of DNA methylation patterns in a cell population, and exploited it to assess the variability in DNA methylation patterns of Alu repeats and promoters. A few interesting observations were made: (i) within a cell population, methylation entropy varies among genomic loci; (ii) among cell populations, the methylation entropies of most genomic loci remain constant; (iii) compared to normal tissue controls, some tumors exhibit greater methylation entropies; (iv) Alu elements with high methylation entropy are associated with high GC content but depletion of CpG dinucleotides and (v) Alu elements in the intronic regions or far from CpG islands are associated with low methylation entropy. We further identified 12 putative allelic-specific methylated genomic loci, including four Alu elements and eight promoters. Lastly, using subcloned normal fibroblast cells, we demonstrated the highly variable methylation patterns are resulted from low fidelity of DNA methylation inheritance.Entities:
Mesh:
Year: 2011 PMID: 21278160 PMCID: PMC3105398 DOI: 10.1093/nar/gkr017
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The formula of methylation entropy and the examples for genomic loci with various methylation entropies in a cell population. (A) The formula of methylation entropy. The determination of methylation entropy requires three parameters: the number of CpG sites, the total number of sequence reads generated and the occurrence of each methylation pattern. (B–E) Genomic loci with various methylation entropies.
Figure 2.The distribution of methylation entropy for simulation results. For a genomic locus with four CpG dinucleotides and average methylation level as 50%, 10 000 methylation data sets were generated. Each data set comprised of 16 sequence reads with four CpG sites per read. The dashed curve represents simulation result for stochastic methylation event. For 10 000 data sets, the methylation entropy ranged from 0.54 to 0.97 with average as 0.80. The solid curve represents simulation result for allelic-specific methylation as an example of deterministic methylation event. For 10 000 data sets, the methylation entropy ranged from 0.24 to 0.52 with average as 0.35.
Statistics of high-throughput bisulfite sequencing data for Alu and flanking sequence
| Sample ID | NC1 | NC2 | PN1 | PN2 | PA1 | PA2 | RL1 | RL2 |
|---|---|---|---|---|---|---|---|---|
| Number of Loci | 2530 | 3678 | 3076 | 2334 | 3730 | 2958 | 2153 | 2965 |
| Number of sequence reads | 70 154 | 102 405 | 76 033 | 52 652 | 84 603 | 71 289 | 46 938 | 77 134 |
| Number of CpG sites | 402 506 | 653 650 | 467 705 | 313 967 | 542 630 | 424 410 | 288 981 | 435 017 |
| Average reads per locus | 27.7 | 27.8 | 24.7 | 22.6 | 22.7 | 24.1 | 21.8 | 26.0 |
| Average CpG sites per reads | 5.7 | 6.4 | 6.2 | 6.0 | 6.4 | 6.0 | 6.2 | 5.6 |
Figure 3.The distribution of methylation entropies in tissue samples. The x-axis represents different levels of methylation entropies. The y-axis represents the percentage of genomic loci examined. (A) The distribution of Alu methylation entropies. NC, PN, PA and RL represent normal control, primary non-aggressive, primary aggressive and relapsed ependymoma tissues, respectively. PA1* and RL1* were derived from one same individual. (B) The distribution of methylation entropies of all possible segments with six contiguous CpG sites in promoters and CGIs on chromosome 21.
Figure 4.The correlation of methylation entropies between the primary and relapsed ependymoma tissues from one individual. Each dot represents a genomic locus with the methylation entropies calculated for primary (PA1*) and relapsed (RL1*) tumors.
Figure 5.The methylation status of CpG/CpG dyads in subcloned human normal lung fibroblast determined by hairpin-bisulfite PCR. (A) Bisulfite sequencing results for genomic locus 1 (chr9:139174924-139175041). (B) Bisulfite sequencing results for genomic locus 2 (chr10:134480046-134480230). The methylated cytosines are indicated with filled circles while unmethylated cytosines are indicated with open circles. The bold curved lines represent hairpin linkers connected to both complementary strands. The symmetrical methylation statuses of CpG/CpG dyads indicate an accurate methylation inheritance, while asymmetrical methylation status (hemimethylated) indicates a failure in the transmission of methylation status or a de novo methylation event.
Genomic features or DNA related attributes associated with Alu methylation entropy
| Attribute name | Direction of change | Statistical test | Significance (not adjusted) | Significance (Bonferroni) | Significance threshold (FDR) |
|---|---|---|---|---|---|
| GC content | Increase | WilcoxonRankSum | 1.24E-08 | 3.50E-06 | 3.53E-05 |
| tetraNT_AATT | Decrease | WilcoxonRankSum | 3.63E-08 | 1.03E-05 | 7.07E-05 |
| tetraNT_TAAT | Decrease | WilcoxonRankSum | 1.25E-07 | 3.55E-05 | 1.06E-04 |
| cCount | Increase | WilcoxonRankSum | 2.12E-07 | 5.99E-05 | 1.41E-04 |
| tetraNT_CCCT | Increase | WilcoxonRankSum | 3.04E-07 | 8.60E-05 | 1.77E-04 |
| in Intron | Decrease | Chi-Square test | 5.80E-06 | 1.64E-03 | 2.12E-04 |
| tetraNT_ATTA | Decrease | WilcoxonRankSum | 6.10E-06 | 1.73E-03 | 2.47E-04 |
| tetraNT_ATTT | Decrease | WilcoxonRankSum | 1.52E-05 | 4.29E-03 | 2.83E-04 |
| tetraNT_ATAT | Decrease | WilcoxonRankSum | 2.96E-05 | 8.37E-03 | 3.18E-04 |
| distance to most adjacent CGI | Decrease | WilcoxonRankSum | 5.34E-05 | 1.51E-02 | 3.53E-04 |
| tetraNT_TAGT | Decrease | WilcoxonRankSum | 5.82E-05 | 1.65E-02 | 3.89E-04 |
| tetraNT_AAAT | Decrease | WilcoxonRankSum | 6.10E-05 | 1.73E-02 | 4.24E-04 |
| tetraNT_CCCC | Increase | WilcoxonRankSum | 6.13E-05 | 1.73E-02 | 4.59E-04 |
| tetraNT_TTTA | Decrease | WilcoxonRankSum | 1.17E-04 | 3.30E-02 | 4.95E-04 |
| tetraNT_TATA | Decrease | WilcoxonRankSum | 1.40E-04 | 3.97E-02 | 5.30E-04 |
| cgRatio | Decrease | WilcoxonRankSum | 1.57E-04 | 4.43E-02 | 5.65E-04 |
| tetraNT_AATA | Decrease | WilcoxonRankSum | 1.67E-04 | 4.71E-02 | 6.01E-04 |
aThe direction of change (increase or decrease) indicated the association of genomic region with high methylation entropy with the genomic features or DNA related attributes examined.