| Literature DB >> 22474411 |
Xin Du1, Leng Han, An-Yuan Guo, Zhongming Zhao.
Abstract
CpG islands are typically located in the 5' end of genes and considered as gene markers because they play important roles in gene regulation via epigenetic change. In this study, we compared the features of CpG islands identified by several major algorithms by setting the parameter cutoff values in order to obtain a similar number of CpG islands in a genome. This approach allows us to systematically compare the methylation and gene expression patterns in the identified CpG islands. We found that Takai and Jones' algorithm tends to identify longer CpG islands but with weaker CpG island features (e.g., lower GC content and lower ratio of the observed over expected CpGs) and higher methylation level. Conversely, the CpG clusters identified by Hackenberg et al.'s algorithm using stringent criteria are shorter and have stronger features and lower methylation level. In addition, we used the genome-wide base-resolution methylation profile in two cell lines to show that genes with a lower methylation level at the promoter-associated CpG islands tend to express in more tissues and have stronger expression. Our results validated that the DNA methylation of promoter-associated CpG islands suppresses gene expression at the genome level.Entities:
Year: 2012 PMID: 22474411 PMCID: PMC3303704 DOI: 10.1155/2012/598987
Source DB: PubMed Journal: Comp Funct Genomics ISSN: 1531-6912
Summary of the CpG islands (CGIs) or CpG clusters (CGCs) identified by different algorithms in the human genome.
| Method* | Whole genome CGIs | Promoter-associated CGIs | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| # CGIs | Length (bp) | GC content (%) | CpG O/E** | # CGIs | Proportion (%) | # genes | CGIs/gene | Length (bp) | GC content (%) | CpG O/E** | ||
| TJ-CGIs | 37, 729 | 1089 | 60.6 | 0.717 | 13,207 | 35.0 | 12, 521 | 1.05 | 1477 | 62.2 | 0.759 | |
| CGCs-15 | 37, 184 | 605 | 68.7 | 0.855 | 14,419 | 38.8 | 11, 292 | 1.28 | 694 | 70.1 | 0.885 | |
| CGCs-20 | 25, 454 | 727 | 70.3 | 0.853 | 12,115 | 47.6 | 10, 245 | 1.18 | 767 | 70.6 | 0.885 | |
| UCSC-CGIs | 27, 639 | 763 | 66.1 | 0.862 | 12,297 | 44.5 | 11, 744 | 1.05 | 964 | 69.3 | 0.862 | |
*The methods/algorithms for screening CpG islands or CpG clusters are described in Section 2.
**CpG O/E: the ratio of the observed versus expected number of CpG dinucleotides in a sequence.
Figure 1Distribution of features of CpG islands identified by different algorithms. (a) Length. (b) GC content (%). (c) CpG O/E.
Figure 2Methylation level of the CpG islands or clusters in the whole genomic regions or associated with the promoter regions. (a) Methylation data was based on human methylome from H1 cell line. (b) Methylation data was based on human methylome from IMR90 cell line.
Pearson's correlation coefficient between methylation level of promoter-associated CGIs and gene expression strength.
| Method* | H1 | IMR90 | |||
|---|---|---|---|---|---|
|
|
|
|
| ||
| TJ-CGIs | −0.016 | 0.128 | −0.021 | 0.0419 | |
| CGCs-15 | −0.037 | 4.78 × 10−4 | −0.036 | 7.52 × 10−4 | |
| CGCs-20 | −0.037 | 9.47 × 10−4 | −0.040 | 3.33 × 10−4 | |
| UCSC-CGIs | −0.045 | 1.40 × 10−5 | −0.038 | 2.89 × 10−4 | |
*The methods/algorithms for screening CpG islands or CpG clusters are described in Section 2.
Figure 3Relationship between methylation level of promoter-associated CpG islands and the number of expressed tissues. (a) H1 cell line. (b) IMR90 cell line.
Figure 4Relationship between methylation level of promoter-associated CpG islands and expression strength. (a) Methylation data was based on human methylome from H1 cell line. (b) Methylation data was based on human methylome from IMR90 cell line.
Figure 5Correlation between the number of expressed tissues and expression strength.