| Literature DB >> 24923326 |
Trees-Juen Chuang1, Tai-Wei Chiang2.
Abstract
Gene expression is largely regulated by DNA methylation, transcription factor (TF), and microRNA (miRNA) before, during, and after transcription, respectively. Although the evolutionary effects of TF/miRNA regulations have been widely studied, evolutionary analysis of simultaneously accounting for DNA methylation, TF, and miRNA regulations and whether promoter methylation and gene body (coding regions) methylation have different effects on the rate of gene evolution remain uninvestigated. Here, we compared human-macaque and human-mouse protein evolutionary rates against experimentally determined single base-resolution DNA methylation data, revealing that promoter methylation level is positively correlated with protein evolutionary rates but negatively correlated with TF/miRNA regulations, whereas the opposite was observed for gene body methylation level. Our results showed that the relative importance of these regulatory factors in determining the rate of mammalian protein evolution is as follows: Promoter methylation ≈ miRNA regulation > gene body methylation > TF regulation, and further indicated that promoter methylation and miRNA regulation have a significant dependent effect on protein evolutionary rates. Although the mechanisms underlying cooperation between DNA methylation and TFs/miRNAs in gene regulation remain unclear, our study helps to not only illuminate the impact of these regulatory factors on mammalian protein evolution but also their intricate interaction within gene regulatory networks.Entities:
Keywords: comparative genomics; microRNA; promoter/gene body methylation; protein evolutionary rate; transcription factor
Mesh:
Substances:
Year: 2014 PMID: 24923326 PMCID: PMC4080426 DOI: 10.1093/gbe/evu124
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Experimentally Determined Single Base-Resolution DNA Methylation Data Used in this Study
| Sample | Description (Ref.) | No. of Genes (Sampled #CG ≥ 10) | Average CpG Coverage | Average CpG Coverage |
|---|---|---|---|---|
| S1 | Peripheral blood B lymphocytes ( | 10,627 | 76.04 | 86.69 |
| S2 | Peripheral blood hematopoietic stem/progenitor cells
(CD133+CD34+CD38−Lin−) ( | 10,451 | 78.03 | 89.10 |
| S3 | Newborn foreskin fibroblasts ( | 10,388 | 89.34 | 93.56 |
| S4 | H1 embryonic stem cells ( | 10,751 | 68.54 | 89.41 |
| S5 | Breast cells from adult female ( | 10,916 | 96.78 | 98.75 |
| S6 | Peripheral blood hematopoietic stem/progenitor cells
(CD34+CD38−Lin−) ( | 10,722 | 80.04 | 89.41 |
| S7 | Fetal lung fibroblasts ( | 10,848 | 78.13 | 93.62 |
| S8 | Peripheral blood granulocytic neutrophils ( | 10,081 | 71.27 | 82.91 |
| S9 | Prefrontal cortex ( | 10,556 | 80.58 | 90.38 |
| S10 | H9 embryonic stem cells ( | 10,522 | 91.81 | 95.31 |
| S11 | Fibroblasts derived from H9 embryonic stem cells ( | 10,436 | 90.01 | 93.65 |
aCoverage of CpG dinucleotides for each promoter/gene body (protein-coding) region = (number of sampled CpG dinucleotides)/(number of sampled CpG dinucleotides + number of nonsampled CpG dinucleotides).
FSpearman’s rank correlation coefficient (ρ) between CpGO/E and mCG density for the promoter and gene body regions in the 11 analyzed methylomes. *P < 0.05 and ***P < 0.001.
Spearman’s Rank Correlation Coefficient (ρ) between Promoter (or Gene Body) mCG Density and the Evolutionary Rates (dN, dS, and dN/dS) for Human–Macaque and Human–Mouse Comparisons Before and After Controlling for Gene Body (or Promoter) mCG Density and Ten Other Confounding Factors
| Human–macaque | ||||||
| Promoter methylation | 0.2052*** | 0.0798*** | 0.1928*** | 0.1434*** | 0.0009 | 0.1601*** |
| Gene body methylation | −0.0975*** | −0.0872*** | −0.0803*** | −0.0607*** | −0.0121 | −0.0718*** |
| Human–mouse | ||||||
| Promoter methylation | 0.268*** | 0.1453*** | 0.2304*** | 0.1907*** | 0.0329 | 0.1827*** |
| Gene body methylation | −0.096*** | −0.0768*** | −0.0793*** | −0.0289 | 0.025 | −0.0452*** |
Note.—The ten confounding factors are NTF, NmiR, protein connectivity, expression level, tissue specificity, UTR length, intron length, intron number, solvent accessibility, and disorder content.
aThe analysis was based on 5,128 human genes and their macaque orthologs.
bThe analysis was based on 5,357 human genes and their mouse orthologs.
*P < 0.05 and ***P < 0.001.
FSpearman’s rank correlation coefficient (ρ) between average promoter/gene body mCG densities for the 11 analyzed samples (table 1) and the 2 regulatory factors (NTF or NmiR) (A) before and (B) after controlling for 10 other confounding factors (see text). The analyses were based on 5,418 genes containing all confounding factors examined. ***P < 0.001.
FAbsolute values of Spearman’s rank correlation coefficients (ρ) between protein evolutionary rates (dN and dN/dS) and one of the indicated factors (promoter/gene body mCG density [mCGp and mCGg], NTF, NmiR, protein connectivity [PPI], expression level [ExpLvl], tissue specificity [τ], UTR length [UTR_L], intron length [In_L], intron number [InNum], solvent accessibility [SolAcc], or disorder content [DisCont]), while simultaneously controlling for the other 11 factors, for both human–macaque (5,128 genes) and human–mouse (5,357 genes) orthologs.
FThe median values of promoter mCG density (mCGp) and NmiR in five groups of human protein-coding genes of similar size (divided according to the magnitudes of dN and dN/dS, from low to high) for both human–macaque (5,128 genes) and human–mouse (5,357 genes) comparisons.
FThe correlations between promoter and gene body mCG densities, NTF, NmiR, and protein evolutionary rates (dN and dN/dS). Solid and dotted lines represent positive and negative correlations, respectively.