| Literature DB >> 24564680 |
Abstract
BACKGROUND: During eukaryotic genome evolution, tandem gene duplication is the most frequent event giving rise to clustered gene families. However, how expression divergence between tandemly duplicated genes has emerged and maintained remain unclear. In particular, it is unknown if epigenetic regulators have been involved in the process.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24564680 PMCID: PMC4046690 DOI: 10.1186/1471-2164-15-S1-S8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Rank correlations (ρ) and partial rank correlations (ρp) of the genomic properties with expression dissimilarities (measured by ExpD 1- r or ExpD Euc) of the non-paralogous adjacent genes
| Genomic properties a |
|
| ||
|---|---|---|---|---|
|
|
|
|
| |
|
| 0.180 (<10-217) | 0.130 (<10-113) | 0.112 (<10-83) | 0.040 (<10-11) |
|
| 0.120 (<10-98) | 0.032 (<10-7) | 0.104 (<10-72) | 0.058 (<10-22) |
| Δ | 0.077 (<10-39) | 0.049 (<10-16) | 0.196 (<10-258) | 0.182 (<10-227) |
a "d", intergenic distance; "#CTCF", number of overlapping CTCF-binding sites; "ΔCpG O/E", difference in upstream DNA methylation.
b P values show the probabilities of the observations under the hypothesis of no correlation.
c Spearman's partial correlation coefficient ρp is computed by controlling for the other two genomic properties listed in a
Rank correlations (ρ) and partial rank correlations (ρp) of the examined genomic properties with expression dissimilarities (measured by ExpD 1- r or ExpD Euc) of the paralogous adjacent genes
| Genomic properties a |
|
| ||
|---|---|---|---|---|
|
|
|
|
| |
|
| 0.039 (0.164) | -0.011 (0.707) | 0.100 (<10-3) | 0.021 (0.461) |
|
| 0.111 (<10-4) | 0.103 (<10-3) | 0.187 (<10-10) | 0.159 (<10-7) |
| Δ | 0.069 (0.015) | 0.054 (0.023) | 0.102 (<10-3) | 0.094 (<10-3) |
a "d", intergenic distance; "#CTCF", number of overlapping CTCF-binding sites; "ΔCpG O/E", difference in upstream DNA methylation.
b P values show the probabilities of the observations under the hypothesis of no correlation.
c Spearman's partial correlation coefficient ρp is computed by controlling for the other two genomic properties listed in a
Rank correlations (ρ) and partial rank correlations (ρp) of the examined genomic properties with expression dissimilarities (measured by ExpD 1- r or ExpD Euc) of the adjacent paralogous genes of different orientations.
| Genomic properties a |
|
| ||
|---|---|---|---|---|
|
|
|
|
| |
|
| ||||
|
| 0.165 (0.072) | 0.106 (0.250) | 0.150 (0.109) | 0.036 (0.700) |
|
| 0.213 (0.020) | 0.173 (0.058) | 0.352 (<10-4) | 0.327 (<10-3) |
| Δ | -0.024 (0.794) | 0.001 (0.991) | -0.071 (0.443) | -0.054 (0.561) |
|
| ||||
|
| 0.005 (0.870) | -0.050 (0.110) | 0.089 (<10-2) | 0.016 (0.607) |
|
| 0.100 (<10-2) | 0.110 (<10-3) | 0.151 (<10-5) | 0.127 (<10-4) |
| Δ | 0.083 (<10-2) | 0.085 (<10-2) | 0.109 (<10-3) | 0.104 (<10-3) |
|
| ||||
|
| 0.139 (0.110) | 0.105 (0.230) | 0.145 (0.096) | 0.046 (0.602) |
|
| 0.113 (0.195) | 0.061 (0.482) | 0.271 (<10-2) | 0.202 (0.019) |
| Δ | 0.050 (0.569) | 0.018 (0.0835) | 0.212 (0.014) | 0.153 (0.078) |
a "d", intergenic distance; "#CTCF", number of overlapping CTCF-binding sites; "ΔCpG O/E", difference in upstream DNA methylation.
b P values show the probabilities of the observations under the hypothesis of no correlation.
c Spearman's partial correlation coefficient ρp is computed by controlling for the other two genomic properties listed in a
d There were 120, 1003 and 133 head-to-head, head-to-tail and tail-to-tail adjacent paralogs, respectively.
Rank correlations (ρ) and partial rank correlations (ρp) of the examined genomic properties with the divergence time of the paralogous adjacent genes.
| Genomic properties a |
|
| ||
|---|---|---|---|---|
|
|
|
|
| |
|
| ||||
|
| 0.138 (<10-5) | 0.057 (0.056) | 0.141 (<10-6) | 0.053 (0.060) |
|
| 0.225 (<10-13) | 0.189 (<10-9) | 0.219 (<10-14) | 0.178 (<10-9) |
| Δ | 0.097 (<10-2) | 0.089 (<10-2) | 0.084 (<10-2) | 0.073 (<10-2) |
|
| ||||
|
| 0.086 (0.152) | -0.070 (0.245) | 0.108 (0.072) | -0.038 (0.526) |
|
| 0.292 (<10-6) | 0.276 (<10-5) | 0.273 (<10-5) | 0.243 (<10-4) |
| Δ | 0.122 (0.043) | 0.079 (0.186) | 0.132 (0.027) | 0.092 (0.125) |
a "d", intergenic distance; "#CTCF", number of overlapping CTCF-binding sites; "ΔCpG O/E", difference in upstream DNA methylation.
b P values show the probabilities of the observations under the hypothesis of no correlation.
c Spearman's partial correlation coefficient ρp is computed by controlling for the other two genomic properties listed in a
Enriched/depleted GO terms at level 4 for genes with high #CTCF/d flanking regions
| Functional categories | Duplicate with 25% top | Non-duplicates with 25% top |
|---|---|---|
|
|
| |
|
| ||
|
| transferase activity, transferring phosphorus-containing groups | |
|
| hydrolase activity, acting on acid anhydrides (GO:0016817) (↑, | |
|
| protein domain specific binding | |
|
| substrate-specific transmembrane transporter activity (GO:0022891) (↑, | |
|
| cytoskeletal protein binding | |
|
| phospholipid binding | |
|
| identical protein binding | |
|
| receptor binding | |
| transferase activity, transferring glycosyl groups (GO:0016757) (↓, | heat shock protein binding | |
| monooxygenase activity | oxidoreductase activity, acting on peroxide as acceptor (GO:0016684) (↑, | |
|
| ||
| extracellular matrix | extracellular matrix | |
|
| extracelluar space | |
|
| endomembrane system | |
| membrane-bounded vesicle (GO:0031988) (↑, | ||
| cell surface | ||
| cell projection | ||
| midbody | ||
|
| ||
| response to organic substance | response to organic substance | |
|
| lipid metabolic process | |
|
| defense response | |
|
| response to other organism | |
|
| carbohydrate metabolic process | |
|
| response to drug | |
|
| cell-cell signaling | |
|
| cytoskeleton organization | |
|
| regulation of developmental process (GO:0050793) (↑, | |
|
| regulation of immune system process (GO:0002682) (↑, | |
a Specifically enriched GO terms are highlighted in bold fonts
b The symbol ↑or ↓ represents the enrichment status of "enriched" or "depleted" of the GO term, respectively.
c P values were Bonferroni-corrected for multiple tests