| Literature DB >> 21172051 |
Yun-Qin Chen1, Hui Yu, Yi-Xue Li, Yuan-Yuan Li.
Abstract
BACKGROUND: A 'head-to-head' (h2h) gene pair is defined as a genomic locus in which two adjacent genes are divergently transcribed from opposite strands of DNA. In our previous work, this gene organization was found to be ancient and conserved, which subjects functionally related genes to transcriptional co-regulation. However, some of the biological features of h2h pairs still need further clarification.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21172051 PMCID: PMC3024869 DOI: 10.1186/1471-2105-11-S11-S16
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Percentages of h2h pairs within particular TSS distance intervals
| (0, 100) bp | (100, 200) bp | (0, 400) bp | |
|---|---|---|---|
| Set H | 14.2% | 20%* | 54.9% |
| Set HM | 18.3% | 28.3%* | 68.6% |
| Set HMCF | 22.1% | 31.2%* | 74.0% |
| Set HMCF | 42.9%* | 35.7% | 92.9% |
*The largest percentage within a 100-bp interval. It can be seen that the peak intervals at the four gradually-increasing conservation levels are (100, 200) bp, (100, 200) bp, (100, 200) bp, and (0, 100) bp, respectively.
Figure 1TSS distance distributions of four h2h pair sets
Figure 2Distribution of Expression correlation indices for gene pairs from five different sets. Indices used: Pearson’s Correlation Coefficient (A), Mutual Rank (B), and Relative Rank (C). Data were from COXPRESdb.
Expression correlation within h2h pairs in DBH2H
| Correlation | Positive Correlation | Negative Correlation | ||||
|---|---|---|---|---|---|---|
| Pearsona | Spearmanb | Pearson | Spearman | Pearson | Spearman | |
| Set H | 0.55 | 0.51 | 0.61 | 0.54 | 0.42 | 0.42 |
| (52.2%)c | (43.2%) | (45.3%) | (36.2%) | (6.9%) | (7.0%) | |
| Set HM | 0.57 | 0.52 | 0.62 | 0.55 | 0.44 | 0.44 |
| (55.9%) | (47.3%) | (48.6%) | (40%) | (7.2%) | (7.3%) | |
| Set HMC | 0.57 | 0.53 | 0.62 | 0.55 | 0.46 | 0.46 |
| (55.8%) | (47.6%) | (48.3%) | (40%) | (7.4%) | (8.0%) | |
| Set HMCF | 0.59 | 0.55 | 0.64 | 0.59 | 0.45 | 0.43 |
| (57.5%) | (50.5%) | (51.3%) | (44%) | (6.2%) | (6.5%) | |
a Two-group Wilcoxon test, p<0.05 for set H vs set HM
b Two-group Wilcoxon test, p<0.05 for set H vs set HM
c A percentage in round brackets refers to the average Significance Ratio (SR).
Figure 3Distribution of Expression correlation indices for possible pairs formed by genes from four different sets. Indices used: Pearson Correlation Coefficient (A), Mutual Rank (B). Data were from COXPRESdb.
TF-association of h2h pairs
| TF-sharing pairs | TF-exclusive pairs | Proportion of TF-sharing pairs in annotated pairs | |
|---|---|---|---|
| Adjacent | 53 | 374 | 12.4% |
| Set H | 39 | 168 | 18.8% |
| Set HM | 13 | 51 | 20.3% |
| Set HMC | 6 | 24 | 20% |
| Set HMCF | 3 | 2 | 60% |
Figure 4TF similarity is positively related with expression correlation of h2h gene pairs. (A) boxplots of expression correlation values of four h2h pair groups with different level of TF similarities. Pairs sharing at least one TF ( 2nd and 3rd ) boxplots displayed a significantly higher expression correlation than those with no common TF (1st boxplot): mean values 0.564 vs. 0.355, two group t-test p =1.21e-09; pairs sharing two or more TFs (3rd boxplot) displayed a significantly higher expression correlation than those with only one common TF (2nd boxplot): mean values 0.619 vs 0.496., two group t-test p=0.02.(B) the scatter plot of expression correlation values vs. TF similarities (defined as the fraction of shared TFs in the union TFs).
Seven h2h gene pairs in which one gene regulates the other
| Pearson Correlation Coefficient (PCC) | Mutual Rank (MR) | |
|---|---|---|
| CSTF1 -> AURKA | 0.55 | 68.2 |
| DTX3L -> PARP9 | 0.807 | 1 |
| WDSOF1 -> SLC25A32 | 0.701 | 1.7 |
| MCM4 -> PRKDC | 0.7 | 7.8 |
| RECQL -> GOLT1B | 0.627 | 1.4 |
| NUFIP1 -> KIAA1704 | 0.487 | 29.9 |
| POLR3K -> C16orf33 | 0.701 | 1 |
Functional similarities of h2h gene pairs
| BP | MF | CC | |
|---|---|---|---|
| Adjacent gene pairs | 0.29 (1603) a | 0.34 b (1859) | 0.44 b(1993) |
| Set H | 0.33c (405) | 0.37 c(492) | 0.51 c (508) |
| Set HM | 0.29 (105) | 0.36 (112) | 0.51 (123) |
| Set HMC | 0.21(48 ) | 0.33 (42) | 0.45 (53) |
| Set HMCF | 0.38 (8) | 0.27 (7) | 0.59 (9) |
a Number of annotated h2h pairs are shown in round brackets.
b Functional similarities of set H were significantly higher than h2h genes’ adjacent gene pairs (p<0.05).
c Functional similarities of set H, at any GO subsystem, were significantly higher than random pair sets of comparable sizes (p<0.01).
Average functional similarity of h2h gene pairs delimited by expression correlation thresholds
| Functional Similarity | |||
|---|---|---|---|
| PCC>0.355* | PCC>0.5 | PCC>0.6 | |
| Set H | 0.576 | 0.594 | 0.654 |
| Set HM | 0.538 | 0.532 | 0.564 |
| Set HMC | 0.512 | 0.541 | 0.693 |
| Set HMCF | 0.687 | 0.567 | 0.848 |
* 0.355 was the median expression correlation of GO-annotated human h2h gene pairs.