| Literature DB >> 22369481 |
Darby Tien-Hao Chang1, Chi-Yeh Wu, Chen-Yu Fan.
Abstract
BACKGROUND: Head-to-head (h2h) genes are prone to have association in expression and in functionality and have been shown conserved in evolution. Currently there are many studies on such h2h gene pairs. We found that the previous studies extremely focused on human genome. Furthermore, they only focused on analyses that require only gene or protein sequences but not conducted a systematic investigation on other promoter features such as the binding evidence of specific transcription factors (TFs). This is mainly because of the incomplete resources of higher organisms, though they are relatively of interest, than model organisms such as Saccharomyces cerevisiae. The authors of this study recently integrated nine promoter features of 6603 genes of S. cerevisiae from six databases and five papers. These resources are suitable to conduct a comprehensive analysis of h2h genes in S. cerevisiae.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22369481 PMCID: PMC3303733 DOI: 10.1186/1471-2164-13-S1-S11
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Distribution of h2h gene pairs by chromosome
| Chr | Length (bp) | #gene | #pair | #h2h | #h2t | #t2t | %h2h | %h2t | %t2t | |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 230,208 | 117 | 5.08 | 98 | 26 | 41 | 31 | 26.5 | 41.8 | 31.6 |
| 2 | 813,179 | 456 | 5.61 | 412 | 101 | 200 | 111 | 24.5 | 48.5 | 26.9 |
| 3 | 316,617 | 183 | 5.78 | 161 | 37 | 82 | 42 | 23.0 | 50.9 | 26.1 |
| 4 | 1,531,919 | 837 | 5.46 | 757 | 187 | 365 | 205 | 24.7 | 48.2 | 27.1 |
| 5 | 576,869 | 324 | 5.62 | 278 | 70 | 134 | 74 | 25.2 | 48.2 | 26.6 |
| 6 | 270,148 | 141 | 5.22 | 126 | 32 | 66 | 28 | 25.4 | 52.4 | 22.2 |
| 7 | 1,090,947 | 583 | 5.34 | 531 | 133 | 264 | 134 | 25.0 | 49.7 | 25.2 |
| 8 | 562,643 | 321 | 5.71 | 294 | 74 | 144 | 76 | 25.2 | 49.0 | 25.9 |
| 9 | 439,885 | 241 | 5.48 | 217 | 57 | 99 | 61 | 26.3 | 45.6 | 28.1 |
| 10 | 745,741 | 398 | 5.34 | 353 | 90 | 165 | 98 | 25.5 | 46.7 | 27.8 |
| 11 | 666,454 | 348 | 5.22 | 321 | 81 | 159 | 81 | 25.2 | 49.5 | 25.2 |
| 12 | 1,078,175 | 578 | 5.36 | 513 | 133 | 237 | 143 | 25.9 | 46.2 | 27.9 |
| 13 | 924,429 | 505 | 5.46 | 465 | 122 | 225 | 118 | 26.2 | 48.4 | 25.4 |
| 14 | 784,334 | 435 | 5.55 | 397 | 94 | 205 | 98 | 23.7 | 51.6 | 24.7 |
| 15 | 1,091,289 | 598 | 5.48 | 546 | 145 | 257 | 144 | 26.6 | 47.1 | 26.4 |
| 16 | 948,062 | 511 | 5.39 | 465 | 122 | 213 | 130 | 26.2 | 45.8 | 28.0 |
| Overall | 12,070,899 | 6,576 | 5.45 | 5,934 | 1504 | 2,856 | 1,574 | 25.3 | 48.1 | 26.5 |
1 Number of genes per 10 kbp.
Figure 1Distributions of gene distance of adjacent genes. (a) The gene distance is measured as the number of base pairs in between the coding regions of two genes. (b) The gene distance is measured as the number of base pairs in between the transcription start sites (TSSs) of two h2h genes. H2h with high association represents gene pairs with functional similarity higher than the genome average; while h2h with low association represents the remaining h2h gene pairs (see the Methods section for the details).
Association of adjacent genes in terms of number of TFs that bind/regulate both genes
| #pair | Co-expression | Functional association | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| #TF | H2h | H2t | T2t | H2h | H2t | T2t | H2h | H2t | T2t | |
| TFs with binding evidence to both genes | ||||||||||
| 0 TF | 405 | 1,761 | 1,229 | 0.229 | 0.221 | 0.245 | 1.839 | 1.641 | 1.690 | |
| 1 TF | 301 | 595 | 240 | 0.256 | 0.215 | 0.246 | 1.736 | 1.727 | 1.638 | |
| 2 TFs | 271 | 246 | 0.246 | 0.229 | 1.779 | 1.724 | ||||
| 3 TFs | 158 | 113 | 0.261 | 0.252 | 1.663 | 1.753 | ||||
| 4 TFs | 117 | 0.268 | 1.560 | |||||||
| 5 TFs | ||||||||||
| PCC | 0.911 | 0.918 | 0.899 | -0.247 | 0.877 | 0.396 | ||||
| TFs with regulation evidence to both genes | ||||||||||
| 0 TF | 1,053 | 2,098 | 1,212 | 0.232 | 0.212 | 0.237 | 1.673 | 1.572 | 1.644 | |
| 1 TF | 299 | 522 | 265 | 0.260 | 0.226 | 0.269 | 1.885 | 1.854 | 1.660 | |
| 2 TFs | ||||||||||
| PCC | 0.972 | 0.931 | 0.986 | 0.992 | 0.991 | 0.880 | ||||
| Overall | 1,504 | 2,856 | 1,574 | |||||||
Values highlighted with bold font indicate using more than and equal to the number of TFs for sufficient pairs. For example, number of h2h pairs with four TFs with binding evidence is 117; while number of h2t pair with ≥4 TFs with binding evidence is 141. Higher co-expression or functional association indicates better gene association; see the Methods section for the details.
TFs that prefer regulation of h2h genes
| TF | |||||||
|---|---|---|---|---|---|---|---|
| STP2 | 7 | 0.375 | 4.358 | 3 | 42.9 | 235 | 6.0 |
| RME1 | 6 | 0.240 | 3.437 | 0 | 0.0 | 156 | 7.7 |
| PDR3 | 12 | 0.288 | 2.931 | 2 | 16.7 | 446 | 5.4 |
| SFP1 | 22 | 0.335 | 2.452 | 0 | 0.0 | 432 | 10.2 |
| GCN4 | 195 | 0.269 | 2.250 | 1 | 0.5 | 2,058 | 19.0 |
| SFP1 | 42 | 0.335 | 2.225 | 0 | 0.0 | 886 | 9.5 |
| RPN4 | 41 | 0.248 | 2.130 | 0 | 0.0 | 169 | 48.5 |
| FHL1 | 57 | 0.273 | 2.101 | 0 | 0.0 | 1,120 | 10.2 |
| MET4 | 757 | 0.330 | 2.034 | 26 | 3.4 | 19,090 | 7.9 |
1 Number of h2h gene pairs both bound/regulated by the TF. 2 Average co-expression of the h2h gene pairs. 3 Average functional similarity of the h2h gene pairs. 4 Number of the h2h gene pairs with interactions. 5 Ratio of the h2h gene pairs with interactions. 6 Number of genes bound/regulated by the TF with evidence. 7 Ratio of genes bound/regulated by the TF that are h2h genes.
H2h pairs where both genes were bound/regulated by STP2
| Pair | Systematic name | Gene name | Gene ontology (GO) annotations | |
|---|---|---|---|---|
| 1 | YDL234C | GYP7 | Regulation | BP GO:0032889 - regulation of vacuole fusion, non-autophagic; CC GO:0005737 - cytoplasm; MF GO:0005097 - Rab GTPase activator activity |
| YDL233W | n/a | Regulation | CC GO:0005634 - nucleus | |
| 3 | YHR136C | SPL2 | Regulation | BP GO:0009266 - response to temperature stimulus; CC GO:0005737 - cytoplasm; MF GO:0004860 - protein kinase inhibitor activity |
| YHR137W | ARO9 | Regulation | BP GO:0009058 - biosynthetic process; CC GO:0005634 - nucleus; MF GO:0016769 - transferase activity, transferring nitrogenous groups | |
| 4 | YIR027C | DAL1 | Regulation | BP GO:0009442 - allantoin assimilation pathway; MF GO:0004038 - allantoinase activity |
| YIR028W | DAL4 | Regulation | BP GO:0015720 - allantoin transport; CC GO:0016021 - integral to membrane; MF GO:0005274 - allantoin uptake transmembrane transporter activity | |
| 7 | YOL155C | HPF1 | Regulation | BP GO:0031505 - fungal-type cell wall organization; CC GO:0009277 - fungal-type cell wallMF GO:0015926 - glucosidase activity |
| YOL154W | ZPS1 | Regulation | CC GO:0009277 - fungal-type cell wall |
Pairs with known interactions are highlighted with bold font. 1 Evidence type (binding or regulation) of STP2 and the gene.
Figure 2Gene association of TATA-containing and TATA-less h2h gene pairs. The -axis is the co-expression in (a) and functional similarity in (b) while the -axis is the frequency of h2h gene pairs with the corresponding values. Higher value indicates better gene association for both indices; see the Methods section for the details.
Figure 3Gene association of different promoter accessibility. The -axis is the co-expression in (a) and functional similarity in (b) while the -axis is the frequency of h2h gene pairs with the corresponding values. Higher value indicates better gene association for both indices; see the Methods section for the details.