| Literature DB >> 18366726 |
Abstract
BACKGROUND: The cooperative interaction between transcription factors has a decisive role in the control of the fate of the eukaryotic cell. Computational approaches for characterizing cooperative transcription factors in yeast, however, are based on different rationales and provide a low overlap between their results. Because the wealth of information contained in protein interaction networks and regulatory networks has proven highly effective in elucidating functional relationships between proteins, we compared different sets of cooperative transcription factor pairs (predicted by four different computational methods) within the frame of those networks.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18366726 PMCID: PMC2315657 DOI: 10.1186/1471-2164-9-137
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Methods under study.
| Proteins that are close in the PIN are likely to be co-regulated by the same TFs. Cooperative TF pairs are identified on the basis of the distance between their common target genes in the PIN (as opposed to the distance between genes controlled by either TF). Subcellular localization data was used to filter the PIN. Functional data was used to refine the distances between target genes. | Cooperative TF pairs, triads and modules. Members of triads and modules were pairwisely decomposed in an all-vs-all fashion. Gene names were transformed to YPD names. TFs not present in the set of 101 TFs common to all methods were excluded. The number of cooperative TF pairs was 45. | |
| Proteins with similar expression profiles are likely to be co-regulated. Cooperative TF pairs are identified on the basis of their influence on the cell-cycle-dependent co-expression of their common target genes. | Significant cooperative TF pairs labeled as significant (PB < 0.001). Gene names were transformed to YPD names and TFs not present in the set of 101 TFs common to all methods were excluded. The number of cooperative TF pairs was 31. | |
| Cooperativity has an influence in the expression level of regulated genes during one or more phases of the cell cycle. First, TFs involved in regulation of the cell cycle are found. Then, TF pairs associated to a target gene more than random expectation are identified. Of these, a cooperative interaction between two TFs is identified based on their influence in the expression level of the target genes regulated by them. | Only pairs labeled as "confident" considered. Gene names were transformed to YPD names and TFs not present in the set of 101 TFs common to all methods were excluded. The number of cooperative TF pairs was 15. | |
| DNA-binding sites of cooperative TFs are likely to co-occur in the target genes. Also, cooperative TF pairs are likely to influence changes in the expression profiles of target genes. This influence was measured by means of a dynamic stochastic model on cell-cycle expression data. The method was also applied to gene expression under H2O2 stress. | Only TF pairs with |
Features of the four methods under study. Abbreviations: TF, transcription factor; PIN, protein interaction network.
Dependence and overlap between the four literature sources.
| 0.0110 | 0.0061 | 0.0117 | ||
| 0.206 | 0.0068 | 0.0122 | ||
| 0.132 | 0.179 | 0.0099 | ||
| 0.197 | 0.222 | 0.196 |
Upper right side: dependence in terms of mutual information between pairs of methods (none of the values was found to be significantly larger than random expectation). Lower left side: overlap in terms of Jaccard coefficient between pairs of methods (all values were found to be significantly larger than random expectation). Diagonal (in bold): number of CTFPs predicted by each method.
Shortest path length in the PIN.
| 2.119 | 2.841 | 1.262·10-5 | 2.967 | 1.574·10-5 | 1.722 | 3.151 | 1.455·10-10 | ||
| 2.269 | 2.841 | 2.622·10-3 | 2.967 | 1.534·10-3 | 1.722 | 3.151 | 3.229·10-6 | ||
| 2.000 | 2.841 | 1.757·10-4 | 2.967 | 3.372·10-4 | 1.722 | 3.151 | 7.860·10-7 | ||
| 2.256 | 2.841 | 2.003·10-5 | 2.967 | 5.341·10-5 | 1.722 | 3.151 | 9.653·10-10 | ||
Shortest path length between cooperative TF pairs in the PIN. The distribution of shortest path lengths between CTFPs predicted by each method was compared to the distributions in the other sets of TF pairs by means of a Mann-Whitney test. The p-value column is in bold type if the distribution of the parameter (in this case, the shortest path length) for a given method is not significantly different to that of the corresponding set (p-value < 0.01).
Modularity in the PIN.
| 0.238 | 0.071 | 6.561·10-6 | 0.110 | 2.048·10-4 | 0.395 | 0.035 | 9.321·10-13 | ||
| 0.186 | 0.071 | 1.582·10-3 | 0.110 | 0.395 | 8.690·10-3 | 0.035 | 2.808·10-8 | ||
| 0.212 | 0.071 | 1.119·10-4 | 0.110 | 1.692·10-3 | 0.395 | 0.035 | 7.146·10-9 | ||
| 0.188 | 0.071 | 7.941·10-8 | 0.110 | 4.160·10-5 | 0.395 | 4.563·10-3 | 0.035 | 2.200·10-16 | |
Modularity of cooperative TF pairs in the PIN. Modularity was measured as topological overlap (see Methods). The distribution of modularity values for the CTFPs predicted by method was compared to distributions in the other sets of TF pairs by means of a Mann-Whitney test. Font styles are as in Table 3.
Shortest path length in the regulatory network.
| 3.731 | 3.970 | 3.292 | 5.000 | 4.380 | |||||
| 3.500 | 3.970 | 3.292 | 5.000 | 4.380 | |||||
| 2.846 | 3.970 | 3.292 | 5.000 | 4.380 | |||||
| 3.258 | 3.970 | 3.292 | 5.000 | 4.380 | 5.292·10-3 | ||||
Shortest path length between cooperative TF pairs in the regulatory network. The distribution of shortest path lengths between the CTFPs predicted by each method was compared to distributions in the other sets of TF pairs by means of a Mann-Whitney test. Font styles are as in Table 3.
In-degree modularity in the regulatory network.
| 0.026 | 0.057 | 0.100 | 0.125 | 0.044 | |||||
| 0.084 | 0.057 | 0.100 | 0.125 | 0.044 | |||||
| 0.083 | 0.057 | 0.100 | 0.125 | 0.044 | |||||
| 0.108 | 0.057 | 0.100 | 0.125 | 0.044 | |||||
In-degree modularity of cooperative TF pairs in the regulatory network. The in-degree of a gene denotes the regulatory control performed upon the expression of that gene. Modularity was measured as topological overlap (see Methods). The distribution of modularity values for the CTFPs predicted by each method was compared to distributions in the other sets of TF pairs by means of a Mann-Whitney test. Font styles are as in Table 3.
Out-degree modularity in the regulatory network
| 0.424 | 0.132 | 1.986·10-11 | 0.318 | 0.590 | 1.992·10-3 | 0.050 | 2.200·10-16 | ||
| 0.314 | 0.132 | 8.341·10-7 | 0.318 | 0.590 | 1.170·10-4 | 0.050 | 3.553·10-15 | ||
| 0.300 | 0.132 | 6.995·10-5 | 0.318 | 0.590 | 1.210·10-3 | 0.050 | 5.072·10-9 | ||
| 0.314 | 0.132 | 3.756·10-12 | 0.318 | 0.590 | 5.027·10-5 | 0.050 | 2.200·10-16 | ||
Out-degree modularity of cooperative TF pairs in the regulatory network. The out-degree of a gene denotes the regulatory control performed by that gene upon the expression of other genes. Modularity was measured as topological overlap (see Methods). The distribution of modularity values for the CTFPs predicted by each method was compared to distributions in the other sets of TF pairs by means of a Mann-Whitney test. Font styles are as in Table 3.
Figure 1Examples of the calculation of the functional similarity score. Transcription factors are represented as TF1, TF2 and TF3. The group of genes regulated by each TF are GTF1 = {A, B, C}, GTF2 = {D, E, F} and GTF3 = {G, D, H, I}. The five different protein functions in this simplified figure are labeled as f...f. The functions are associated to the genes with an arrow. In this example, we calculated the functional similarity score of TF1-TF2, TF1-TF3 and TF3-TF3. The last two examples show how the FS score deals with similar functional profiles.