| Literature DB >> 25559499 |
Fu-Jou Lai, Mei-Huei Jhu, Chia-Chun Chiu, Yueh-Min Huang, Wei-Sheng Wu.
Abstract
BACKGROUND: Transcriptional regulation of gene expression is usually accomplished by multiple interactive transcription factors (TFs). Therefore, it is crucial to understand the precise cooperative interactions among TFs. Various kinds of experimental data including ChIP-chip, TF binding site (TFBS), gene expression, TF knockout and protein-protein interaction data have been used to identify cooperative TF pairs in existing methods. The nucleosome occupancy data is not yet used for this research topic despite that several researches have revealed the association between nucleosomes and TFBSs.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25559499 PMCID: PMC4305981 DOI: 10.1186/1752-0509-8-S5-S2
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1Flowchart of our method. The figure shows a schematic description of the steps used for determining the cooperativity score of two TFs. In the 2 × 2 contingency table, the D.N. stands for depletion of nucleosomes while O.N. stands for occupancy of nucleosomes.
The 27 predicted cooperative TF pairs
| Ranking | TF1 | TF2 | Consistent with existing methods' predictions | Having PPIs? | Co-annotated MIPS functional categories |
|---|---|---|---|---|---|
| 1 | Reb1 | YDR026C | [ | -- | -- |
| 2 | Fkh2 | Fkh1 | [ | Y | mitotic cell cycle and cell cycle control; budding, cell polarity and filament formation |
| 3 | Tye7 | Cbf1 | [ | Y | metabolism |
| 4 | Dig1 | Ste12 | [ | Y | protein folding; pheromone response, mating-type determination, sex-specific proteins; budding, cell polarity and filament formation |
| 5 | Pdr3 | Pdr1 | --- | Y | DNA binding; chemical agent resistance; detoxification |
| 6 | Swi6 | Mbp1 | [ | Y | DNA synthesis and replication; mitotic cell cycle and cell cycle control; protein with binding function or co-factor requirement (structural or catalytic) |
| 7 | Swi6 | Swi4 | [ | Y | G1/S transition of mitotic cell cycle |
| 8 | Dal80 | Gln3 | --- | Y | regulation of nitrogen metabolism |
| 9 | Gln3 | Gat1 | [ | Y | regulation of nitrogen metabolism |
| 10 | Dal80 | Gat1 | --- | -- | regulation of nitrogen metabolism |
| 11 | Pho4 | Cbf1 | --- | Y | metabolism; DNA binding |
| 12 | Pho4 | Tye7 | --- | -- | metabolism |
| 13 | Hap3 | Hap2 | [ | Y | regulation of C-compound and carbohydrate metabolism |
| 14 | Gal80 | Gal4 | --- | Y | regulation of C-compound and carbohydrate metabolism |
| 15 | Met31 | Met32 | [ | Y | metabolism of methionine; metabolism of cysteine; regulation of amino acid metabolism; regulation of nitrogen, sulfur and selenium metabolism; DNA binding |
| 16 | Msn4 | Msn2 | [ | Y | DNA binding; stress response |
| 17 | Rap1 | Fhl1 | [ | Y | --- |
| 18 | Tec1 | Ste12 | [ | Y | budding, cell polarity and filament formation |
| 19 | Swi6 | Stb1 | [ | Y | G1/S transition of mitotic cell cycle |
| 20 | Hap2 | Hap5 | [ | Y | regulation of C-compound and carbohydrate metabolism |
| 21 | Nrg1 | Nrg2 | --- | Y | protein with binding function or co-factor requirement (structural or catalytic); budding, cell polarity and filament formation |
| 22 | Hap3 | Hap5 | [ | Y | regulation of C-compound and carbohydrate metabolism |
| 23 | Swi4 | Stb1 | [ | Y | G1/S transition of mitotic cell cycle |
| 24 | Swi5 | Ace2 | [ | Y | G1 phase of mitotic cell cycle |
| 25 | Sok2 | Phd1 | [ | Y | budding, cell polarity and filament formation |
| 26 | Hap5 | Hap4 | --- | Y | regulation of C-compound and carbohydrate metabolism |
| 27 | Mbp1 | Swi4 | [ | Y | mitotic cell cycle and cell cycle control |
The 11 compared existing methods.
| Existing methods | Data sources used | Method description | Threshold of p-value | Number of predicted cooperative TF pairs |
|---|---|---|---|---|
| Wang et al. (2006) [ | ChIP-chip data, gene expression data, TFBS data | They developed a new framework to infer the combinatorial control of TFs by integrating heterogeneous functional genomic datasets. | 10 −3 | 14 |
| Tsai et al. (2005) [ | ChIP-chip data, gene expression data | They used statistical methods to identify yeast cell cycle TFs and synergistic pairs of TFs. | 10 −5 | 18 |
| Elati et al. (2007) [ | Gene expression data | They adopted a data mining system to learn transcriptional regulation relationship from gene expression data. | 10 −3 | 20 |
| Nagamine et al. (2005) [ | ChIP-chipdata, PPI data | They inferred the cooperative pairs under the assumption that the existence of interaction between two proteins suggests that they contribute to the same or similar biological process. | 10 −3 | 24 |
| Datta and Zhao (2007) [ | ChIP-chip data | They used a log-linear model to study cooperative binding among TFs and developed an Expectation-Maximization algorithm for statistical inferences. | 10 −3 | 25 |
| He et al. (2006) [ | ChIP-chip data, gene expression data | They adopted the microarray expression data to predict the cooperative TF pairs by testing whether the expression of target genes was significantly influenced by their cooperative effect with the multivariate method, ANOVA. | 10 −2 | 30 |
| Banerjee and Zhang (2003) [ | ChIP-chip data, gene expression data | They infer the cooperative pairs under the assumption that a pair of TFs is cooperative if genes regulated by both TFs are more co-expressed than those genes regulated by either TF alone. | 5 × 10 −2 | 31 |
| Chang et al. (2006) [ | ChIP-chip data, gene expression data | They employed a stochastic system model to assess TF cooperativity. | 10 −21 | 55 |
| Yang et al. (2010) [ | ChIP-chip data, TF knockout data | They predicted cooperativity between TFs by identifying the most statistically significant overlap of target genes regulated by two TFs in ChIP-chip data and TF knockout data. | 5 × 10 −3 | 186 |
| Chen et al. (2012) [ | ChIP-chip data | They facilitated identification of interactions between TFs by using motif discovery method when detecting overlapping targets of TFs based on ChIP-chip data. | 10 −3 | 221 |
| Yu et al. (2006) [ | ChIP-chip data | They proposed a method: Motif-PIE, which predicts interacting TF pairs by using a motif discovery procedure. | 10 −8 | 300 |
The table shows the information of 11 existing methods adopted for comparison. The data sources used, a brief description of the method, the p-value threshold and the number of predicted cooperative TF pairs of each method are described in columns 2, 3, 4 and 5.
Figure 2Comparison of our method with 11 existing methods based on three performance indices. This figures shows the comparison results of our methods with 11 existing methods using (a) the performance index 1, (b) the performance index 2, and (c) the performance index 3. Note that T stands for our method, A1 for Banerjee and Zhang' s method, A2 for Chang et al.'s method, A3 for Chen et al.'s method, A4 for Datta and Zhao's method, A5 for Elati et al.'s method, A6 for He et al.'s method, A7 for Nagamne et al.'s method, A8 for Tsai et al.'s method, A9 for Wang's method, A10 for Yang et al.'s method, and A11 for Yu et al.'s method.
The benchmark set of 27 known cooperative TF pairs.
| TF pairs | MIPS complex ID | MIPS complex name |
|---|---|---|
| ARG80-MCM1 | 510.190.120 | ARG complex |
| MET4-MET28 | 510.190.160.10 | Cbf1/Met4/Met28 complex |
| STP4-STP1 | 440.30.30 | tRNA splicing |
| IME1-UME6 | 510.190.200 | Ume6/Ime1 complex |
| HAP5-HAP4 | 510.160 | CCAAT-binding factor complex |
| STP2-STP1 | 440.30.30 | tRNA splicing |
| HAP2-HAP3 | 510.160 | CCAAT-binding factor complex |
| ARG80-ARG81 | 510.190.120 | ARG complex |
| MET4-MET31 | 510.190.160.20 | Met4/Met28/Met31 complex |
| CBF1-MET28 | 510.190.160.10 | Cbf1/Met4/Met28 complex |
| MCM1-ARG81 | 510.190.120 | ARG complex |
| HAP5-HAP2 | 510.160 | CCAAT-binding factor complex |
| HAP4-HAP2 | 510.160 | CCAAT-binding factor complex |
| PIP2-OAF1 | 510.190.100 | OAF complex |
| MET4-CBF1 | 510.190.160.10 | Cbf1/Met4/Met28 complex |
| GCR1-GCR2 | 510.190.90 | GCR complex |
| RTG1-RTG3 | 510.190.130 | RTG complex |
| SWI6-MBP1 | 510.190.70 | MBF complex |
| HAP5-HAP3 | 510.160 | CCAAT-binding factor complex |
| HAP4-HAP3 | 510.160 | CCAAT-binding factor complex |
| GAL3-GAL80 | 510.190.80 | GAL80 complex |
| MET4-MET32 | 510.190.160.30 | Met4/Met28/Met32 complex |
| STP4-STP2 | 440.30.30 | tRNA splicing |
| SWI4-SWI6 | 510.190.60 | SBF complex |
| GAL80-GAL4 | 510.190.80 | GAL80 complex |
| MET32-MET28 | 510.190.160.30 | Met4/Met28/Met32 complex |
| MET28-MET31 | 510.190.160.20 | Met4/Met28/Met31 complex |
The list of 27 known cooperative TF pairs are derived from biochemically well-defined transcriptional complexes within the MIPS complex catalogue.
Figure 3The performance of our method when using different thresholds of the cooperativity score. To check the robustness of our method against different thresholds of cooperativity scores, we tested four different thresholds (125, 120, 110 and 90). The figure shows that no matter which threshold is used, the performance of our method is always the same (i.e. superior to 10 out of 11 existing methods) on the performance index 1. This suggests that our method is robust against different thresholds of the cooperativity score.
Figure 4The performance of our method when using the TFBS data with different qualities. To check the robustness of our method against different TFBS qualities, we tested four different posterior probability thresholds (0.2, 0.3, 0.4 and 0.5). The figure shows that no matter which threshold is used, the performance of our method is always the same (i.e. superior to 10 out of 11 existing methods) on the performance index 1. This suggests that our method is robust against different thresholds of posterior probability to control the TFBS quality.
Figure 5Comparison of our method with 11 existing methods based on the precision and recall when using a benchmark set of 27 known cooperative TF pairs. Here, we use the precision and recall to evaluate the performance of a method. The figure shows that our method outperforms 11 existing methods in the precision and outperforms 10 existing methdos in the recall.
Figure 6The performance of our method with/without using nucleosome occupancy data. To demonstrate that nucleosome occupancy data contributes to the overall improved prediction, we tested our method when nucleosome data are used (denoted as T w/ Nucleosome) and when nucleosome data are not used (denoted as T w/o Nucleosome) on the performance index 1. The figure shows that the −logP of T w/ Nucleosome is higher than that of T w/o Nucleosome, suggesting that nucleosome occupancy data do contribute to the overall improved prediction of our method.
Figure 7A TF cooperativity network. A TF cooperativity network is constructed from our 27 predicted cooperative TF pairs. Nodes represent TFs. A black line between two TFs means that these two TFs are predicted to be a cooperative TF pair but they are not annotated in the same MIPS functional category. A colored line between two TFs means that these two TFs are predicted to be a cooperative TF pair and they are annotated in the same MIPS functional category. The green color stands for metabolism, red for cell cycle, blue for cell type differentiation, orange for cell rescue, defense and virulence.