| Literature DB >> 17559637 |
Wei-Sheng Wu1, Wen-Hsiung Li, Bor-Sen Chen.
Abstract
BACKGROUND: ChIP-chip data, which indicate binding of transcription factors (TFs) to DNA regions in vivo, are widely used to reconstruct transcriptional regulatory networks. However, the binding of a TF to a gene does not necessarily imply regulation. Thus, it is important to develop methods to identify regulatory targets of TFs from ChIP-chip data.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17559637 PMCID: PMC1906835 DOI: 10.1186/1471-2105-8-188
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Classification of the binding targets of a TF into plausible and non-plausible regulatory ones. The numbers of genes in B+, B+R+ and B+R- are shown for each of the nine cell cycle TFs under study. B+R+ is further divided into two subsets depending on whether the gene's expression profile is positively (TlC > 0) or negatively (TlC < 0) correlated with the TF's regulatory profile, possibly with time lags (see Additional file 1 for details).
| TF | |||
| Abf1 | 247 | 144 (85,59) | 103 |
| Ace2 | 81 | 44 (23,21) | 37 |
| Cin5 | 142 | 69 (35,34) | 73 |
| Fkh1 | 133 | 96 (62,34) | 37 |
| Fkh2 | 116 | 90 (60,30) | 26 |
| Rap1 | 147 | 82 (61,21) | 65 |
| Swi4 | 146 | 84 (66,18) | 62 |
| Swi5 | 106 | 42 (32,10) | 64 |
| Swi6 | 144 | 49 (25,24) | 95 |
Figure 1Enrichment in functional annotation for the cell cycle TFs under study. The numbers of significantly enriched MIPS functional categories in B+R+ (blue) and B+R- (brown) for each of the nine cell cycle TFs under study are shown.
Enrichment of cell cycle genes. The proportions of genes that belong to the 793 cell cycle genes identified by Spellman et al. [7] are calculated for B+R+ and B+R-. We then test whether the enrichment of the known cell cycle genes in B+R+ is statistically higher than that in B+R-. The cumulative hypergeometric distribution is used to determine the statistical significance (see the Appendix for details). In most cases (7/9), except for Abf1 and Ace2, the known cell cycle genes are more enriched in B+R+ than in B+R-.
| TF | ( | |||
| Abf1 | 19/144 | 6/103 | 0.0439 | (144,19,103,6) |
| Ace2 | 14/44 | 7/37 | 0.1433 | (44,14,37,7) |
| Cin5 | 24/69 | 11/73 | (69,24,73,11) | |
| Fkh1 | 41/96 | 3/37 | (96,41,37,3) | |
| Fkh2 | 54/90 | 0/26 | (90,54,26,0) | |
| Rap1 | 13/82 | 2/65 | (82,13,65,2) | |
| Swi4 | 60/84 | 15/62 | (84,60,62,15) | |
| Swi5 | 22/42 | 14/64 | (42,22,64,14) | |
| Swi6 | 37/49 | 42/95 | (49,37,95,42) |
Identification of highly co-expressed genes among the regulatory targets of a TF. The expression coherence (EC) of B+R+, Aand Rare calculated. We then test whether the expression coherence of Xis statistically higher than that of B+R-, where X= Aor R. The cumulative hypergeometric distribution is used to assign a p-value for rejecting the null hypothesis EC(X) = EC(B+R+). Only those that have p < 0.001 (i.e., -log10 p > 3) are shown (see the Appendix for details). In addition, we show the most enriched MIPS functional category for each X.
| TF( | |||||
| Abf1(0.15) | A1(0.64;Inf) | A3(0.51;Inf) | R6(0.34;Inf) | ||
| PROTEIN WITH BINDING FUNCTION OR COFACTOR REQUIREMENT | TRANSCRIPTION | BIOGENESIS OF CELLULAR COMPONENTS | |||
| Ace2(0.07) | A3(0.31;5.11) | A4(0.5;3.66) | R3(1;3.55) | ||
| CELL CYCLE AND DNA PROCESSING | REGULATION OF METABOLISM AND PROTEIN FUNCTION | METABOLISM | |||
| Cin5(0.08) | A0(0.73;9.14) | A1(0.43;6.29) | A5(0.61;11.63) | R0(0.76;Inf) | R2(0.47;4.14) |
| ENERGY | CELLULAR TRANSPORT, TRANSPORT FACILITIES AND TRANSPORT ROUTES | METABOLISM | CELLULAR TRANSPORT, TRANSPORT FACILITIES AND TRANSPORT ROUTES | CELLULAR TRANSPORT, TRANSPORT FACILITIES AND TRANSPORT ROUTES | |
| Fkh1(0.12) | A0(0.65;11.29) | A1(0.49;11.03) | A2(0.27;4.18) | ||
| CELL CYCLE AND DNA PROCESSING | CELL TYPE DIFFERENTIATION | CELL CYCLE AND DNA PROCESSING | |||
| Fkh2(0.16) | A0(0.69;Inf) | A1(0.7;Inf) | A2(0.69;11.44) | A3(0.76;8.82) | |
| CELL CYCLE AND DNA PROCESSING | CELL CYCLE AND DNA PROCESSING | CELL TYPE DIFFERENTIATION | PROTEIN FATE | ||
| Rap1(0.11) | A2(0.58;Inf) | A4(0.62;Inf) | A5(1;9.46) | ||
| PROTEIN SYNTHESIS | PROTEIN SYNTHESIS | UNCLASSIFIED PROTEINS | |||
| Swi4(0.2) | A0(0.87;Inf) | A1(0.6;Inf) | A2(0.79;Inf) | A3(0.71;6.19) | |
| CELL CYCLE AND DNA PROCESSING | METABOLISM | CELL CYCLE AND DNA PROCESSING | CELL CYCLE AND DNA PROCESSING | ||
| Swi5(0.17) | A0(1;7.79) | A2(0.86;11.36) | A3(0.64;7.78) | ||
| BIOGENESIS OF CELLULAR COMPONENTS | INTERACTION WITH THE ENVIRONMENT | CELL RESCUE, DEFENSE AND VIRULENCE | |||
| Swi6(0.23) | A0(0.9;10.18) | A6(0.73;4.33) | A7(0.75;8.25) | R2(0.61;4.76) | |
| BIOGENESIS OF CELLULAR COMPONENTS | CELL CYCLE AND DNA PROCESSING | METABOLISM | CELL CYCLE AND DNA PROCESSING | ||
Performance comparison of TRIA with MA-Network. We tested over-representation of the high-confidence TF binding motif in S1 and S2, where S1 is the set of regulatory targets of a TF that are identified by TRIA but not by MA-Network and S2 is the set of regulatory targets of a TF that are identified by MA-Network but not by TRIA. The proportions of genes, whose promoter regions contain the high-confidence TF binding motif is calculated for S1 and S2. The cumulative hypergeometric distribution is used to determine the statistical significance of over-representation (see the Appendix for details). In four of the five (4/5) cases the high-confidence TF binding motifs are enriched in S1 with p-value < 0.001 but only two of the five (2/5) cases in S2.
| TF | ||||
| Abf1 | 46/62 | 28/56 | ||
| Ace2 | 2/28 | 0.0340 | 2/17 | 0.0132 |
| Fkh2 | 17/47 | 7/18 | ||
| Swi4 | 16/27 | 6/18 | 0.0021 | |
| Swi5 | 9/25 | 7/30 | 0.0171 |
Performance comparison of TRIA with MFA. We tested over-representation of the high-confidence TF binding motif in T1 and T2, where T1 is the set of regulatory targets of a TF that are identified by TRIA but not by MFA and T2 is the set of regulatory targets of a TF that are identified by MFA but not by TRIA. The proportions of genes, whose promoter regions contain the high-confidence TF binding motif is calculated for T1 and T2. The cumulative hypergeometric distribution is used to determine the statistical significance of the over-representation (see the Appendix for details). In six of the eight (6/8) cases the high-confidence TF binding motifs are enriched in T1 with p-value < 0.001 but none of the eight (0/8) cases in T2.
| TF | ||||
| Abf1 | 75/105 | 10/106 | 0.9042 | |
| Ace2 | 1/31 | 0.2782 | 3/35 | 0.0056 |
| Fkh1 | 30/64 | 5/109 | 1.0000 | |
| Fkh2 | 20/49 | 10/100 | 0.2038 | |
| Rap1 | 32/72 | 7/36 | 0.0052 | |
| Swi4 | 28/56 | 2/36 | 0.7981 | |
| Swi5 | 7/26 | 0.0076 | 4/32 | 0.3417 |
| Swi6 | 19/30 | 13/72 | 0.2932 |
Figure 2Enrichment in functional annotation for the stress response TFs under study. The numbers of significantly enriched MIPS functional categories in B+R+ (blue) and B+R- (brown) for each of the 27 amino acid starved stress TFs under study are shown.