| Literature DB >> 25521604 |
Fu-Jou Lai, Hong-Tsun Chang, Yueh-Min Huang, Wei-Sheng Wu.
Abstract
BACKGROUND: Eukaryotic transcriptional regulation is known to be highly connected through the networks of cooperative transcription factors (TFs). Measuring the cooperativity of TFs is helpful for understanding the biological relevance of these TFs in regulating genes. The recent advances in computational techniques led to various predictions of cooperative TF pairs in yeast. As each algorithm integrated different data resources and was developed based on different rationales, it possessed its own merit and claimed outperforming others. However, the claim was prone to subjectivity because each algorithm compared with only a few other algorithms and only used a small set of performance indices for comparison. This motivated us to propose a series of indices to objectively evaluate the prediction performance of existing algorithms. And based on the proposed performance indices, we conducted a comprehensive performance evaluation.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25521604 PMCID: PMC4290732 DOI: 10.1186/1752-0509-8-S4-S9
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Categorization of 14 sets of PCTFPs based on data sources utilized.
| Data sources utilized | Related studies | # of PCTFPs |
|---|---|---|
| ChIP-chip data | Harbison et al. | 94 |
| Datta and Zhao | 25 | |
| Yu et al. | 300 | |
| Chen et al. | 221 | |
| ChIP-chip data and | Banerjee and Zhang | 31 |
| Tsai et al. | 18 | |
| Chang et al. | 55 | |
| He et al. | 30 | |
| ChIP-chip data and | Nagamine et al. | 24 |
| ChIP-chip data and | Yang et al. | 186 |
| ChIP-chip data, | Chuang et al. | 13 |
| ChIP-chip data, | Wang J | 14 |
| ChIP-chip data, | Wang Y et al. | 159 |
| Gene expression data | Elati et al. | 20 |
Figure 1Performance evaluation and comparison using TF-based performance indices. (a) Index 1 is based on PPI partners overlap of a predicted cooperative TF pair (PCTFP); (b) Index 2 is based on the shortest path length of a PCTFP in the PPI network; (c) Index 3 is based on the functional similarity of a PCTFP; (d) Index 4 based on the overlap between a set of PCTFPs and a benchmarked set of 27 known cooperative TF pairs. The blue bar indicates the algorithm which outperforms the others using that performance index.
Figure 2Performance evaluation and comparison using TG-based performance indices. (a) Index 1 is based on the overlap of a PCTFP's target genes; (b) Index 2 is based on the expression coherence of a PCTFP's common target genes; (c) Index 3 is based on the functional coherence of a PCTFP's common target genes; (d) Index 4 is based on the PPI coherence of a PCTFP's common target genes. The blue bar indicates the algorithm which outperforms the others using that performance index.
Ranking scores given to each performance index for each study.
| Banerjee | Chang | Chen | Chuang | Datta | Elati | Harbison | He | Nagamine | Tsai | WangJ | WangY | Yang | Yu | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| TF-based | Idx 1 | 6 | 10 | 12 | 2 | 5 | 1 | 8 | 9 | 7 | 3 | 4 | 11 | 14 | 13 |
| Idx 2 | 9 | 11 | 12 | 6 | 8 | 1 | 7 | 10 | 2 | 3 | 5 | 4 | 14 | 13 | |
| Idx 3 | 2 | 7 | 13 | 4 | 9 | 10 | 1 | 11 | 8 | 6 | 5 | 3 | 14 | 12 | |
| Idx 4 | 6 | 9 | 3 | 10 | 12 | 13 | 2 | 14 | 7 | 8 | 11 | 1 | 4 | 5 | |
| TG-based | Idx 1 | 5 | 4 | 2 | 7 | 12 | 14 | 6 | 11 | 8 | 13 | 10 | 3 | 1 | 9 |
| Idx 2 | 7 | 5 | 11 | 4 | 3 | 14 | 10 | 6 | 9 | 2 | 1 | 12 | 13 | 8 | |
| Idx 3 | 6 | 4 | 9 | 5 | 3 | 14 | 11 | 7 | 10 | 2 | 1 | 12 | 13 | 8 | |
| Idx 4 | 6 | 5 | 11 | 3 | 4 | 14 | 10 | 8 | 7 | 2 | 1 | 12 | 13 | 9 | |
| Sum | 47 | 55 | 73 | 41 | 56 | 81 | 55 | 76 | 58 | 39 | 38 | 58 | 86 | 77 | |
| Ranking | 4 | 5 | 10 | 3 | 7 | 13 | 5 | 11 | 8 | 2 | 1 | 8 | 14 | 12 | |
The comprehensive performance ranking is based on the sum of ranking scores and shown in the last row.
Normalized scores given to each performance index for each study.
| Banerjee | Chang | Chen | Chuang | Datta | Elati | Harbison | He | Nagamine | Tsai | WangJ | WangY | Yang | Yu | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| TF-based | Idx 1 | 0.436 | 0.335 | 0.286 | 0.648 | 0.453 | 1 | 0.408 | 0.338 | 0.424 | 0.546 | 0.493 | 0.305 | 0.197 | 0.265 |
| Idx 2 | 0.719 | 0.596 | 0.582 | 0.834 | 0.737 | 1 | 0.797 | 0.628 | 0.910 | 0.894 | 0.840 | 0.841 | 0.472 | 0.512 | |
| Idx 3 | 0.968 | 0.933 | 0.740 | 0.951 | 0.874 | 0.840 | 1 | 0.837 | 0.927 | 0.939 | 0.947 | 0.967 | 0.420 | 0.786 | |
| Idx 4 | 0.100 | 0.035 | 0.247 | 0.028 | 0.028 | 0 | 0.315 | 0 | 0.100 | 0.047 | 0.028 | 1 | 0.177 | 0.164 | |
| TG-based | Idx 1 | 0.537 | 0.623 | 0.732 | 0.428 | 0.299 | 0.124 | 0.465 | 0.327 | 0.394 | 0.277 | 0.345 | 0.687 | 1 | 0.385 |
| Idx 2 | 0.419 | 0.477 | 0.279 | 0.554 | 0.618 | 0.025 | 0.283 | 0.423 | 0.320 | 0.686 | 1 | 0.275 | 0.226 | 0.335 | |
| Idx 3 | 0.317 | 0.432 | 0.293 | 0.403 | 0.497 | 0.024 | 0.275 | 0.316 | 0.284 | 0.624 | 1 | 0.267 | 0.213 | 0.302 | |
| Idx 4 | 0.505 | 0.546 | 0.407 | 0.561 | 0.554 | 0.048 | 0.411 | 0.465 | 0.473 | 0.747 | 1 | 0.371 | 0.355 | 0.438 | |
| Sum | 4.001 | 3.976 | 3.565 | 4.406 | 4.060 | 3.061 | 3.953 | 3.333 | 3.832 | 4.759 | 5.653 | 4.712 | 3.059 | 3.186 | |
| Ranking | 6 | 7 | 10 | 4 | 5 | 13 | 8 | 11 | 9 | 2 | 1 | 3 | 14 | 12 | |
The comprehensive performance ranking is based on the sum of normalized scores and shown in the last row.
Figure 3Robustness against using mean or median of the scores in each performance index. We compared both final ranking lists that resulted from using mean and median, and obtained (a) correlation coefficient equal to 0.83 (P-value = 2.40e-4 for testing the hypothesis of no correlation) when the sum of the ranking scores is used to summarize the evaluation and (b) correlation coefficient equal to 0.74 (P-value = 2.40e-3) when the sum of normalized scores is used to summarize the evaluation. These results suggest that the comprehensive performance comparison results are robust against using mean or median in each performance index.
Figure 4Robustness against using two different comprehensive ranking measures. We compared both final ranking lists that resulted from using two comprehensive ranking measures (sum of ranking scores and sum of normalized scores), and obtained (a) correlation coefficient equal to 0.90 (P-value = 1.49e-5 for testing the hypothesis of no correlation) when the mean of the scores in each index is used and (b) correlation coefficient equal to 0.93 (P-value = 1.50e-6) when the median of the scores in each index is used. These results suggest that the comprehensive performance comparison results are robust against using the sum of ranking scores or the sum of normalized scores to summarize evaluation.