| Literature DB >> 29741575 |
Tim Kehl1, Lara Schneider1, Kathrin Kattler2, Daniel Stöckel1, Jenny Wegert3, Nico Gerstner1, Nicole Ludwig4, Ute Distler5, Markus Schick6, Ulrich Keller6,7, Stefan Tenzer5, Manfred Gessler3, Jörn Walter2, Andreas Keller1, Norbert Graf8, Eckart Meese4, Hans-Peter Lenhof1.
Abstract
Motivation: Transcriptional regulators play a major role in most biological processes. Alterations in their activities are associated with a variety of diseases and in particular with tumor development and progression. Hence, it is important to assess the effects of deregulated regulators on pathological processes.Entities:
Mesh:
Year: 2018 PMID: 29741575 PMCID: PMC6184769 DOI: 10.1093/bioinformatics/bty372
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.REGGAE workflow. (A) Overexpressed genes (second column) are sorted according to their t-scores (first column). For each gene , the list of regulators is sorted with respect to the absolute values of the corresponding correlation coefficients. The black nodes represent a selected regulator that controls five target genes. (B) Shows the new regulator list L created by sorting the elements column by column. (C) Enrichment analysis (running sum) for the transcriptional regulator marked in black
Fig. 2.Robustness of REGGAE results. (A) Effect of an increasing number of bootstrap replications on the order of regulators in the REGGAE result lists for up-regulated genes. The number of bootstrap samples (x-axis) is plotted against the total number of position changes (y-axis). (B) Venn diagram depicting the overlap of REGGAE results for the five different input lists
Runtime comparison for top 250 up-regulated genes
| Method | Runtime (s) |
|---|---|
| CSA | 450.27 (±78.76) |
| REGGAE | 174.98 (±1.69) |
| REGGAE (without bootstrapping) | 23.40 (±0.36) |
| RIF1 | 23.60 (±0.28) |
| RIF2 | 23.85 (±0.10) |
| TDD | 14.86 (±0.63) |
| TED | 658.20 (±29.80) |
| TFactS | 42.37 (±0.23) |
| TFRank | 116.74 (±4.22) |
Note: Runtimes were obtained on an Intel Core i7-3770 processor.
CSA analysis was conducted using 1 000 000 permutations.
REGGAE analysis was performed using 1000 bootstrap replications.
Fig. 3.Venn diagrams showing the overlap of the different methods with the list generated by REGGAE. All results were calculated based on the aggregated lists of the most up-regulated genes. P-values for the overlaps were calculated using the hypergeometric test
Top five regulators identified by REGGAE in comparison to other approaches
| Regulators | REGGAE | CSA | RIF1 | RIF2 | TDD | TED | TFactS | TFRank |
|---|---|---|---|---|---|---|---|---|
| FOXA1 | 8.34 | |||||||
| GATA3 | 5.16 | |||||||
| ESR1 | −0.10 | |||||||
| MYB | 4.14 | 8.4 | ||||||
| SPDEF | 8.54 | 1.4 | 3.6×10−19(72) |
Note: For REGGAE, CSA and TFactS adjusted P-values are depicted. For RIF1, RIF2 and TFRank, which do not provide P-values, the respective test statistic value is shown. Numbers in parentheses represent the rank in the sorted result list.
Results for perturbation experiments of (A) artificially induced overexpression of MYC in Eµ-Myc mice (B) knock-out experiments of the pluripotency factors NANOG, POU5F1 and SOX2
| A | B | |||
|---|---|---|---|---|
| Method | MYC | NANOG | POU5F1 | SOX2 |
| CSA | 510 | | 510 | | ||
| REGGAE | ||||
| RIF1 | 791 | | 795 | | 285 | 555 | |
| RIF2 | 762 | | 332 | | ||
| TDD | 466 | 492 | 815 | 771 | 822 | 800 | 467 | 523 |
| TED | 208 | 225 | 567 | 501 | 683 | 588 | 682 | 682 |
| TFactS | ||||
| TFRank | 499 | | |||
Note: For all methods ranks in the sorted result lists for up- and down-regulated genes are shown (up | down). Ranks are highlighted in bold if corresponding P-values are statistically significant for methods that provide P-values (CSA, REGGAE, TED and TFactS) or are amongst the top 200 genes for all other methods (RIF1, RIF2, TDD and TFRank).