| Literature DB >> 20487530 |
Guofeng Meng1, Axel Mosig, Martin Vingron.
Abstract
BACKGROUND: Observed co-expression of a group of genes is frequently attributed to co-regulation by shared transcription factors. This assumption has led to the hypothesis that promoters of co-expressed genes should share common regulatory motifs, which forms the basis for numerous computational tools that search for these motifs. While frequently explored for yeast, the validity of the underlying hypothesis has not been assessed systematically in mammals. This demonstrates the need for a systematic and quantitative evaluation to what degree co-expressed genes share over-represented motifs for mammals.Entities:
Mesh:
Year: 2010 PMID: 20487530 PMCID: PMC3098066 DOI: 10.1186/1471-2105-11-267
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The pipeline for over-representation evaluation of TF binding sites. (a) and (b) TF binding sites were described with the matrices in Jaspar Database; (c) experiments with modified TF activity were selected from the ArrayExpress Database and (d) analyzed with computational methods for differentially expressed genes; (e) and (f) over-represented motifs or matrices were predicted using Weeder and oPOSSUM and those motifs (h) or matrices (g) were compared with the target matrices to check whether the binding sites of affected TFs were recovered.
Transcription factors and their Jaspar target matrices.
| Experiment Name | TF Name | ||
|---|---|---|---|
| E-GEOD-10954 | cMyc | bHLH-ZIP | MYC-MAX, MAX, Mycn |
| E-GEOD-11039 | e2f2 | E2F_TDP | E2F1 |
| E-GEOD-11352 | ESRalpha | Nuclear Receptor | ESR1 |
| E-GEOD-11809 | irf1 | TRP-CLUSTER | IRF1 IRF2 |
| E-GEOD-2060 | CREB | bZIP | CREB1, bZIP910, bZIP911 |
| E-GEOD-2192 | PPARg2 | Nuclear Receptor | PPARG-RXRA, PPARG |
| E-GEOD-2527 | Gata-1 | ZN-FINGER, GATA | Gata1 |
| E-GEOD-3126 | HNF4a | NUCLEAR RECEPTOR | HNF4A |
| E-GEOD-3244 | p53 | TP53 | P53 |
| E-GEOD-6077 | nmyc | bHLH-ZIP | Mycn, MYC-MAX, MAX |
| E-GEOD-6487 | Myf | bHLH | Myf |
| E-GEOD-7137 | KLF15 | ZN-FINGER, C2H2 | Klf4 |
| E-GEOD-7219 | NFkappaB2 | REL | NF-kappaB, NFKB1 |
| E-GEOD-7223 | AIbZIP | bZIP | CREB1, bZIP910, bZIP911 |
| E-GEOD-7835 | HiF1 | bHLH | Arnt, Arnt-Ahr |
| E-GEOD-9786 | PPARalpha | Nuclear Receptor | PPARG, PPARG-RXRA |
| E-MEXP-1444 | Cepba | bZIP | Cebpa, Ddit3-Cebpa |
| E-MEXP-634 | Gata-3 | ZN-FINGER, GATA | GATA3 |
| E-GEOD-590 | USF1 | Zipper | USF1 |
| E-GEOD-5800 | Irf6 | TRP-CLUSTER | IRF1 IRF2 |
| E-GEOD-5823 | c-MYC | bHLH-ZIP | Mycn, MYC-MAX, MAX |
| E-GEOD-2624 | NF-kB | REL | NF-kappaB, NFKB1 |
| E-GEOD-3116 | HNF4 | NUCLEAR RECEPTOR | HNF4A |
| E-GEOD-5424 | Foxa2 | Forkhead | FOXF2, FOXD1, FOXC1, FOXL1, Foxq1, Foxd3, Foxa2, FOXI1 |
| E-GEOD-8943 | FOXQ1 | Forkhead | FOXF2, FOXD1, FOXC1, FOXL1, Foxq1, Foxd3, Foxa2, FOXI1 |
| E-GEOD-11557 | Evi-1 | zinc finger | Evi1 |
| E-TABM-43 | TP53 | TP53 | P53 |
| E-GEOD-2815 | Myb | Helix-Turn-Helix | Myb |
| E-GEOD-5475 | PPARI | Nuclear Receptor | PPARG, PPARG-RXRA |
| E-GEOD-6846 | STAT2 | stat | STAT1 |
| E-GEOD-11836 | Nkx3.1 | HOMEO | Nkx2-5 |
| E-MEXP-871 | HMGA2 | - | HMG-1, HMG-IY |
| E-MEXP-1413 | E2F2 | E2F TDP | E2F1 |
Figure 2Over-representation of ESR1 in up-regulated genes. (a) 100 up-regulated genes and (b) 200 up-regulated genes were input into oPOSSUM.
Results for over-representation analysis in 33 experiments
| Experiment | Name. TF | default parameter setting | Con. Most significantly over-representation | ||
|---|---|---|---|---|---|
| Up-regulated genes | Down-regulated genes | Parameter setting | status | ||
| E-GEOD-10954 | cMyc | N | S | 400, down-regulated, 10000 bp | S |
| E-GEOD-11352 | ESRalpha | S | N | 100, up-regulated, 4000 bp | S |
| E-GEOD-11809 | irf1 | S | W | 100, up-regulated, 4000 bp | S |
| E-GEOD-3126 | HNF4a | S | N | 400, up-regulated, 4000 bp | S |
| E-GEOD-6077 | nmyc | S | N | 100, up-regulated, 7000 bp | S |
| E-GEOD-6487 | Myf | S | N | 400, up-regulated, 4000 bp | S |
| E-GEOD-7219 | NFkappaB2 | S | N | 200, up-regulated, 7000 bp | S |
| E-GEOD-7223 | AIbZIP | S | N | 200, up-regulated, 4000 bp | S |
| E-GEOD-7835 | HiF1 | N | S | 400, up-regulated, 7000 bp | S |
| E-MEXP-1444 | Cepba | S | W | 100, up-regulated, 7000 bp | S |
| E-GEOD-2624 | NF-kB | N | S | 200 down-regulated, 2000 bp | S |
| E-GEOD-11557 | Evi-1 | N | S | 200 down-regulated, 2000 bp | S |
| E-GEOD-5424 | Foxa2 | W | N | 300 up-regulated, 7000 bp | S |
| E-TABM-43 | TP53 | W | N | 200 up-regulated, 2000 bp | S |
| E-GEOD-3116 | HNF4 | W | N | 100 up-regulated, 2000 bp | S |
| E-GEOD-2060 | CREB | N | N | 400, up-regulated, 7000 bp | S |
| E-GEOD-3244 | p53 | N | N | 100, up-regulated, 7000 bp | S |
| E-GEOD-9786 | PPARalpha | N | N | 100, down-regulated, 2000 bp | S |
| E-GEOD-5475 | PPARI | N | N | 100 down-regulated, 7000 bp | S |
| E-GEOD-2192 | PPARg2 | W | N | 200, up-regulated, 4000 bp | W |
| E-GEOD-11039 | e2f2 | W | N | 100, up-regulated, 4000 bp | W |
| E-GEOD-590 | USF1 | N | N | 300 up-regulated, 7000 bp | W |
| E-GEOD-5800 | Irf6 | N | N | 100 up-regulated, 4000 bp | W |
| E-GEOD-8943 | FOXQ1 | W | N | 200 up-regulated, 4000 bp | W |
| E-GEOD-5823 | c-MYC | W | W | 300 up-regulated 4000 bp | W |
| E-GEOD-2527 | Gata-1 | N | N | - | - |
| E-GEOD-7137 | KLF15 | N | N | - | - |
| E-MEXP-634 | Gata-3 | N | N | - | - |
| E-GEOD-2815 | Myb | N | N | - | - |
| E-GEOD-6846 | STAT2 | N | N | - | - |
| E-GEOD-11836 | Nkx3.1 | N | N | - | - |
| E-MEXP-871 | HMGA2 | N | N | - | - |
| E-MEXP-1413 | E2F2 | N | N | - | - |
S: significantly over-represented; W: weakly over-represented; N: not over-represented
Figure 3Regulatory motifs predicted by Weeder and their similarity with target matrices.
Figure 4Number of recovered experiments under different similarity percentage.