| Literature DB >> 16916454 |
Srinivas Veerla1, Mattias Höglund.
Abstract
BACKGROUND: The use of global gene expression profiling to identify sets of genes with similar expression patterns is rapidly becoming a widespread approach for understanding biological processes. A logical and systematic approach to study co-expressed genes is to analyze their promoter sequences to identify transcription factors that may be involved in establishing specific profiles and that may be experimentally investigated.Entities:
Mesh:
Year: 2006 PMID: 16916454 PMCID: PMC1560170 DOI: 10.1186/1471-2105-7-384
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Promoter Clustering. [A] Promoter clustering of QTC Cluster 2 from the AML data set and [B] of QTC Cluster 7 from the serum response data set. Promoter clustering was based on TFBS present in the -600/+100 region. [C] Promoter clustering of QTC Cluster 2 from the AML data set and [D] of QTC Cluster 7 from the serum response data set using evolutionarily conserved TFBS. Genes with known orthologs are considered for the analysis.
Significant TFBS of QTC subclusters from the AML data set.
| 1 | 1A | 13 | Significant TFBS but not present in all |
| 1B | 16 | No significant | |
| 2 | 2A | 7 | AP1, CETS1P54, ETS1, HAND1E47, MYOD |
| 2B | 7 | No significant | |
| 2C | 14 | No significant | |
| 3 | 3A | 3 | AHR, AP1, AREB6, ATATA, BARBIE, CAAT, CDPCR3HD, CEBPDELTA, CETS168, CETS1P54, CHOP, CP2, E2F1, ELK1, ETS1, ETS2, FOXD3, FOXJ2, FOXO4, FREAC2, FREAC7, GATA, GATA1, GATA2, GATA3, GFI1, HAND1E47, HEB, HFH3, HFH8, HNF3ALPHA, HNF3B, HNF6, IK1, IPF1, LDSPOLYA, LHX3, LMO2COM, MAZ, MTATA, MYOD, NFY, NKX25, OCT1, PADS, POU1F1, RFX1, S8, SOX9, TAL1BETAE47, TATA, TCF1P, TGIF, TITF1, XFD1, XFD3, ZTA |
| 3B | 12 | No significant | |
| 3C | 6 | CAAT, CMYB, FOXO4 | |
| 41 | - | - | - |
| 5 | 5A | 6 | Significant TFBS but not present in all |
| 5B | 3 | No significant | |
| 5C | 10 | AP1, BARBIE, ER, GATA, GATA1, LMO2COM | |
| 6 | 6A | 8 | Significant TFBS but not present in all |
| 6B | 7 | Significant TFBS but not present in all | |
| 7 | 7A | 3 | FOXO4, MYOD |
| 7B | 6 | No significant | |
| 8 | 8A | 7 | BARBIE, CAAT, CEBPDELTA, ELK1, ETS1, ETS2, FOXJ2, FOXO4, FREAC7, GATA, GATA1, GATA3, GFI1, HFH3, HNF3ALPHA, HNF3B, LMO2COM, TGIF |
| 8B | 6 | No significant | |
| 9 | 9A | 7 | No significant |
| 9B | 5 | CETS168, IK1, MYOD, NRF2 | |
| 10 | 10A | 4 | Significant TFBS but not present in all |
| 10B | 5 | AP1, CETS1P54, ER, FOXO4, GATA1 | |
| 11 | 11A | 5 | No significant |
| 11B | 6 | GATA, IK1, MYOD | |
| 12 | 12A | 4 | GATA1, GATA2 |
| 12B | 7 | Significant TFBS but not present in all | |
| 13 | 13A | 3 | No significant |
| 13B | 5 | CETS168, CMYB, IK1, MYCMAX, NRF2, TFIII |
1 No subclusters formed.
Figure 2Mean size of evolutionarily conserved regions in the gene clusters. The clusters with non-significant binding sites (ns) have mean sizes of evolutionary conserved regions less than 600 bp and clusters with significant binding sites (s) mean sizes greater than 600 bp.
Figure 3Illustration of promoters regions obtained from analyses of the AML data set. [A] Cluster 5C and [B] Cluster 8A. For each gene the 3 kb upstream and 1 kb downstream from the TSS is shown and the positions of significant transcription factor binding sites indicated by a color code. Boxes, exons; green arrows, transcription start sites for the investigated gene; red arrows, transcription start sites of additional genes; red bars, exons of additional genes.
Significant TFBS of PTM subclusters from the AML data set.
| FOXO3A | 1A | 3 | No significant TFBS |
| 1B | 6 | CAAT, CETS1P54, FOXO4 (FOXO1 (4/6) FOXO3 (4/6)) | |
| 1C | 4 | No significant TFBS | |
| MAX | 2A | 12 | Significant TFBS but not present in all |
| 2B | 13 | AP1, BARBIE, GATA, GATA1, HFH3, LMO2COM | |
| SP3 | 3A | 6 | Significant TFBS but not present in all |
| 3B | 8 | No significant TFBS | |
| 3C | 5 | Significant TFBS but not present in all | |
| 3D | 13 | AP1, BARBIE, CAAT, CEBPDELTA, CETS1P54, FOXO4, GATA, GATA1, LMO2COM, NKX25, S8 | |
| STAT3 | 4A | 6 | MAZ, NRF2 |
| 4B | 6 | AP1, GATA, GATA1, LIMO2COM | |
| TGIF2 | 5A | 7 | Significant TFBS but not present in all |
| 5B | 4 | CETS136, ELK1, ETS1, FOXO4, GATA, GATA1, IK1, LMO2COM, NRF2, RFX1, SREBP1, TGIF, USF |
1 DNA binding transcription factor used for cluster formation.
2 Significant TFBS present in all promoters of the subcluster genes and the TFBS in parenthesis are present in the fraction promoters indicated.
Figure 4Illustration of promoters regions obtained from analyses of the AML data set by seaching with known transcriptions factors. [A] Cluster 1B and [B] cluster 5B. For each gene the 3 kb upstream and 1 kb downstream from the TSS is shown and the positions of significant transcription factor binding sites indicated by a color code. Boxes, exons; green arrows, transcription start sites.
Significant TFBS of QTC subclusters from the serum response data set.
| 11A | 3 | 8 | AP1, ETS1, RFX1 (SRF 3/8) | |
| 11B | 3 | 12 | Significant TFBS but not present in all | |
| 1A | 4 | 14 | No significant TFBS | |
| 1B | 4 | 8 | CEBPDELTA | |
| 1C | 4 | 21 | Significant TFBS but not present in all | |
| 1D | 4 | 17 | No significant TFBS | |
| 6A | 4 | 20 | AP1, FOXO4, GATA1, HFH3, HNF3ALPHA, HNF3B (MYCMAX 12/20)) | |
| 6B | 4 | 7 | ARNT, HIF1, MYCMAX, NMYC, USF | |
| 6C | 4 | 24 | No significant TFBS | MYC, SRF |
| 8A | 4 | 17 | Significant TFBS but not present in all | |
| 8B | 4 | 10 | Significant TFBS but not present in all | |
| 4A | 5 | 11 | No significant TFBS | |
| 4B | 5 | 10 | CETS1P54, ETS1, ETS2, IK1 | |
| 9A | 5 | 6 | AP1, BARBIE (SRF 2/6) | |
| 9B | 5 | 15 | No significant TFBS | |
| 9C | 5 | 13 | Significant TFBS but not present in all (SRF 3/13) | |
| 14A | 5 | 5 | No significant TFBS | |
| 14B | 5 | 6 | Significant TFBS but not present in all | |
| 14C | 5 | 10 | Significant TFBS but not present in all | |
| 12A | 5 | 11 | Significant TFBS but not present in all | |
| 12B | 5 | 15 | RFX1 | |
| 7A | 6 | 10 | AP1, BARBIE, CETS1P54, CREL, FOXJ2, FOXO4, GATA1, HAND1E47, HFH3, HNF3ALPHA, HNF3B, IK1, NKX25, RFX1 (MYCMAX 5/10, TCF4 6/10) | TCF4 |
| 7B | 6 | 8 | No significant TFBS | |
| 7C | 6 | 14 | FOXO4 | |
| 7D | 6 | 11 | Significant TFBS but not present in all | |
| 15A | 6 | 17 | Significant TFBS but not present in all | |
| 15B | 6 | 6 | No significant TFBS | |
| 5A | 7 | 13 | AREB6, IK1 | |
| 5B | 7 | 5 | CAAT, NFY | |
| 2A | 9 | 19 | No significant TFBS | |
| 2B | 9 | 16 | Significant TFBS but not present in all | |
| 2C | 9 | 16 | AP1 (SRF 4/16) | |
| 2D | 9 | 7 | ARNT, HIF1, MYCMAX, NMYC, USF | |
| 10A | 9 | 7 | AP1, BARBIE, CAAT, CDPCR3HD, CABPDELTA, E2F1, ETS1, FOXJ2, FOXO4, GATA, GATA1, GFI1, HFH3, HFH8, HNF3ALPHA, IPF1, LMO2COM, MTATA, MYOD, TGIF (MYCMAX 4/7, TCF4 5/7) | |
| 10B | 9 | 16 | No significant TFBS | |
| 13A | 10 | 10 | GATA1 | |
| 13B | 10 | 12 | No significant TFBS | |
| 3A | 12 | 8 | E2F, E2F1 (E2F1DPIRB (7/8), E2F1DP2 (5/8) E2F4DP2 (5/8) | E2F |
| 3B | 12 | 12 | GATA1 | |
| 16A | -3 | 7 | No significant TFBS | |
| 16B | - | 18 | No significant TFBS | |
| 17A | - | 8 | Significant TFBS but not present in all | |
| 17B | - | 11 | No significant TFBS |
1 TFBS in parenthesis are present in the fraction of promoters indicated.
2 DNA binding transcription factors present in the respective subclusters.
3 Indicates flat profiles.
Figure 5Promoter clustering of QTC 3 genes obtained from analyses the serum response data. [A] Subcluster 3A (green box) including the E2F1 gene (red box). [B] The organization of the promoters of the cluster 3A genes. For each gene the 3 kb upstream and 1 kb downstream from the TSS is shown and the positions of significant transcription factor binding sites indicated by a color code. Boxes, exons; green arrows, transcription start sites for the investigated gene; red arrows, transcription start sites of additional genes; red bars, exons of additional genes.
Figure 6Illustration of QTC 6A gene promoters obtained from the serum response data. For each gene the 3 kb upstream and 1 kb downstream from the TSS is shown and the positions of significant transcription factor binding sites indicated by a color code. Boxes, exons; green arrows, transcription start sites for the investigated gene; red arrows, transcription start sites of additional genes; red bars, exons of additional genes.
Figure 7A tentative consensus . The 3 kb upstream and 1 kb downstream from the TSS is shown and the positions of significant transcription factor binding sites indicated by color codes. Boxes, exons; green arrows, transcription start sites.