| Literature DB >> 15137914 |
Davide Corà1, Ferdinando Di Cunto, Paolo Provero, Lorenzo Silengo, Michele Caselle.
Abstract
BACKGROUND: Transcriptional regulation is a key mechanism in the functioning of the cell, and is mostly effected through transcription factors binding to specific recognition motifs located upstream of the coding region of the regulated gene. The computational identification of such motifs is made easier by the fact that they often appear several times in the upstream region of the regulated genes, so that the number of occurrences of relevant motifs is often significantly larger than expected by pure chance.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15137914 PMCID: PMC449910 DOI: 10.1186/1471-2105-5-57
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Significant motifs associated to nucleolar proteins implicated in ribosome biogenesis. The columns titled "C", "F" and "P" correspond to the three branches of the Gene Ontology: cellular component, molecular function and biological process respectively.
| motif | C | F | P |
| nucleolus | - | ribosome biogenesis | |
| nucleolus | - | ribosome biogenesis | |
| nucleolus | - | ribosome biogenesis | |
| - | - | ribosome biogenesis | |
| - | - | ribosome biogenesis and assembly | |
| - | - | ribosome biogenesis and assembly | |
| - | - | ribosome biogenesis and assembly | |
| nucleolus | - | ribosome biogenesis and assembly | |
| nucleolus | - | ribosome biogenesis and assembly | |
| nucleolus | - | ribosome biogenesis | |
| nucleolus | - | ribosome biogenesis | |
| - | - | ribosome biogenesis | |
| nucleolus | - | ribosome biogenesis | |
| nucleolus | - | rRNA processing | |
| nucleolus | - | ribosome biogenesis | |
| nucleolus complex | - | transcription from Pol I promoter | |
| nucleolus | - | ribosome biogenesis | |
| nucleolus | - | ribosome biogenesis | |
| nucleolus | - | ribosome biogenesis | |
| nucleolus | - | 35S primary transcript processing | |
| small nucleolar ribonucleoprotein complex | - | 35S primary transcript processing | |
Significant motifs associated to ribosomal proteins. Here and in the following tables the column "TF" reports the transcription factors studied in Ref. [5] for which the intersection between the experimentally found targets of regulation and our sets is statistically significant.
| motif | C | F | P | TF |
| cytosolic ribosome (sensu Eukarya) | structural constituent of ribosome | - | FHL1 RAP1 | |
| cytosolic ribosome (sensu Eukarya) | - | - | FHL1 RAP1 | |
| cytosolic ribosome (sensu Eukarya) | - | - | FHL1 RAP1 | |
| cytosolic ribosome (sensu Eukarya) | - | - | FHL1 RAP1 | |
| large ribosomal subunit | - | - | FHL1 | |
| cytosolic ribosome (sensu Eukarya) | - | - | FHL1 RAP1 | |
| cytosolic ribosome (sensu Eukarya) | structural constituent of ribosome | - | FHL1 | |
| cytosolic ribosome (sensu Eukarya) | structural constituent of ribosome | protein biosynthesis | FHL1 RAP1 | |
| cytosolic ribosome (sensu Eukarya) | - | - | FHL1 | |
| cytosolic ribosome (sensu Eukarya) | - | - | FHL1 RAP1 | |
| cytosol | - | - | FHL1 RAP1 | |
| cytosolic ribosome (sensu Eukarya) | - | - | FHL1 | |
Significant motifs associated to DNA replication and nucleosome.
| motif | C | F | P | TF |
| - | - | DNA replication | MBP1 SWI6 | |
| - | - | DNA dependent DNA replication | ||
| - | - | DNA replication and chromosome cycle | MBP1 SWI6 | |
| - | - | DNA replication | MBP1 SWI6 | |
| - | - | mitotic sister chromatid cohesion | MBP1 SWI4 SWI6 | |
| replication fork | - | DNA metabolism | MBP1 SWI4 SWI6 | |
| - | - | DNA replication | MBP1 SWI6 | |
| - | - | mitotic sister chromatid cohesion | MBP1 SWI6 | |
| - | - | DNA metabolism | MBP1 | |
| nucleosome | - | - | HIR1 HIR2 | |
| nucleosome | - | - | HIR1 HIR2 | |
| nucleosome | - | chromatin assembly/disassembly | HIR1 HIR2 | |
| nucleosome | - | - | MBP1 SWI4 SWI6 | |
| nucleosome | - | - | ||
| nucleosome | - | - | ||
Other motifs with significant intersection with ChIP data.
| motif | C | F | P | TF |
| - | - | amino acid metabolism | GCN4 | |
| - | - | non-protein amino acid metabolism | GCN4 | |
| - | - | amino acid metabolism | GCN4 | |
| - | - | amino acid metabolism | GCN4 | |
| - | - | amino acid biosynthesis | GCN4 | |
| - | - | amino acid metabolism | ||
| - | - | amino acid biosynthesis | BAS1 GCN4 | |
| - | - | sexual reproduction | DIG1 STE12 | |
| - | - | sexual reproduction | DIG1 STE12 | |
| - | - | sulfur amino acid transport | MET4 | |
| - | - | sulfur metabolism | MET4 MET31 | |
| - | heat shock protein | protein folding | HSF1 | |
Motifs associated to siderophore transport and tricarboxylic acid cycle.
| motif | C | F | P |
| - | - | siderophore transport | |
| - | - | siderophore transport | |
| - | - | siderophore transport | |
| - | - | siderophore transport | |
| - | - | siderophore transport | |
| - | heavy metal ion porter | siderophore transport | |
| cell wall (sensu Fungi) | - | - | |
| - | - | tricarboxylic acid cycle | |
| - | - | tricarboxylic acid cycle | |
| - | - | tricarboxylic acid cycle | |
| - | - | tricarboxylic acid cycle | |
Other significant motifs associated to transcription factors not studied in Ref. [5].
| motif | C | F | P |
| alpha, alpha-trehalose-phosphate synthase complex (UDP-forming) | - | - | |
| alpha, alpha-trehalose-phosphate synthase complex (UDP-forming) | - | - | |
| 26S proteasome | threonine endopeptidase | - | |
| 26S proteasome | peptidase | - | |
| 26S proteasome | proteasome endopeptidase | - | |
| 26S proteasome | proteasome endopeptidase | - | |
| spindle | - | - | |
| - | - | steroid metabolism | |
Genome prevalence of the consensus sequences constructed by aligning the motifs found significant by the algorithm, and functional characterization of the corresponding genes. The second column contains the total number of genes in the yeast genome in which the consensus sequence is found at least once in the upstream region. The third column reports one of the annotation terms to which the consensus sequence is associated, and the number of genes annotated to the term among the ones containing the consensus sequence.
| Consensus sequence | Total genes | Annotated genes |
| GATGAGATAGCT | 17 | nucleolus (8) |
| AAAAATTTTC | 233 | nucleolus (37) |
| MCSCMTACAY | 78 | cytosolic ribosome (20) |
| ARACGCGTMG | 15 | DNA replication and chromosome cycle (2) |
| AAMYGCGAWCG | 6 | nucleosome (0) |
| GATGAGTCAY | 5 | amino-acid metabolism (3) |
| TGAAACA | 560 | sexual reproduction (26) |
| AGGGTGCA | 40 | siderophore transport (5) |
| CGGCGCCGAG | 5 | tricarboxylic acid cycle (3) |
| ACCCCT | 434 | alpha, alpha-trehalose phosphate synthase complex (UDP-forming) (2) |
| CCGGTGGC | 17 | 26S proteasome (6) |
Candidate targets of regulation by the UPC2p and ECM22p transcription factors.
| ATF2 | (YGR177C) |
| ECM22 | (YLR228C) |
| ERG3 | (YLR056W) |
| ERG4 | (YGL012W) |
| ERG5 | (YMR015C) |
| ERG7 | (YHR072W) |
| ERG26 | (YGL001C) |
Candidate targets of regulation by the Hcm1p transcription factor.
| MPS1 | (YDL028C) |
| CIN8 | (YEL061C) |
| PDS1 | (YDR113C) |
| SPC98 | (YNL126W) |
| VIK1 | (YPL253C) |
| SPC25 | (YER018C) |
| ESP1 | (YGR098C) |
| STU2 | (YLR045C) |
| SLI15 | (YBR156C) |
Figure 1Location of the motifs belonging to the consensus GATGAGATGAGCT with respect to the translation and transcription start sites for 6 genes for which the latter is known. The binding sites are denoted by rectangles above or below the line depending on whether the consensus sequence is read on the Crick or Watson strand respectively. The vertical bar is the transcription start site, as given in Ref. [20].