| Literature DB >> 17962296 |
Nicolas Sierro1, Yuko Makita, Michiel de Hoon, Kenta Nakai.
Abstract
DBTBS, first released in 1999, is a reference database on transcriptional regulation in Bacillus subtilis, summarizing the experimentally characterized transcription factors, their recognition sequences and the genes they regulate. Since the previous release, the original content was extended by the addition of the data contained in 569 new publications, the total of which now reaches 947. The number of B. subtilis promoters annotated in the database was more than doubled to 1475. In addition, 463 experimentally validated B. subtilis operons and their terminators have been included. Given the increase in the number of fully sequenced bacterial genomes, we decided to extend the usability of DBTBS in comparative regulatory genomics. We therefore created a new section on the conservation of the upstream regulatory sequences between homologous genes in 40 Gram-positive bacterial species, as well as on the presence of overrepresented hexameric motifs that may have regulatory functions. DBTBS can be accessed at: http://dbtbs.hgc.jp.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17962296 PMCID: PMC2247474 DOI: 10.1093/nar/gkm910
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Summary of the updated data
| Category | Release 3 | Release 5 |
|---|---|---|
| Referenced publications | 378 | 947 |
| Transcription factors | 114 | 120 |
| Position-specific scoring matrices | 45 | 45 |
| Promoters | 633 | 1475 |
| Regulated operons | 525 | 736 |
| Terminators | 0 | 463 |
The number of items of each category present in release 3 (November 2004) and release 5 (September 2007) of DBTBS are given.
Repartition of the clusters and motifs
| Genus | Bacteria | Genes | Specific sub- groups | % | Specific motifs | % | |
|---|---|---|---|---|---|---|---|
| BAC | 7 | 7450 | 1581 | 79.0 | 5652 | 22.5 | |
| CAB | 1 | 7 | 2 | 100.0 | 0 | 0.0 | |
| CLO | 3 | 271 | 46 | 53.5 | 11 | 0.4 | |
| ENT | 1 | 11 | 1 | 33.3 | 0 | 0.0 | |
| GEO | 1 | 19 | 4 | 100.0 | 0 | 0.0 | |
| LAB | 3 | 299 | 24 | 25.5 | 13 | 0.4 | |
| LAC | 1 | 45 | 7 | 100.0 | 0 | 0.0 | |
| LIS | 2 | 51 | 9 | 64.3 | 0 | 0.0 | |
| MYC | 9 | 298 | 40 | 58.0 | 24 | 0.8 | |
| OCB | 1 | 6 | 1 | 50.0 | 0 | 0.0 | |
| OYP | 1 | 92 | 18 | 85.7 | 0 | 0.0 | |
| STA | 4 | 3435 | 555 | 60.5 | 1516 | 8.3 | |
| STR | 5 | 1573 | 196 | 46.3 | 184 | 1.9 | |
| TAB | 1 | 59 | 10 | 100.0 | 0 | 0.0 |
Genuses are listed followed by the three-letter abbreviation used for cluster names. Bacteria: number of bacteria species used for the given genus; Genes: total number of genes reported in the bacteria of the given genus; Specific subgroups: number of subgroups for which no homologous subgroup is found in other genuses; Specific motifs: number of hexameric motif matrices found to bind exclusively in a given genus. Percentages are given based on the total number of subgroups, respectively motifs, found in a given genus.
Figure 1.Hexameric motif conservation in an upstream intergenic region. The upper part shows the search entry box, with the criteria selected for filtering of the displayed hexameric motifs. The lower part is the resulting figure, showing the conservation profile as a black line and the alignment used to obtain it. The blue line represents the calculated conserved region and the red boxes the found hexameric motifs. A 2×/4× magnifier is available in order to conveniently scan the sequence alignment.