| Literature DB >> 23185043 |
Natsuko Ichikawa1, Machi Sasagawa, Mika Yamamoto, Hisayuki Komaki, Yumi Yoshida, Shuji Yamazaki, Nobuyuki Fujita.
Abstract
This article introduces DoBISCUIT (Database of BIoSynthesis clusters CUrated and InTegrated, http://www.bio.nite.go.jp/pks/), a literature-based, manually curated database of gene clusters for secondary metabolite biosynthesis. Bacterial secondary metabolites often show pharmacologically important activities and can serve as lead compounds and/or candidates for drug development. Biosynthesis of each secondary metabolite is catalyzed by a number of enzymes, usually encoded by a gene cluster. Although many scientific papers describe such gene clusters, the gene information is not always described in a comprehensive manner and the related information is rarely integrated. DoBISCUIT integrates the latest literature information and provides standardized gene/module/domain descriptions related to the gene clusters.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23185043 PMCID: PMC3531092 DOI: 10.1093/nar/gks1177
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Number of sequences and references registered in DoBISCUIT
| Data type | Number |
|---|---|
| Gene clusters | 72 |
| Collected INSDC sequences | 119 |
| Collected references | 516 |
| Assigned genes | 2006 |
| Description changed from original INSDC entry | 1621 |
| Description accepted | 196 |
| Description not concernedb | 189 |
aBased on the latest database release as of 3 October 2012.
bGenes proved not be involved in biosynthesis process.
Functional categories and number of classified genes
| Functional category | Number of CDS |
|---|---|
| Aglycon biosynthesis | |
| PKS | 310 |
| NRPS | 26 |
| PKS/NRPS hybrid | 5 |
| Other | 54 |
| Biosynthesis, modification and addition of aglycon units and moiety | |
| Extender unit | 80 |
| Starter unit | 53 |
| Sugar unit | 251 |
| Modification | |
| Hydroxylation | 5 |
| Methylation | 38 |
| Reduction | 153 |
| Other modification | 111 |
| Other function | |
| Transcriptional regulator | 141 |
| Translation | 3 |
| Transport | 81 |
| Resistance | 12 |
| Electron carrier | 10 |
| Biosynthesis of butyrolactone | 4 |
| Putative and unknown function | |
| General function prediction | 191 |
| Function unknown | 45 |
| Hypothetical protein | 227 |
Figure 1.(A) Default view of the cluster information page for the alpha-lipomycin biosynthetic gene cluster. Information related to chemical compound, producing organism, gene coordinates, domain/modules, related references and downloadable flat files are displayed. (B) CDS list view of the alpha-lipomycin biosynthetic cluster. The list page is reached via the cluster information page by clicking ‘CDS list’ tab. Relative coordinate of CDSs in the biosynthetic cluster, annotated gene descriptions and functional categories are listed.
Figure 2.Default view of the CDS information page for a gene in the tautomycetin biosynthetic cluster. A hyperlink to the CDS information page is provided on the cluster information page and the CDS list page. Manual curation results, such as annotated product name, original product name, gene name and functional category are displayed in the annotation section. Experimental evidence for this gene is also provided in the reference column of the annotation section. Functionally important domains for formation of the polyketide chain are displayed in the PKS/NRPS module section. In the sequence section, different domains are indicated by different colors.
Figure 3.Novelty chart view of the KS domain and the A domain. Number and similarity to known sequences of KS (left) and A (right) domain sequences amplified from NBRC strains are displayed as a bar chart. The width of the horizontal rectangle represents the number, and the color gradient of each rectangle represents the degree of similarity for each NBRC strain shown in the center.