Carol L Ecale Zhou, Marisa W Lam, Jason R Smith, Adam T Zemla, Matthew D Dyer, Thomas A Kuczmarski, Elizabeth A Vitalis, Thomas R Slezak.
Abstract
BACKGROUND: MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data.
DESCRIPTION: MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-source tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from GenBank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the GenBank entry or when identical sequences are identified. Currently 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO.
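The flow described in the abstract (load protein sequences into a relational database, run a set of analysis tools over every sequence, store the results, then query by organism and free text) can be illustrated with a minimal sketch. This is not MannDB's actual schema or code: the table names, column names, function names, and the two toy "tools" below are all hypothetical stand-ins, and SQLite is used only because it ships with Python.

```python
import sqlite3

# Hypothetical schema: table and column names are illustrative, not MannDB's.
SCHEMA = """
CREATE TABLE protein (
    protein_id  INTEGER PRIMARY KEY,
    organism    TEXT NOT NULL,
    description TEXT NOT NULL,
    sequence    TEXT NOT NULL
);
CREATE TABLE analysis (
    protein_id INTEGER NOT NULL REFERENCES protein(protein_id),
    tool       TEXT NOT NULL,
    result     TEXT NOT NULL
);
"""


def sequence_length(seq):
    """Toy stand-in for an analysis tool: report the sequence length."""
    return str(len(seq))


def hydrophobic_fraction(seq):
    """Toy stand-in for an analysis tool: fraction of hydrophobic residues."""
    if not seq:
        return "0.00"
    hydrophobic = set("AILMFWVY")
    return f"{sum(c in hydrophobic for c in seq) / len(seq):.2f}"


# Registry of tools the (hypothetical) pipeline manager runs on each protein.
TOOLS = {
    "length": sequence_length,
    "hydrophobic_fraction": hydrophobic_fraction,
}


def build_db(proteins):
    """Create an in-memory database and insert (id, organism, description, sequence) rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO protein VALUES (?, ?, ?, ?)", proteins)
    conn.commit()
    return conn


def run_pipeline(conn):
    """Run every registered tool on every protein and store one result row per pair."""
    rows = conn.execute("SELECT protein_id, sequence FROM protein").fetchall()
    for protein_id, seq in rows:
        for name, tool in TOOLS.items():
            conn.execute(
                "INSERT INTO analysis VALUES (?, ?, ?)",
                (protein_id, name, tool(seq)),
            )
    conn.commit()


def search(conn, organism, text):
    """Structured (organism) plus free-text (description) search."""
    return conn.execute(
        "SELECT protein_id, description FROM protein "
        "WHERE organism = ? AND description LIKE ? ORDER BY protein_id",
        (organism, f"%{text}%"),
    ).fetchall()
```

With made-up records, `build_db([(1, "Organism A", "putative toxin subunit", "MAIL")])` followed by `run_pipeline(conn)` stores one `analysis` row per tool for that protein, and `search(conn, "Organism A", "toxin")` returns the matching protein. Keeping tool results in a separate table keyed by protein and tool name is one simple way to let new tools be added without changing the schema.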
Year: 2006 PMID: 17044936 PMCID: PMC1622758 DOI: 10.1186/1471-2105-7-459
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1. Data flow diagram for the MannDB sequence analysis pipeline. External data sources (yellow) are downloaded into MannDB. Software systems (lavender boxes) process and enable display of data. The MannDB pipeline manager controls execution of open-source tools (ovals) and BLAST against MvirDB (green oval).
Figure 2. MannDB database query and browser sample web pages. In this example, the user has selected the Campylobacter jejuni proteome (left), entered the free text "toxin" (top oval), and checked the MvirDB homology checkbox (bottom oval), resulting in 3 database hits (top right). Selecting single chain protein id 64721 (top right, oval), followed by the "cross-reference" checkbox (middle right, oval), brings up a report page (bottom right) displaying the MvirDB cross-reference link (oval).