| Literature DB >> 17678535 |
Timur Shtatland1, Daniel Guettler, Misha Kossodo, Misha Pivovarov, Ralph Weissleder.
Abstract
BACKGROUND: Peptides are important molecules with diverse biological functions and biomedical uses. To date, there does not exist a single, searchable archive for peptide sequences or associated biological data. Rather, peptide sequences still have to be mined from abstracts and full-length articles, and/or obtained from the fragmented public sources. DESCRIPTION: We have constructed a new database (PepBank), which at the time of writing contains a total of 19,792 individual peptide entries. The database has a web-based user interface with a simple, Google-like search function, advanced text search, and BLAST and Smith-Waterman search capabilities. The major source of peptide sequence data comes from text mining of MEDLINE abstracts. Another component of the database is the peptide sequence data from public sources (ASPD and UniProt). An additional, smaller part of the database is manually curated from sets of full text articles and text mining results. We show the utility of the database in different examples of affinity ligand discovery.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17678535 PMCID: PMC1976427 DOI: 10.1186/1471-2105-8-280
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Database core model. MySQL tables are shown as rectangles. Mandatory attributes are in bold, optional are in italics. Relationships are shown as lines, with the arrows pointing from the primary to the foreign keys, and multiplicities as shown.
Figure 2Web-based user interface of PepBank. Illustration of a typical user workflow. The user enters the query with Quick or Advanced Search. The results are returned in a table sortable in the browser. The user selects the entry or entries of interest. The sequence in the example shown was obtained by text mining and was then manually curated. The score, between 0 and 1, reflects the degree of confidence in the interaction (higher score for more confidence). Manually curated entries receive higher score than entries from automated text mining.