| Literature DB >> 16381951 |
Thomas McLaughlin1, Jennifer A Siepen, Julian Selley, Jennifer A Lynch, King Wai Lau, Hujun Yin, Simon J Gaskell, Simon J Hubbard.
Abstract
Proteome science relies on bioinformatics tools to characterize proteins via their proteolytic peptides which are identified via characteristic mass spectra generated after their ions undergo fragmentation in the gas phase within the mass spectrometer. The resulting secondary ion mass spectra are compared with protein sequence databases in order to identify the amino acid sequence. Although these search tools (e.g. SEQUEST, Mascot, X!Tandem, Phenyx) are frequently successful, much is still not understood about the amino acid sequence patterns which promote/protect particular fragmentation pathways, and hence lead to the presence/absence of particular ions from different ion series. In order to advance this area, we have developed a database, PepSeeker (http://nwsr.smith.man.ac.uk/pepseeker), which captures this peptide identification and ion information from proteome experiments. The database currently contains >185,000 peptides and associated database search information. Users may query this resource to retrieve peptide, protein and spectral information based on protein or peptide information, including the amino acid sequence itself represented by regular expressions coupled with ion series information. We believe this database will be useful to proteome researchers wishing to understand gas phase peptide ion chemistry in order to improve peptide identification strategies. Questions can be addressed to j.selley@manchester.ac.uk.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16381951 PMCID: PMC1347429 DOI: 10.1093/nar/gkj066
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1PepSeeker database scheme, showing the relationship between tables.
Pepseeker database statistics
| Viewable spectra | 1 397 159 |
| Proteins | 49 537 |
| Peptides (total) | 186 873 |
| Unique peptides | 47 732 |
| Average peptide length | 11.6 amino acids |
| Range of peptide lengths | 3–66 amino acids |
Figure 2Screen-shot of the PepSeeker front-end, showing an example of the navigation from a simple ion search to a list of the matching peptides through to a graphical representation of the spectra and associated ion information. An example PepSeeker query is shown searching for all peptides within the database containing the sequence PPPP. The first window is the query entry, the second window is the output and the third window displays the spectrum and table associated to the peptide SQGPPPPGKPQGPPPQGGSK.