Literature DB >> 21444829

Gapped spectral dictionaries and their applications for database searches of tandem mass spectra.

Kyowon Jeong1, Sangtae Kim, Nuno Bandeira, Pavel A Pevzner.   

Abstract

Generating all plausible de novo interpretations of a peptide tandem mass (MS/MS) spectrum (Spectral Dictionary) and quickly matching them against the database represent a recently emerged alternative approach to peptide identification. However, the sizes of the Spectral Dictionaries quickly grow with the peptide length making their generation impractical for long peptides. We introduce Gapped Spectral Dictionaries (all plausible de novo interpretations with gaps) that can be easily generated for any peptide length thus addressing the limitation of the Spectral Dictionary approach. We show that Gapped Spectral Dictionaries are small thus opening a possibility of using them to speed-up MS/MS searches. Our MS-Gapped-Dictionary algorithm (based on Gapped Spectral Dictionaries) enables proteogenomics applications (such as searches in the six-frame translation of the human genome) that are prohibitively time consuming with existing approaches. MS-Gapped-Dictionary generates gapped peptides that occupy a niche between accurate but short peptide sequence tags and long but inaccurate full length peptide reconstructions. We show that, contrary to conventional wisdom, some high-quality spectra do not have good peptide sequence tags and introduce gapped tags that have advantages over the conventional peptide sequence tags in MS/MS database searches.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21444829      PMCID: PMC3108828          DOI: 10.1074/mcp.M110.002220

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


  31 in total

1.  De novo peptide sequencing via tandem mass spectrometry.

Authors:  V Dancík; T A Addona; K R Clauser; J E Vath; P A Pevzner
Journal:  J Comput Biol       Date:  1999 Fall-Winter       Impact factor: 1.479

2.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

3.  Searching sequence databases via de novo peptide sequencing by tandem mass spectrometry.

Authors:  Richard S Johnson; J Alex Taylor
Journal:  Mol Biotechnol       Date:  2002-11       Impact factor: 2.695

4.  Open mass spectrometry search algorithm.

Authors:  Lewis Y Geer; Sanford P Markey; Jeffrey A Kowalak; Lukas Wagner; Ming Xu; Dawn M Maynard; Xiaoyu Yang; Wenyao Shi; Stephen H Bryant
Journal:  J Proteome Res       Date:  2004 Sep-Oct       Impact factor: 4.466

5.  The generating function of CID, ETD, and CID/ETD pairs of tandem mass spectra: applications to database search.

Authors:  Sangtae Kim; Nikolai Mischerikow; Nuno Bandeira; J Daniel Navarro; Louis Wich; Shabaz Mohammed; Albert J R Heck; Pavel A Pevzner
Journal:  Mol Cell Proteomics       Date:  2010-09-09       Impact factor: 5.911

6.  PepNovo: de novo peptide sequencing via probabilistic network modeling.

Authors:  Ari Frank; Pavel Pevzner
Journal:  Anal Chem       Date:  2005-02-15       Impact factor: 6.986

7.  Lookup peaks: a hybrid of de novo sequencing and database search for protein identification by tandem mass spectrometry.

Authors:  Marshall Bern; Yuhan Cai; David Goldberg
Journal:  Anal Chem       Date:  2007-01-23       Impact factor: 6.986

8.  Spectral profiles, a novel representation of tandem mass spectra and their applications for de novo peptide sequencing and identification.

Authors:  Sangtae Kim; Nuno Bandeira; Pavel A Pevzner
Journal:  Mol Cell Proteomics       Date:  2009-03-02       Impact factor: 5.911

9.  Error-tolerant identification of peptides in sequence databases by peptide sequence tags.

Authors:  M Mann; M Wilm
Journal:  Anal Chem       Date:  1994-12-15       Impact factor: 6.986

10.  Spectral probabilities and generating functions of tandem mass spectra: a strike against decoy databases.

Authors:  Sangtae Kim; Nitin Gupta; Pavel A Pevzner
Journal:  J Proteome Res       Date:  2008-07-03       Impact factor: 4.466

View more
  12 in total

1.  De novo sequencing and homology searching.

Authors:  Bin Ma; Richard Johnson
Journal:  Mol Cell Proteomics       Date:  2011-11-16       Impact factor: 5.911

2.  Speeding up tandem mass spectral identification using indexes.

Authors:  Xiaowen Liu; Alessandro Mammana; Vineet Bafna
Journal:  Bioinformatics       Date:  2012-04-27       Impact factor: 6.937

3.  The generating function of CID, ETD, and CID/ETD pairs of tandem mass spectra: applications to database search.

Authors:  Sangtae Kim; Nikolai Mischerikow; Nuno Bandeira; J Daniel Navarro; Louis Wich; Shabaz Mohammed; Albert J R Heck; Pavel A Pevzner
Journal:  Mol Cell Proteomics       Date:  2010-09-09       Impact factor: 5.911

Review 4.  Current algorithmic solutions for peptide-based proteomics data generation and identification.

Authors:  Michael R Hoopmann; Robert L Moritz
Journal:  Curr Opin Biotechnol       Date:  2012-11-08       Impact factor: 9.740

5.  Systematic Evaluation of Protein Sequence Filtering Algorithms for Proteoform Identification Using Top-Down Mass Spectrometry.

Authors:  Qiang Kou; Si Wu; Xiaowen Liu
Journal:  Proteomics       Date:  2018-02-06       Impact factor: 3.984

6.  Spectrum Identification using a Dynamic Bayesian Network Model of Tandem Mass Spectra.

Authors:  Ajit P Singh; John Halloran; Jeff A Bilmes; Katrin Kirchoff; William S Noble
Journal:  Uncertain Artif Intell       Date:  2012-08

7.  A Spectrum Graph-Based Protein Sequence Filtering Algorithm for Proteoform Identification by Top-Down Mass Spectrometry.

Authors:  Runmin Yang; Daming Zhu; Qiang Kou; Poomima Bhat-Nakshatri; Harikrishna Nakshatri; Si Wu; Xiaowen Liu
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2017-12-18

8.  Computational mass spectrometry-based proteomics.

Authors:  Lukas Käll; Olga Vitek
Journal:  PLoS Comput Biol       Date:  2011-12-01       Impact factor: 4.475

9.  MS-GF+ makes progress towards a universal database search tool for proteomics.

Authors:  Sangtae Kim; Pavel A Pevzner
Journal:  Nat Commun       Date:  2014-10-31       Impact factor: 14.919

10.  UniNovo: a universal tool for de novo peptide sequencing.

Authors:  Kyowon Jeong; Sangtae Kim; Pavel A Pevzner
Journal:  Bioinformatics       Date:  2013-06-12       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.