Literature DB >> 22866926

Learning score function parameters for improved spectrum identification in tandem mass spectrometry experiments.

Marina Spivak1, Michael S Bereman, Michael J Maccoss, William Stafford Noble.   

Abstract

The identification of proteins from spectra derived from a tandem mass spectrometry experiment involves several challenges: matching each observed spectrum to a peptide sequence, ranking the resulting collection of peptide-spectrum matches, assigning statistical confidence estimates to the matches, and identifying the proteins. The present work addresses algorithms to rank peptide-spectrum matches. Many of these algorithms, such as PeptideProphet, IDPicker, or Q-ranker, follow a similar methodology that includes representing peptide-spectrum matches as feature vectors and using optimization techniques to rank them. We propose a richer and more flexible feature set representation that is based on the parametrization of the SEQUEST XCorr score and that can be used by all of these algorithms. This extended feature set allows a more effective ranking of the peptide-spectrum matches based on the target-decoy strategy, in comparison to a baseline feature set devoid of these XCorr-based features. Ranking using the extended feature set gives 10-40% improvement in the number of distinct peptide identifications relative to a range of q-value thresholds. While this work is inspired by the model of the theoretical spectrum and the similarity measure between spectra used specifically by SEQUEST, the method itself can be applied to the output of any database search. Further, our approach can be trivially extended beyond XCorr to any linear operator that can serve as similarity score between experimental spectra and peptide sequences.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22866926      PMCID: PMC3436966          DOI: 10.1021/pr300234m

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  23 in total

1.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

2.  TANDEM: matching proteins with tandem mass spectra.

Authors:  Robertson Craig; Ronald C Beavis
Journal:  Bioinformatics       Date:  2004-02-19       Impact factor: 6.937

Review 3.  Assigning significance to peptides identified by tandem mass spectrometry using decoy databases.

Authors:  Lukas Käll; John D Storey; Michael J MacCoss; William Stafford Noble
Journal:  J Proteome Res       Date:  2007-12-08       Impact factor: 4.466

Review 4.  Review of factors that influence the abundance of ions produced in a tandem mass spectrometer and statistical methods for discovering these factors.

Authors:  Sheila J Barton; John C Whittaker
Journal:  Mass Spectrom Rev       Date:  2009 Jan-Feb       Impact factor: 10.946

5.  A fast SEQUEST cross correlation algorithm.

Authors:  Jimmy K Eng; Bernd Fischer; Jonas Grossmann; Michael J Maccoss
Journal:  J Proteome Res       Date:  2008-09-06       Impact factor: 4.466

6.  Rapid and accurate peptide identification from tandem mass spectra.

Authors:  Christopher Y Park; Aaron A Klammer; Lukas Käll; Michael J MacCoss; William S Noble
Journal:  J Proteome Res       Date:  2008-05-28       Impact factor: 4.466

Review 7.  A face in the crowd: recognizing peptides through database search.

Authors:  Jimmy K Eng; Brian C Searle; Karl R Clauser; David L Tabb
Journal:  Mol Cell Proteomics       Date:  2011-08-29       Impact factor: 5.911

8.  Evaluation of front-end higher energy collision-induced dissociation on a benchtop dual-pressure linear ion trap mass spectrometer for shotgun proteomics.

Authors:  Michael S Bereman; Jesse D Canterbury; Jarrett D Egertson; Julie Horner; Philip M Remes; Jae Schwartz; Vlad Zabrouskov; Michael J MacCoss
Journal:  Anal Chem       Date:  2012-01-12       Impact factor: 6.986

9.  Comparison of database search strategies for high precursor mass accuracy MS/MS data.

Authors:  Edward J Hsieh; Michael R Hoopmann; Brendan MacLean; Michael J MacCoss
Journal:  J Proteome Res       Date:  2010-02-05       Impact factor: 4.466

10.  Improvements to the percolator algorithm for Peptide identification from shotgun proteomics data sets.

Authors:  Marina Spivak; Jason Weston; Léon Bottou; Lukas Käll; William Stafford Noble
Journal:  J Proteome Res       Date:  2009-07       Impact factor: 4.466

View more
  3 in total

1.  Data-independent-acquisition mass spectrometry for identification of targeted-peptide site-specific modifications.

Authors:  Caleb J Porter; Michael S Bereman
Journal:  Anal Bioanal Chem       Date:  2015-06-24       Impact factor: 4.142

2.  COOH-terminal collagen Q (COLQ) mutants causing human deficiency of endplate acetylcholinesterase impair the interaction of ColQ with proteins of the basal lamina.

Authors:  Juan Arredondo; Marian Lara; Fiona Ng; Danielle A Gochez; Diana C Lee; Stephanie P Logia; Joanna Nguyen; Ricardo A Maselli
Journal:  Hum Genet       Date:  2013-11-27       Impact factor: 4.132

3.  A cost-sensitive online learning method for peptide identification.

Authors:  Xijun Liang; Zhonghang Xia; Ling Jian; Yongxiang Wang; Xinnan Niu; Andrew J Link
Journal:  BMC Genomics       Date:  2020-04-25       Impact factor: 3.969

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.