Literature DB >> 21349864

Assigning spectrum-specific P-values to protein identifications by mass spectrometry.

Victor Spirin1, Alexander Shpunt, Jan Seebacher, Marc Gentzel, Andrej Shevchenko, Steven Gygi, Shamil Sunyaev.   

Abstract

MOTIVATION: Although many methods and statistical approaches have been developed for protein identification by mass spectrometry, the problem of accurate assessment of statistical significance of protein identifications remains an open question. The main issues are as follows: (i) statistical significance of inferring peptide from experimental mass spectra must be platform independent and spectrum specific and (ii) individual spectrum matches at the peptide level must be combined into a single statistical measure at the protein level.
RESULTS: We present a method and software to assign statistical significance to protein identifications from search engines for mass spectrometric data. The approach is based on asymptotic theory of order statistics. The parameters of the asymptotic distributions of identification scores are estimated for each spectrum individually. The method relies on new unbiased estimators for parameters of extreme value distribution. The estimated parameters are used to assign a spectrum-specific P-value to each peptide-spectrum match. The protein-level confidence measure combines P-values of peptide-to-spectrum matches.
CONCLUSION: We extensively tested the method using triplicate mouse and yeast high-throughput proteomic experiments. The proposed statistical approach improves the sensitivity of protein identifications without compromising specificity. While the method was primarily designed to work with Mascot, it is platform-independent and is applicable to any search engine which outputs a single score for a peptide-spectrum match. We demonstrate this by testing the method in conjunction with X!Tandem. AVAILABILITY: The software is available for download at ftp://genetics.bwh.harvard.edu/SSPV/. CONTACT: ssunyaev@rics.bwh.harvard.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21349864      PMCID: PMC3072553          DOI: 10.1093/bioinformatics/btr089

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  26 in total

1.  Qscore: an algorithm for evaluating SEQUEST database search results.

Authors:  Roger E Moore; Mary K Young; Terry D Lee
Journal:  J Am Soc Mass Spectrom       Date:  2002-04       Impact factor: 3.109

2.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

3.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification.

Authors:  Jürgen Cox; Matthias Mann
Journal:  Nat Biotechnol       Date:  2008-11-30       Impact factor: 54.908

4.  Improved ranking functions for protein and modification-site identifications.

Authors:  Marshall Bern; David Goldberg
Journal:  J Comput Biol       Date:  2008-09       Impact factor: 1.479

5.  A fast SEQUEST cross correlation algorithm.

Authors:  Jimmy K Eng; Bernd Fischer; Jonas Grossmann; Michael J Maccoss
Journal:  J Proteome Res       Date:  2008-09-06       Impact factor: 4.466

6.  Rapid and accurate peptide identification from tandem mass spectra.

Authors:  Christopher Y Park; Aaron A Klammer; Lukas Käll; Michael J MacCoss; William S Noble
Journal:  J Proteome Res       Date:  2008-05-28       Impact factor: 4.466

7.  Protein identification false discovery rates for very large proteomics data sets generated by tandem mass spectrometry.

Authors:  Lukas Reiter; Manfred Claassen; Sabine P Schrimpf; Marko Jovanovic; Alexander Schmidt; Joachim M Buhmann; Michael O Hengartner; Ruedi Aebersold
Journal:  Mol Cell Proteomics       Date:  2009-07-16       Impact factor: 5.911

8.  Generalized method for probability-based peptide and protein identification from tandem mass spectrometry data and sequence database searching.

Authors:  Antonio Ramos-Fernández; Alberto Paradela; Rosana Navajas; Juan Pablo Albar
Journal:  Mol Cell Proteomics       Date:  2008-05-31       Impact factor: 5.911

9.  Statistical calibration of the SEQUEST XCorr function.

Authors:  Aaron A Klammer; Christopher Y Park; William Stafford Noble
Journal:  J Proteome Res       Date:  2009-04       Impact factor: 4.466

10.  RAId_DbS: peptide identification using database searches with realistic statistics.

Authors:  Gelio Alves; Aleksey Y Ogurtsov; Yi-Kuo Yu
Journal:  Biol Direct       Date:  2007-10-25       Impact factor: 4.540

View more
  15 in total

1.  Averaging Strategy To Reduce Variability in Target-Decoy Estimates of False Discovery Rate.

Authors:  Uri Keich; Kaipo Tamura; William Stafford Noble
Journal:  J Proteome Res       Date:  2019-01-03       Impact factor: 4.466

2.  Mass spectrometry-based protein identification with accurate statistical significance assignment.

Authors:  Gelio Alves; Yi-Kuo Yu
Journal:  Bioinformatics       Date:  2014-10-31       Impact factor: 6.937

3.  MixGF: spectral probabilities for mixture spectra from more than one peptide.

Authors:  Jian Wang; Philip E Bourne; Nuno Bandeira
Journal:  Mol Cell Proteomics       Date:  2014-09-15       Impact factor: 5.911

4.  Mass Spectrometry-based Proteomics and Peptidomics for Systems Biology and Biomarker Discovery.

Authors:  Robert Cunningham; Di Ma; Lingjun Li
Journal:  Front Biol (Beijing)       Date:  2012-08-01

5.  Confidence assignment for mass spectrometry based peptide identifications via the extreme value distribution.

Authors:  Gelio Alves; Yi-Kuo Yu
Journal:  Bioinformatics       Date:  2016-04-29       Impact factor: 6.937

6.  Progressive calibration and averaging for tandem mass spectrometry statistical confidence estimation: Why settle for a single decoy?

Authors:  Uri Keich; William Stafford Noble
Journal:  Res Comput Mol Biol       Date:  2017-04-12

7.  A new approach to evaluating statistical significance of spectral identifications.

Authors:  Hosein Mohimani; Sangtae Kim; Pavel A Pevzner
Journal:  J Proteome Res       Date:  2013-03-08       Impact factor: 4.466

8.  A Protein Standard That Emulates Homology for the Characterization of Protein Inference Algorithms.

Authors:  Matthew The; Fredrik Edfors; Yasset Perez-Riverol; Samuel H Payne; Michael R Hoopmann; Magnus Palmblad; Björn Forsström; Lukas Käll
Journal:  J Proteome Res       Date:  2018-04-16       Impact factor: 4.466

Review 9.  Computational and statistical analysis of protein mass spectrometry data.

Authors:  William Stafford Noble; Michael J MacCoss
Journal:  PLoS Comput Biol       Date:  2012-01-26       Impact factor: 4.475

10.  Tandem Mass Spectrum Identification via Cascaded Search.

Authors:  Attila Kertesz-Farkas; Uri Keich; William Stafford Noble
Journal:  J Proteome Res       Date:  2015-06-30       Impact factor: 4.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.