Literature DB >> 21391616

On using samples of known protein content to assess the statistical calibration of scores assigned to peptide-spectrum matches in shotgun proteomics.

Viktor Granholm1, William Stafford Noble, Lukas Käll.   

Abstract

In shotgun proteomics, the quality of a hypothesized match between an observed spectrum and a peptide sequence is quantified by a score function. Because the score function lies at the heart of any peptide identification pipeline, this function greatly affects the final results of a proteomics assay. Consequently, valid statistical methods for assessing the quality of a given score function are extremely important. Previously, several research groups have used samples of known protein composition to assess the quality of a given score function. We demonstrate that this approach is problematic, because the outcome can depend on factors other than the score function itself. We then propose an alternative use of the same type of data to validate a score function. The central idea of our approach is that database matches that are not explained by any protein in the purified sample comprise a robust representation of incorrect matches. We apply our alternative assessment scheme to several commonly used score functions, and we show that our approach generates a reproducible measure of the calibration of a given peptide identification method. Furthermore, we show how our quality test can be useful in the development of novel score functions.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21391616      PMCID: PMC3268674          DOI: 10.1021/pr1012619

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  21 in total

1.  Qscore: an algorithm for evaluating SEQUEST database search results.

Authors:  Roger E Moore; Mary K Young; Terry D Lee
Journal:  J Am Soc Mass Spectrom       Date:  2002-04       Impact factor: 3.109

2.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

3.  TANDEM: matching proteins with tandem mass spectra.

Authors:  Robertson Craig; Ronald C Beavis
Journal:  Bioinformatics       Date:  2004-02-19       Impact factor: 6.937

4.  The generating function of CID, ETD, and CID/ETD pairs of tandem mass spectra: applications to database search.

Authors:  Sangtae Kim; Nikolai Mischerikow; Nuno Bandeira; J Daniel Navarro; Louis Wich; Shabaz Mohammed; Albert J R Heck; Pavel A Pevzner
Journal:  Mol Cell Proteomics       Date:  2010-09-09       Impact factor: 5.911

5.  Prediction of error associated with false-positive rate determination for peptide identification in large-scale proteomics experiments using a combined reverse and forward peptide sequence database strategy.

Authors:  Edward L Huttlin; Adrian D Hegeman; Amy C Harms; Michael R Sussman
Journal:  J Proteome Res       Date:  2007-01       Impact factor: 4.466

6.  Rapid and accurate peptide identification from tandem mass spectra.

Authors:  Christopher Y Park; Aaron A Klammer; Lukas Käll; Michael J MacCoss; William S Noble
Journal:  J Proteome Res       Date:  2008-05-28       Impact factor: 4.466

7.  The standard protein mix database: a diverse data set to assist in the production of improved Peptide and protein identification software tools.

Authors:  John Klimek; James S Eddes; Laura Hohmann; Jennifer Jackson; Amelia Peterson; Simon Letarte; Philip R Gafken; Jonathan E Katz; Parag Mallick; Hookeun Lee; Alexander Schmidt; Reto Ossola; Jimmy K Eng; Ruedi Aebersold; Daniel B Martin
Journal:  J Proteome Res       Date:  2007-08-21       Impact factor: 4.466

8.  False discovery rates of protein identifications: a strike against the two-peptide rule.

Authors:  Nitin Gupta; Pavel A Pevzner
Journal:  J Proteome Res       Date:  2009-09       Impact factor: 4.466

9.  Repeatability and reproducibility in proteomic identifications by liquid chromatography-tandem mass spectrometry.

Authors:  David L Tabb; Lorenzo Vega-Montoto; Paul A Rudnick; Asokan Mulayath Variyath; Amy-Joan L Ham; David M Bunk; Lisa E Kilpatrick; Dean D Billheimer; Ronald K Blackman; Helene L Cardasis; Steven A Carr; Karl R Clauser; Jacob D Jaffe; Kevin A Kowalski; Thomas A Neubert; Fred E Regnier; Birgit Schilling; Tony J Tegeler; Mu Wang; Pei Wang; Jeffrey R Whiteaker; Lisa J Zimmerman; Susan J Fisher; Bradford W Gibson; Christopher R Kinsinger; Mehdi Mesri; Henry Rodriguez; Stephen E Stein; Paul Tempst; Amanda G Paulovich; Daniel C Liebler; Cliff Spiegelman
Journal:  J Proteome Res       Date:  2010-02-05       Impact factor: 4.466

10.  A HUPO test sample study reveals common problems in mass spectrometry-based proteomics.

Authors:  Alexander W Bell; Eric W Deutsch; Catherine E Au; Robert E Kearney; Ron Beavis; Salvatore Sechi; Tommy Nilsson; John J M Bergeron
Journal:  Nat Methods       Date:  2009-06       Impact factor: 28.547

View more
  19 in total

1.  Nonparametric Bayesian evaluation of differential protein quantification.

Authors:  Oliver Serang; A Ertugrul Cansizoglu; Lukas Käll; Hanno Steen; Judith A Steen
Journal:  J Proteome Res       Date:  2013-09-11       Impact factor: 4.466

2.  A non-parametric cutout index for robust evaluation of identified proteins.

Authors:  Oliver Serang; Joao Paulo; Hanno Steen; Judith A Steen
Journal:  Mol Cell Proteomics       Date:  2013-01-04       Impact factor: 5.911

3.  A Markov Chain Monte Carlo Method for Estimating the Statistical Significance of Proteoform Identifications by Top-Down Mass Spectrometry.

Authors:  Qiang Kou; Zhe Wang; Rachele A Lubeckyj; Si Wu; Liangliang Sun; Xiaowen Liu
Journal:  J Proteome Res       Date:  2019-01-28       Impact factor: 4.466

4.  Fast and accurate database searches with MS-GF+Percolator.

Authors:  Viktor Granholm; Sangtae Kim; José C F Navarro; Erik Sjölund; Richard D Smith; Lukas Käll
Journal:  J Proteome Res       Date:  2013-12-23       Impact factor: 4.466

5.  Determining the calibration of confidence estimation procedures for unique peptides in shotgun proteomics.

Authors:  Viktor Granholm; José Fernández Navarro; William Stafford Noble; Lukas Käll
Journal:  J Proteomics       Date:  2012-12-23       Impact factor: 4.044

6.  A Protein Standard That Emulates Homology for the Characterization of Protein Inference Algorithms.

Authors:  Matthew The; Fredrik Edfors; Yasset Perez-Riverol; Samuel H Payne; Michael R Hoopmann; Magnus Palmblad; Björn Forsström; Lukas Käll
Journal:  J Proteome Res       Date:  2018-04-16       Impact factor: 4.466

7.  A cross-validation scheme for machine learning algorithms in shotgun proteomics.

Authors:  Viktor Granholm; William Stafford Noble; Lukas Käll
Journal:  BMC Bioinformatics       Date:  2012-11-05       Impact factor: 3.169

8.  Enhanced peptide identification by electron transfer dissociation using an improved Mascot Percolator.

Authors:  James C Wright; Mark O Collins; Lu Yu; Lukas Käll; Markus Brosch; Jyoti S Choudhary
Journal:  Mol Cell Proteomics       Date:  2012-04-06       Impact factor: 5.911

9.  ProteomeGenerator: A Framework for Comprehensive Proteomics Based on de Novo Transcriptome Assembly and High-Accuracy Peptide Mass Spectral Matching.

Authors:  Paolo Cifani; Avantika Dhabaria; Zining Chen; Akihide Yoshimi; Emily Kawaler; Omar Abdel-Wahab; John T Poirier; Alex Kentsis
Journal:  J Proteome Res       Date:  2018-10-19       Impact factor: 4.466

10.  Binary Classifier for Computing Posterior Error Probabilities in MetaMorpheus.

Authors:  Michael R Shortreed; Robert J Millikin; Lei Liu; Zach Rolfs; Rachel M Miller; Leah V Schaffer; Brian L Frey; Lloyd M Smith
Journal:  J Proteome Res       Date:  2021-03-08       Impact factor: 4.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.