Literature DB >> 20712337

Efficient marginalization to compute protein posterior probabilities from shotgun mass spectrometry data.

Oliver Serang1, Michael J MacCoss, William Stafford Noble.   

Abstract

The problem of identifying proteins from a shotgun proteomics experiment has not been definitively solved. Identifying the proteins in a sample requires ranking them, ideally with interpretable scores. In particular, "degenerate" peptides, which map to multiple proteins, have made such a ranking difficult to compute. The problem of computing posterior probabilities for the proteins, which can be interpreted as confidence in a protein's presence, has been especially daunting. Previous approaches have either ignored the peptide degeneracy problem completely, addressed it by computing a heuristic set of proteins or heuristic posterior probabilities, or estimated the posterior probabilities with sampling methods. We present a probabilistic model for protein identification in tandem mass spectrometry that recognizes peptide degeneracy. We then introduce graph-transforming algorithms that facilitate efficient computation of protein probabilities, even for large data sets. We evaluate our identification procedure on five different well-characterized data sets and demonstrate our ability to efficiently compute high-quality protein posteriors.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20712337      PMCID: PMC2948606          DOI: 10.1021/pr100594k

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  16 in total

1.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

2.  A Heuristic method for assigning a false-discovery rate for protein identifications from Mascot database search results.

Authors:  D Brent Weatherly; James A Atwood; Todd A Minning; Cameron Cavola; Rick L Tarleton; Ron Orlando
Journal:  Mol Cell Proteomics       Date:  2005-02-09       Impact factor: 5.911

3.  MyriMatch: highly accurate tandem mass spectral peptide identification by multivariate hypergeometric analysis.

Authors:  David L Tabb; Christopher G Fernando; Matthew C Chambers
Journal:  J Proteome Res       Date:  2007-02       Impact factor: 4.466

4.  Proteomic parsimony through bipartite graph analysis improves accuracy and transparency.

Authors:  Bing Zhang; Matthew C Chambers; David L Tabb
Journal:  J Proteome Res       Date:  2007-08-04       Impact factor: 4.466

Review 5.  Analysis and validation of proteomic data generated by tandem mass spectrometry.

Authors:  Alexey I Nesvizhskii; Olga Vitek; Ruedi Aebersold
Journal:  Nat Methods       Date:  2007-10       Impact factor: 28.547

6.  Rapid and accurate peptide identification from tandem mass spectra.

Authors:  Christopher Y Park; Aaron A Klammer; Lukas Käll; Michael J MacCoss; William S Noble
Journal:  J Proteome Res       Date:  2008-05-28       Impact factor: 4.466

7.  Protein identification false discovery rates for very large proteomics data sets generated by tandem mass spectrometry.

Authors:  Lukas Reiter; Manfred Claassen; Sabine P Schrimpf; Marko Jovanovic; Alexander Schmidt; Joachim M Buhmann; Michael O Hengartner; Ruedi Aebersold
Journal:  Mol Cell Proteomics       Date:  2009-07-16       Impact factor: 5.911

8.  EBP, a program for protein identification using multiple tandem mass spectrometry datasets.

Authors:  Thomas S Price; Margaret B Lucitt; Weichen Wu; David J Austin; Angel Pizarro; Anastasia K Yocum; Ian A Blair; Garret A FitzGerald; Tilo Grosser
Journal:  Mol Cell Proteomics       Date:  2006-12-12       Impact factor: 5.911

9.  The standard protein mix database: a diverse data set to assist in the production of improved Peptide and protein identification software tools.

Authors:  John Klimek; James S Eddes; Laura Hohmann; Jennifer Jackson; Amelia Peterson; Simon Letarte; Philip R Gafken; Jonathan E Katz; Parag Mallick; Hookeun Lee; Alexander Schmidt; Reto Ossola; Jimmy K Eng; Ruedi Aebersold; Daniel B Martin
Journal:  J Proteome Res       Date:  2007-08-21       Impact factor: 4.466

10.  IDPicker 2.0: Improved protein assembly with high discrimination peptide identification filtering.

Authors:  Ze-Qiang Ma; Surendra Dasari; Matthew C Chambers; Michael D Litton; Scott M Sobecki; Lisa J Zimmerman; Patrick J Halvey; Birgit Schilling; Penelope M Drake; Bradford W Gibson; David L Tabb
Journal:  J Proteome Res       Date:  2009-08       Impact factor: 4.466

View more
  48 in total

1.  MSAcquisitionSimulator: data-dependent acquisition simulator for LC-MS shotgun proteomics.

Authors:  Dennis Goldfarb; Wei Wang; Michael B Major
Journal:  Bioinformatics       Date:  2015-12-17       Impact factor: 6.937

2.  An Oxygen-Dependent Interaction between FBXL5 and the CIA-Targeting Complex Regulates Iron Homeostasis.

Authors:  Adarsh K Mayank; Vijaya Pandey; Ajay A Vashisht; William D Barshop; Shima Rayatpisheh; Tanu Sharma; Tisha Haque; David N Powers; James A Wohlschlegel
Journal:  Mol Cell       Date:  2019-06-19       Impact factor: 17.970

3.  Statistical approach to protein quantification.

Authors:  Sarah Gerster; Taejoon Kwon; Christina Ludwig; Mariette Matondo; Christine Vogel; Edward M Marcotte; Ruedi Aebersold; Peter Bühlmann
Journal:  Mol Cell Proteomics       Date:  2013-11-19       Impact factor: 5.911

4.  Protein identification problem from a Bayesian point of view.

Authors:  Yong Fuga Li; Randy J Arnold; Predrag Radivojac; Haixu Tang
Journal:  Stat Interface       Date:  2012-01-01       Impact factor: 0.582

Review 5.  A face in the crowd: recognizing peptides through database search.

Authors:  Jimmy K Eng; Brian C Searle; Karl R Clauser; David L Tabb
Journal:  Mol Cell Proteomics       Date:  2011-08-29       Impact factor: 5.911

6.  Concerning the accuracy of Fido and parameter choice.

Authors:  Oliver Serang
Journal:  Bioinformatics       Date:  2012-11-28       Impact factor: 6.937

Review 7.  Inference and validation of protein identifications.

Authors:  Manfred Claassen
Journal:  Mol Cell Proteomics       Date:  2012-08-03       Impact factor: 5.911

8.  A non-parametric cutout index for robust evaluation of identified proteins.

Authors:  Oliver Serang; Joao Paulo; Hanno Steen; Judith A Steen
Journal:  Mol Cell Proteomics       Date:  2013-01-04       Impact factor: 5.911

9.  A review of statistical methods for protein identification using tandem mass spectrometry.

Authors:  Oliver Serang; William Noble
Journal:  Stat Interface       Date:  2012       Impact factor: 0.582

10.  Improved Protein Inference from Multiple Protease Bottom-Up Mass Spectrometry Data.

Authors:  Rachel M Miller; Robert J Millikin; Connor V Hoffmann; Stefan K Solntsev; Gloria M Sheynkman; Michael R Shortreed; Lloyd M Smith
Journal:  J Proteome Res       Date:  2019-08-23       Impact factor: 4.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.