Literature DB >> 22057310

Generic comparison of protein inference engines.

Manfred Claassen1, Lukas Reiter, Michael O Hengartner, Joachim M Buhmann, Ruedi Aebersold.   

Abstract

Protein identifications, instead of peptide-spectrum matches, constitute the biologically relevant result of shotgun proteomics studies. How to appropriately infer and report protein identifications has triggered a still ongoing debate. This debate has so far suffered from the lack of appropriate performance measures that allow us to objectively assess protein inference approaches. This study describes an intuitive, generic and yet formal performance measure and demonstrates how it enables experimentalists to select an optimal protein inference strategy for a given collection of fragment ion spectra. We applied the performance measure to systematically explore the benefit of excluding possibly unreliable protein identifications, such as single-hit wonders. Therefore, we defined a family of protein inference engines by extending a simple inference engine by thousands of pruning variants, each excluding a different specified set of possibly unreliable identifications. We benchmarked these protein inference engines on several data sets representing different proteomes and mass spectrometry platforms. Optimally performing inference engines retained all high confidence spectral evidence, without posterior exclusion of any type of protein identifications. Despite the diversity of studied data sets consistently supporting this rule, other data sets might behave differently. In order to ensure maximal reliable proteome coverage for data sets arising in other studies we advocate abstaining from rigid protein inference rules, such as exclusion of single-hit wonders, and instead consider several protein inference approaches and assess these with respect to the presented performance measure in the specific application context.

Mesh:

Substances:

Year:  2011        PMID: 22057310      PMCID: PMC3322578          DOI: 10.1074/mcp.O110.007088

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


  36 in total

1.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

2.  TANDEM: matching proteins with tandem mass spectra.

Authors:  Robertson Craig; Ronald C Beavis
Journal:  Bioinformatics       Date:  2004-02-19       Impact factor: 6.937

3.  DBParser: web-based software for shotgun proteomic data analyses.

Authors:  Xiaoyu Yang; Vijay Dondeti; Rebecca Dezube; Dawn M Maynard; Lewis Y Geer; Jonathan Epstein; Xiongfong Chen; Sanford P Markey; Jeffrey A Kowalak
Journal:  J Proteome Res       Date:  2004 Sep-Oct       Impact factor: 4.466

4.  Options and considerations when selecting a quantitative proteomics strategy.

Authors:  Bruno Domon; Ruedi Aebersold
Journal:  Nat Biotechnol       Date:  2010-07-09       Impact factor: 54.908

5.  Data management and preliminary data analysis in the pilot phase of the HUPO Plasma Proteome Project.

Authors:  Marcin Adamski; Thomas Blackwell; Rajasree Menon; Lennart Martens; Henning Hermjakob; Chris Taylor; Gilbert S Omenn; David J States
Journal:  Proteomics       Date:  2005-08       Impact factor: 3.984

6.  False discovery rates of protein identifications: a strike against the two-peptide rule.

Authors:  Nitin Gupta; Pavel A Pevzner
Journal:  J Proteome Res       Date:  2009-09       Impact factor: 4.466

7.  Challenges in deriving high-confidence protein identifications from data gathered by a HUPO plasma proteome collaborative study.

Authors:  David J States; Gilbert S Omenn; Thomas W Blackwell; Damian Fermin; Jimmy Eng; David W Speicher; Samir M Hanash
Journal:  Nat Biotechnol       Date:  2006-03       Impact factor: 54.908

8.  Integration with the human genome of peptide sequences obtained by high-throughput mass spectrometry.

Authors:  Frank Desiere; Eric W Deutsch; Alexey I Nesvizhskii; Parag Mallick; Nichole L King; Jimmy K Eng; Alan Aderem; Rose Boyle; Erich Brunner; Samuel Donohoe; Nelson Fausto; Ernst Hafen; Lee Hood; Michael G Katze; Kathleen A Kennedy; Floyd Kregenow; Hookeun Lee; Biaoyang Lin; Dan Martin; Jeffrey A Ranish; David J Rawlings; Lawrence E Samelson; Yuzuru Shiio; Julian D Watts; Bernd Wollscheid; Michael E Wright; Wei Yan; Lihong Yang; Eugene C Yi; Hui Zhang; Ruedi Aebersold
Journal:  Genome Biol       Date:  2004-12-10       Impact factor: 13.583

9.  Proteome coverage prediction with infinite Markov models.

Authors:  Manfred Claassen; Ruedi Aebersold; Joachim M Buhmann
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

10.  Absolute quantification of microbial proteomes at different states by directed mass spectrometry.

Authors:  Alexander Schmidt; Martin Beck; Johan Malmström; Henry Lam; Manfred Claassen; David Campbell; Ruedi Aebersold
Journal:  Mol Syst Biol       Date:  2011-07-19       Impact factor: 11.429

View more
  12 in total

1.  Quantification of Dynamic Protein Interactions and Phosphorylation in LPS Signaling Pathway by SWATH-MS.

Authors:  Xiurong Wu; Daowei Yang; Fu Zhao; Zhang-Hua Yang; Dazheng Wang; Muzhen Qiao; Yuan Fang; Wanyun Li; Rui Wu; Peng He; Yu Cong; Chang'an Chen; Lichen Hu; Yihua Yan; Changchuan Xie; Yaying Wu; Jiahuai Han; Chuan-Qi Zhong
Journal:  Mol Cell Proteomics       Date:  2019-03-08       Impact factor: 5.911

Review 2.  Inference and validation of protein identifications.

Authors:  Manfred Claassen
Journal:  Mol Cell Proteomics       Date:  2012-08-03       Impact factor: 5.911

3.  A non-parametric cutout index for robust evaluation of identified proteins.

Authors:  Oliver Serang; Joao Paulo; Hanno Steen; Judith A Steen
Journal:  Mol Cell Proteomics       Date:  2013-01-04       Impact factor: 5.911

4.  Practical and Efficient Searching in Proteomics: A Cross Engine Comparison.

Authors:  Joao A Paulo
Journal:  Webmedcentral       Date:  2013-10-01

5.  ProteinInferencer: Confident protein identification and multiple experiment comparison for large scale proteomics projects.

Authors:  Yaoyang Zhang; Tao Xu; Bing Shan; Jonathan Hart; Aaron Aslanian; Xuemei Han; Nobel Zong; Haomin Li; Howard Choi; Dong Wang; Lipi Acharya; Lisa Du; Peter K Vogt; Peipei Ping; John R Yates
Journal:  J Proteomics       Date:  2015-07-18       Impact factor: 4.044

Review 6.  Proteomics and the analysis of proteomic data: 2013 overview of current protein-profiling technologies.

Authors:  Can Bruce; Kathryn Stone; Erol Gulcicek; Kenneth Williams
Journal:  Curr Protoc Bioinformatics       Date:  2013-03

Review 7.  Proteomics for systems toxicology.

Authors:  Bjoern Titz; Ashraf Elamin; Florian Martin; Thomas Schneider; Sophie Dijon; Nikolai V Ivanov; Julia Hoeng; Manuel C Peitsch
Journal:  Comput Struct Biotechnol J       Date:  2014-08-27       Impact factor: 7.271

8.  Evaluating the impact of different sequence databases on metaproteome analysis: insights from a lab-assembled microbial mixture.

Authors:  Alessandro Tanca; Antonio Palomba; Massimo Deligios; Tiziana Cubeddu; Cristina Fraumene; Grazia Biosa; Daniela Pagnozzi; Maria Filippa Addis; Sergio Uzzau
Journal:  PLoS One       Date:  2013-12-09       Impact factor: 3.240

9.  Quality Control of Biomedicinal Allergen Products - Highly Complex Isoallergen Composition Challenges Standard MS Database Search and Requires Manual Data Analyses.

Authors:  Jelena Spiric; Anna M Engin; Michael Karas; Andreas Reuter
Journal:  PLoS One       Date:  2015-11-11       Impact factor: 3.240

10.  Proteoform-Specific Insights into Cellular Proteome Regulation.

Authors:  Emma L Norris; Madeleine J Headlam; Keyur A Dave; David D Smith; Alexander Bukreyev; Toshna Singh; Buddhika A Jayakody; Keith J Chappell; Peter L Collins; Jeffrey J Gorman
Journal:  Mol Cell Proteomics       Date:  2016-07-22       Impact factor: 5.911

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.