Literature DB >> 30482846

Integrated Identification and Quantification Error Probabilities for Shotgun Proteomics.

Matthew The1, Lukas Käll2.   

Abstract

Protein quantification by label-free shotgun proteomics experiments is plagued by a multitude of error sources. Typical pipelines for identifying differential proteins use intermediate filters to control the error rate. However, they often ignore certain error sources and, moreover, regard filtered lists as completely correct in subsequent steps. These two indiscretions can easily lead to a loss of control of the false discovery rate (FDR). We propose a probabilistic graphical model, Triqler, that propagates error information through all steps, employing distributions in favor of point estimates, most notably for missing value imputation. The model outputs posterior probabilities for fold changes between treatment groups, highlighting uncertainty rather than hiding it. We analyzed 3 engineered data sets and achieved FDR control and high sensitivity, even for truly absent proteins. In a bladder cancer clinical data set we discovered 35 proteins at 5% FDR, whereas the original study discovered 1 and MaxQuant/Perseus 4 proteins at this threshold. Compellingly, these 35 proteins showed enrichment for functional annotation terms, whereas the top ranked proteins reported by MaxQuant/Perseus showed no enrichment. The model executes in minutes and is freely available at https://pypi.org/project/triqler/.
© 2019 The and Käll.

Entities:  

Keywords:  Bioinformatics; Bioinformatics software; Biostatistics; Bladder cancer; Label-free quantification; Quantification

Mesh:

Year:  2018        PMID: 30482846      PMCID: PMC6398204          DOI: 10.1074/mcp.RA118.001018

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


  40 in total

1.  Nonparametric Bayesian evaluation of differential protein quantification.

Authors:  Oliver Serang; A Ertugrul Cansizoglu; Lukas Käll; Hanno Steen; Judith A Steen
Journal:  J Proteome Res       Date:  2013-09-11       Impact factor: 4.466

2.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.

Authors:  Da Wei Huang; Brad T Sherman; Richard A Lempicki
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

3.  A statistical framework for protein quantitation in bottom-up MS-based proteomics.

Authors:  Yuliya Karpievitch; Jeff Stanley; Thomas Taverner; Jianhua Huang; Joshua N Adkins; Charles Ansong; Fred Heffron; Thomas O Metz; Wei-Jun Qian; Hyunjin Yoon; Richard D Smith; Alan R Dabney
Journal:  Bioinformatics       Date:  2009-06-17       Impact factor: 6.937

4.  A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets.

Authors:  Mikhail M Savitski; Mathias Wilhelm; Hannes Hahne; Bernhard Kuster; Marcus Bantscheff
Journal:  Mol Cell Proteomics       Date:  2015-05-17       Impact factor: 5.911

5.  moFF: a robust and automated approach to extract peptide ion intensities.

Authors:  Andrea Argentini; Ludger J E Goeminne; Kenneth Verheggen; Niels Hulstaert; An Staes; Lieven Clement; Lennart Martens
Journal:  Nat Methods       Date:  2016-11-29       Impact factor: 28.547

6.  Multiple testing corrections in quantitative proteomics: A useful but blunt tool.

Authors:  Dana Pascovici; David C L Handler; Jemma X Wu; Paul A Haynes
Journal:  Proteomics       Date:  2016-09       Impact factor: 3.984

7.  Faster SEQUEST searching for peptide identification from tandem mass spectra.

Authors:  Benjamin J Diament; William Stafford Noble
Journal:  J Proteome Res       Date:  2011-07-29       Impact factor: 4.466

8.  A Protein Standard That Emulates Homology for the Characterization of Protein Inference Algorithms.

Authors:  Matthew The; Fredrik Edfors; Yasset Perez-Riverol; Samuel H Payne; Michael R Hoopmann; Magnus Palmblad; Björn Forsström; Lukas Käll
Journal:  J Proteome Res       Date:  2018-04-16       Impact factor: 4.466

9.  Statistical control of peptide and protein error rates in large-scale targeted data-independent acquisition analyses.

Authors:  George Rosenberger; Isabell Bludau; Uwe Schmitt; Moritz Heusel; Christie L Hunter; Yansheng Liu; Michael J MacCoss; Brendan X MacLean; Alexey I Nesvizhskii; Patrick G A Pedrioli; Lukas Reiter; Hannes L Röst; Stephen Tate; Ying S Ting; Ben C Collins; Ruedi Aebersold
Journal:  Nat Methods       Date:  2017-08-21       Impact factor: 28.547

10.  Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0.

Authors:  Matthew The; Michael J MacCoss; William S Noble; Lukas Käll
Journal:  J Am Soc Mass Spectrom       Date:  2016-08-29       Impact factor: 3.109

View more
  8 in total

1.  Bayesian Confidence Intervals for Multiplexed Proteomics Integrate Ion-statistics with Peptide Quantification Concordance.

Authors:  Leonid Peshkin; Meera Gupta; Lillia Ryazanova; Martin Wühr
Journal:  Mol Cell Proteomics       Date:  2019-07-16       Impact factor: 5.911

2.  A Bayesian Null Interval Hypothesis Test Controls False Discovery Rates and Improves Sensitivity in Label-Free Quantitative Proteomics.

Authors:  Robert J Millikin; Michael R Shortreed; Mark Scalf; Lloyd M Smith
Journal:  J Proteome Res       Date:  2020-04-14       Impact factor: 4.466

3.  Putting Humpty Dumpty Back Together Again: What Does Protein Quantification Mean in Bottom-Up Proteomics?

Authors:  Deanna L Plubell; Lukas Käll; Bobbie-Jo Webb-Robertson; Lisa M Bramer; Ashley Ives; Neil L Kelleher; Lloyd M Smith; Thomas J Montine; Christine C Wu; Michael J MacCoss
Journal:  J Proteome Res       Date:  2022-02-27       Impact factor: 4.466

Review 4.  Challenges and Opportunities for Bayesian Statistics in Proteomics.

Authors:  Oliver M Crook; Chun-Wa Chung; Charlotte M Deane
Journal:  J Proteome Res       Date:  2022-03-08       Impact factor: 4.466

5.  Focus on the spectra that matter by clustering of quantification data in shotgun proteomics.

Authors:  Matthew The; Lukas Käll
Journal:  Nat Commun       Date:  2020-06-26       Impact factor: 14.919

6.  Triqler for MaxQuant: Enhancing Results from MaxQuant by Bayesian Error Propagation and Integration.

Authors:  Matthew The; Lukas Käll
Journal:  J Proteome Res       Date:  2021-03-04       Impact factor: 4.466

7.  mokapot: Fast and Flexible Semisupervised Learning for Peptide Detection.

Authors:  William E Fondrie; William S Noble
Journal:  J Proteome Res       Date:  2021-02-17       Impact factor: 5.370

8.  Multiple Imputation Approaches Applied to the Missing Value Problem in Bottom-Up Proteomics.

Authors:  Miranda L Gardner; Michael A Freitas
Journal:  Int J Mol Sci       Date:  2021-09-06       Impact factor: 5.923

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.