Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Integrated Identification and Quantification Error Probabilities for Shotgun Proteomics.

Literature DB >> 30482846

Integrated Identification and Quantification Error Probabilities for Shotgun Proteomics.

Abstract

Protein quantification by label-free shotgun proteomics experiments is plagued by a multitude of error sources. Typical pipelines for identifying differential proteins use intermediate filters to control the error rate. However, they often ignore certain error sources and, moreover, regard filtered lists as completely correct in subsequent steps. These two indiscretions can easily lead to a loss of control of the false discovery rate (FDR). We propose a probabilistic graphical model, Triqler, that propagates error information through all steps, employing distributions in favor of point estimates, most notably for missing value imputation. The model outputs posterior probabilities for fold changes between treatment groups, highlighting uncertainty rather than hiding it. We analyzed 3 engineered data sets and achieved FDR control and high sensitivity, even for truly absent proteins. In a bladder cancer clinical data set we discovered 35 proteins at 5% FDR, whereas the original study discovered 1 and MaxQuant/Perseus 4 proteins at this threshold. Compellingly, these 35 proteins showed enrichment for functional annotation terms, whereas the top ranked proteins reported by MaxQuant/Perseus showed no enrichment. The model executes in minutes and is freely available at https://pypi.org/project/triqler/.

Entities: Disease

Keywords: Bioinformatics; Bioinformatics software; Biostatistics; Bladder cancer; Label-free quantification; Quantification

Mesh：

Year: 2018 PMID： 30482846 PMCID： PMC6398204 DOI： 10.1074/mcp.RA118.001018

Source DB: PubMed Journal: Mol Cell Proteomics ISSN： 1535-9476 Impact factor: 5.911

40 in total

1. Nonparametric Bayesian evaluation of differential protein quantification.

Authors: Oliver Serang; A Ertugrul Cansizoglu; Lukas Käll; Hanno Steen; Judith A Steen
Journal: J Proteome Res Date: 2013-09-11 Impact factor: 4.466

2. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.

Authors: Da Wei Huang; Brad T Sherman; Richard A Lempicki
Journal: Nat Protoc Date: 2009 Impact factor: 13.491

3. A statistical framework for protein quantitation in bottom-up MS-based proteomics.

Authors: Yuliya Karpievitch; Jeff Stanley; Thomas Taverner; Jianhua Huang; Joshua N Adkins; Charles Ansong; Fred Heffron; Thomas O Metz; Wei-Jun Qian; Hyunjin Yoon; Richard D Smith; Alan R Dabney
Journal: Bioinformatics Date: 2009-06-17 Impact factor: 6.937

4. A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets.

Authors: Mikhail M Savitski; Mathias Wilhelm; Hannes Hahne; Bernhard Kuster; Marcus Bantscheff
Journal: Mol Cell Proteomics Date: 2015-05-17 Impact factor: 5.911

5. moFF: a robust and automated approach to extract peptide ion intensities.

Authors: Andrea Argentini; Ludger J E Goeminne; Kenneth Verheggen; Niels Hulstaert; An Staes; Lieven Clement; Lennart Martens
Journal: Nat Methods Date: 2016-11-29 Impact factor: 28.547

6. Multiple testing corrections in quantitative proteomics: A useful but blunt tool.

Authors: Dana Pascovici; David C L Handler; Jemma X Wu; Paul A Haynes
Journal: Proteomics Date: 2016-09 Impact factor: 3.984

7. Faster SEQUEST searching for peptide identification from tandem mass spectra.

Authors: Benjamin J Diament; William Stafford Noble
Journal: J Proteome Res Date: 2011-07-29 Impact factor: 4.466

8. A Protein Standard That Emulates Homology for the Characterization of Protein Inference Algorithms.

Authors: Matthew The; Fredrik Edfors; Yasset Perez-Riverol; Samuel H Payne; Michael R Hoopmann; Magnus Palmblad; Björn Forsström; Lukas Käll
Journal: J Proteome Res Date: 2018-04-16 Impact factor: 4.466

9. Statistical control of peptide and protein error rates in large-scale targeted data-independent acquisition analyses.

Authors: George Rosenberger; Isabell Bludau; Uwe Schmitt; Moritz Heusel; Christie L Hunter; Yansheng Liu; Michael J MacCoss; Brendan X MacLean; Alexey I Nesvizhskii; Patrick G A Pedrioli; Lukas Reiter; Hannes L Röst; Stephen Tate; Ying S Ting; Ben C Collins; Ruedi Aebersold
Journal: Nat Methods Date: 2017-08-21 Impact factor: 28.547

10. Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0.

Authors: Matthew The; Michael J MacCoss; William S Noble; Lukas Käll
Journal: J Am Soc Mass Spectrom Date: 2016-08-29 Impact factor: 3.109

8 in total

1. Bayesian Confidence Intervals for Multiplexed Proteomics Integrate Ion-statistics with Peptide Quantification Concordance.

Authors: Leonid Peshkin; Meera Gupta; Lillia Ryazanova; Martin Wühr
Journal: Mol Cell Proteomics Date: 2019-07-16 Impact factor: 5.911

2. A Bayesian Null Interval Hypothesis Test Controls False Discovery Rates and Improves Sensitivity in Label-Free Quantitative Proteomics.

Authors: Robert J Millikin; Michael R Shortreed; Mark Scalf; Lloyd M Smith
Journal: J Proteome Res Date: 2020-04-14 Impact factor: 4.466

Integrated Identification and Quantification Error Probabilities for Shotgun Proteomics.

1. Nonparametric Bayesian evaluation of differential protein quantification.

2. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.

3. A statistical framework for protein quantitation in bottom-up MS-based proteomics.

4. A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets.

5. moFF: a robust and automated approach to extract peptide ion intensities.

6. Multiple testing corrections in quantitative proteomics: A useful but blunt tool.

7. Faster SEQUEST searching for peptide identification from tandem mass spectra.

8. A Protein Standard That Emulates Homology for the Characterization of Protein Inference Algorithms.

9. Statistical control of peptide and protein error rates in large-scale targeted data-independent acquisition analyses.

10. Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0.

1. Bayesian Confidence Intervals for Multiplexed Proteomics Integrate Ion-statistics with Peptide Quantification Concordance.

2. A Bayesian Null Interval Hypothesis Test Controls False Discovery Rates and Improves Sensitivity in Label-Free Quantitative Proteomics.

3. Putting Humpty Dumpty Back Together Again: What Does Protein Quantification Mean in Bottom-Up Proteomics?

Review 4. Challenges and Opportunities for Bayesian Statistics in Proteomics.

5. Focus on the spectra that matter by clustering of quantification data in shotgun proteomics.

6. Triqler for MaxQuant: Enhancing Results from MaxQuant by Bayesian Error Propagation and Integration.

7. mokapot: Fast and Flexible Semisupervised Learning for Peptide Detection.

8. Multiple Imputation Approaches Applied to the Missing Value Problem in Bottom-Up Proteomics.