Literature DB >> 26257019

Solution to Statistical Challenges in Proteomics Is More Statistics, Not Less.

Oliver Serang1,2, Lukas Käll3.   

Abstract

In any high-throughput scientific study, it is often essential to estimate the percent of findings that are actually incorrect. This percentage is called the false discovery rate (abbreviated "FDR"), and it is an invariant (albeit, often unknown) quantity for any well-formed study. In proteomics, it has become common practice to incorrectly conflate the protein FDR (the percent of identified proteins that are actually absent) with protein-level target-decoy, a particular method for estimating the protein-level FDR. In this manner, the challenges of one approach have been used as the basis for an argument that the field should abstain from protein-level FDR analysis altogether or even the suggestion that the very notion of a protein FDR is flawed. As we demonstrate in simple but accurate simulations, not only is the protein-level FDR an invariant concept, when analyzing large data sets, the failure to properly acknowledge it or to correct for multiple testing can result in large, unrecognized errors, whereby thousands of absent proteins (and, potentially every protein in the FASTA database being considered) can be incorrectly identified.

Keywords:  false discovery rate (FDR); human proteome; multiple testing; protein identification; simulation; statistics

Mesh:

Substances:

Year:  2015        PMID: 26257019     DOI: 10.1021/acs.jproteome.5b00568

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  15 in total

1.  DeMix-Q: Quantification-Centered Data Processing Workflow.

Authors:  Bo Zhang; Lukas Käll; Roman A Zubarev
Journal:  Mol Cell Proteomics       Date:  2016-01-04       Impact factor: 5.911

2.  Accurate Estimation of Context-Dependent False Discovery Rates in Top-Down Proteomics.

Authors:  Richard D LeDuc; Ryan T Fellers; Bryan P Early; Joseph B Greer; Daniel P Shams; Paul M Thomas; Neil L Kelleher
Journal:  Mol Cell Proteomics       Date:  2019-01-15       Impact factor: 5.911

3.  DecoyPyrat: Fast Non-redundant Hybrid Decoy Sequence Generation for Large Scale Proteomics.

Authors:  James C Wright; Jyoti S Choudhary
Journal:  J Proteomics Bioinform       Date:  2016-06-27

4.  Systematic Errors in Peptide and Protein Identification and Quantification by Modified Peptides.

Authors:  Boris Bogdanow; Henrik Zauber; Matthias Selbach
Journal:  Mol Cell Proteomics       Date:  2016-05-23       Impact factor: 5.911

5.  Testing and Validation of Computational Methods for Mass Spectrometry.

Authors:  Laurent Gatto; Kasper D Hansen; Michael R Hoopmann; Henning Hermjakob; Oliver Kohlbacher; Andreas Beyer
Journal:  J Proteome Res       Date:  2015-11-17       Impact factor: 4.466

6.  A Protein Standard That Emulates Homology for the Characterization of Protein Inference Algorithms.

Authors:  Matthew The; Fredrik Edfors; Yasset Perez-Riverol; Samuel H Payne; Michael R Hoopmann; Magnus Palmblad; Björn Forsström; Lukas Käll
Journal:  J Proteome Res       Date:  2018-04-16       Impact factor: 4.466

7.  SWATH2stats: An R/Bioconductor Package to Process and Convert Quantitative SWATH-MS Proteomics Data for Downstream Analysis Tools.

Authors:  Peter Blattmann; Moritz Heusel; Ruedi Aebersold
Journal:  PLoS One       Date:  2016-04-07       Impact factor: 3.240

8.  How to talk about protein-level false discovery rates in shotgun proteomics.

Authors:  Matthew The; Ayesha Tasnim; Lukas Käll
Journal:  Proteomics       Date:  2016-09       Impact factor: 3.984

9.  The potential clinical impact of the release of two drafts of the human proteome.

Authors:  Iakes Ezkurdia; Enrique Calvo; Angela Del Pozo; Jesús Vázquez; Alfonso Valencia; Michael L Tress
Journal:  Expert Rev Proteomics       Date:  2015-10-23       Impact factor: 3.940

10.  Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0.

Authors:  Matthew The; Michael J MacCoss; William S Noble; Lukas Käll
Journal:  J Am Soc Mass Spectrom       Date:  2016-08-29       Impact factor: 3.109

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.