Literature DB >> 30546175

Controlling the FDR in imperfect matches to an incomplete database.

Uri Keich1, William Stafford Noble2.   

Abstract

We consider the problem of controlling the FDR among discoveries from searching an incomplete database. This problem differs from the classical multiple testing setting because there are two different types of false discoveries: those arising from objects that have no match in the database and those that are incorrectly matched. We show that commonly used FDR controlling procedures are inadequate for this setup, a special case of which is tandem mass spectrum identification. We then derive a novel FDR controlling approach which extensive simulations suggest is unbiased. We also compare its performance with problem-specific as well as general FDR controlling procedures using both simulated and real mass spectrometry data.

Entities:  

Keywords:  false discovery rate; multiple hypothesis testing; tandem mass spectrometry

Year:  2018        PMID: 30546175      PMCID: PMC6287756          DOI: 10.1080/01621459.2017.1375931

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  12 in total

1.  Statistical significance for genomewide studies.

Authors:  John D Storey; Robert Tibshirani
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-25       Impact factor: 11.205

2.  The generating function of CID, ETD, and CID/ETD pairs of tandem mass spectra: applications to database search.

Authors:  Sangtae Kim; Nikolai Mischerikow; Nuno Bandeira; J Daniel Navarro; Louis Wich; Shabaz Mohammed; Albert J R Heck; Pavel A Pevzner
Journal:  Mol Cell Proteomics       Date:  2010-09-09       Impact factor: 5.911

3.  Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry.

Authors:  Joshua E Elias; Steven P Gygi
Journal:  Nat Methods       Date:  2007-03       Impact factor: 28.547

4.  Target-decoy search strategy for mass spectrometry-based proteomics.

Authors:  Joshua E Elias; Steven P Gygi
Journal:  Methods Mol Biol       Date:  2010

5.  Computing exact p-values for a cross-correlation shotgun proteomics score function.

Authors:  J Jeffry Howbert; William Stafford Noble
Journal:  Mol Cell Proteomics       Date:  2014-06-02       Impact factor: 5.911

Review 6.  A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.

Authors:  Alexey I Nesvizhskii
Journal:  J Proteomics       Date:  2010-09-08       Impact factor: 4.044

7.  Spectral probabilities and generating functions of tandem mass spectra: a strike against decoy databases.

Authors:  Sangtae Kim; Nitin Gupta; Pavel A Pevzner
Journal:  J Proteome Res       Date:  2008-07-03       Impact factor: 4.466

8.  False discovery rates in spectral identification.

Authors:  Kyowon Jeong; Sangtae Kim; Nuno Bandeira
Journal:  BMC Bioinformatics       Date:  2012-11-05       Impact factor: 3.169

9.  Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics.

Authors:  Uri Keich; Attila Kertesz-Farkas; William Stafford Noble
Journal:  J Proteome Res       Date:  2015-07-27       Impact factor: 4.466

10.  On the importance of well-calibrated scores for identifying shotgun proteomics spectra.

Authors:  Uri Keich; William Stafford Noble
Journal:  J Proteome Res       Date:  2014-12-17       Impact factor: 4.466

View more
  1 in total

1.  Averaging Strategy To Reduce Variability in Target-Decoy Estimates of False Discovery Rate.

Authors:  Uri Keich; Kaipo Tamura; William Stafford Noble
Journal:  J Proteome Res       Date:  2019-01-03       Impact factor: 4.466

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.