Literature DB >> 34236864

Accurately Assigning Peptides to Spectra When Only a Subset of Peptides Are Relevant.

Andy Lin1, Deanna L Plubell1, Uri Keich2, William S Noble1,3.   

Abstract

The standard proteomics database search strategy involves searching spectra against a peptide database and estimating the false discovery rate (FDR) of the resulting set of peptide-spectrum matches. One assumption of this protocol is that all the peptides in the database are relevant to the hypothesis being investigated. However, in settings where researchers are interested in a subset of peptides, alternative search and FDR control strategies are needed. Recently, two methods were proposed to address this problem: subset-search and all-sub. We show that both methods fail to control the FDR. For subset-search, this failure is due to the presence of "neighbor" peptides, which are defined as irrelevant peptides with a similar precursor mass and fragmentation spectrum as a relevant peptide. Not considering neighbors compromises the FDR estimate because a spectrum generated by an irrelevant peptide can incorrectly match well to a relevant peptide. Therefore, we have developed a new method, "subset-neighbor search" (SNS), that accounts for neighbor peptides. We show evidence that SNS controls the FDR when neighbors are present and that SNS outperforms group-FDR, the only other method that appears to control the FDR relative to a subset of relevant peptides.

Entities:  

Keywords:  FDR control; database search; false discovery rate; protein database

Mesh:

Substances:

Year:  2021        PMID: 34236864      PMCID: PMC8489664          DOI: 10.1021/acs.jproteome.1c00483

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   5.370


  30 in total

1.  MS1, MS2, and SQT-three unified, compact, and easily parsed file formats for the storage of shotgun proteomic spectra and identifications.

Authors:  W Hayes McDonald; David L Tabb; Rovshan G Sadygov; Michael J MacCoss; John Venable; Johannes Graumann; Jeff R Johnson; Daniel Cociorva; John R Yates
Journal:  Rapid Commun Mass Spectrom       Date:  2004       Impact factor: 2.419

2.  Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry.

Authors:  Joshua E Elias; Steven P Gygi
Journal:  Nat Methods       Date:  2007-03       Impact factor: 28.547

3.  Transferred subgroup false discovery rate for rare post-translational modifications detected by mass spectrometry.

Authors:  Yan Fu; Xiaohong Qian
Journal:  Mol Cell Proteomics       Date:  2013-11-07       Impact factor: 5.911

4.  Averaging Strategy To Reduce Variability in Target-Decoy Estimates of False Discovery Rate.

Authors:  Uri Keich; Kaipo Tamura; William Stafford Noble
Journal:  J Proteome Res       Date:  2019-01-03       Impact factor: 4.466

5.  Quality assessments of peptide-spectrum matches in shotgun proteomics.

Authors:  Viktor Granholm; Lukas Käll
Journal:  Proteomics       Date:  2011-02-07       Impact factor: 3.984

6.  Mass spectrometrists should search for all peptides, but assess only the ones they care about.

Authors:  Adriaan Sticker; Lennart Martens; Lieven Clement
Journal:  Nat Methods       Date:  2017-06-29       Impact factor: 28.547

7.  Response to "Mass spectrometrists should search for all peptides, but assess only the ones they care about".

Authors:  William Stafford Noble; Uri Keich
Journal:  Nat Methods       Date:  2017-06-29       Impact factor: 28.547

8.  Evaluation of FASP, SP3, and iST Protocols for Proteomic Sample Preparation in the Low Microgram Range.

Authors:  Malte Sielaff; Jörg Kuharev; Toszka Bohn; Jennifer Hahlbrock; Tobias Bopp; Stefan Tenzer; Ute Distler
Journal:  J Proteome Res       Date:  2017-10-11       Impact factor: 4.466

9.  MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics.

Authors:  Andy T Kong; Felipe V Leprevost; Dmitry M Avtonomov; Dattatreya Mellacheruvu; Alexey I Nesvizhskii
Journal:  Nat Methods       Date:  2017-04-10       Impact factor: 28.547

10.  Transfer posterior error probability estimation for peptide identification.

Authors:  Xinpei Yi; Fuzhou Gong; Yan Fu
Journal:  BMC Bioinformatics       Date:  2020-05-04       Impact factor: 3.169

View more
  3 in total

1.  Building Spectral Libraries from Narrow-Window Data-Independent Acquisition Mass Spectrometry Data.

Authors:  Lilian R Heil; William E Fondrie; Christopher D McGann; Alexander J Federation; William S Noble; Michael J MacCoss; Uri Keich
Journal:  J Proteome Res       Date:  2022-05-12       Impact factor: 5.370

2.  An analysis of proteogenomics and how and when transcriptome-informed reduction of protein databases can enhance eukaryotic proteomics.

Authors:  Laura Fancello; Thomas Burger
Journal:  Genome Biol       Date:  2022-06-20       Impact factor: 17.906

Review 3.  Considerations for constructing a protein sequence database for metaproteomics.

Authors:  J Alfredo Blakeley-Ruiz; Manuel Kleiner
Journal:  Comput Struct Biotechnol J       Date:  2022-01-21       Impact factor: 7.271

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.