Literature DB >> 33381824

New mixture models for decoy-free false discovery rate estimation in mass spectrometry proteomics.

Yisu Peng1, Shantanu Jain1, Yong Fuga Li2, Michal Greguš3,4, Alexander R Ivanov3,4, Olga Vitek1,4, Predrag Radivojac1,3,4.   

Abstract

MOTIVATION: Accurate estimation of false discovery rate (FDR) of spectral identification is a central problem in mass spectrometry-based proteomics. Over the past two decades, target-decoy approaches (TDAs) and decoy-free approaches (DFAs) have been widely used to estimate FDR. TDAs use a database of decoy species to faithfully model score distributions of incorrect peptide-spectrum matches (PSMs). DFAs, on the other hand, fit two-component mixture models to learn the parameters of correct and incorrect PSM score distributions. While conceptually straightforward, both approaches lead to problems in practice, particularly in experiments that push instrumentation to the limit and generate low fragmentation-efficiency and low signal-to-noise-ratio spectra.
RESULTS: We introduce a new decoy-free framework for FDR estimation that generalizes present DFAs while exploiting more search data in a manner similar to TDAs. Our approach relies on multi-component mixtures, in which score distributions corresponding to the correct PSMs, best incorrect PSMs and second-best incorrect PSMs are modeled by the skew normal family. We derive EM algorithms to estimate parameters of these distributions from the scores of best and second-best PSMs associated with each experimental spectrum. We evaluate our models on multiple proteomics datasets and a HeLa cell digest case study consisting of more than a million spectra in total. We provide evidence of improved performance over existing DFAs and improved stability and speed over TDAs without any performance degradation. We propose that the new strategy has the potential to extend beyond peptide identification and reduce the need for TDA on all analytical platforms. AVAILABILITYAND IMPLEMENTATION: https://github.com/shawn-peng/FDR-estimation. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 33381824      PMCID: PMC7773488          DOI: 10.1093/bioinformatics/btaa807

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  47 in total

1.  Target-decoy approach and false discovery rate: when things may go wrong.

Authors:  Nitin Gupta; Nuno Bandeira; Uri Keich; Pavel A Pevzner
Journal:  J Am Soc Mass Spectrom       Date:  2011-05-05       Impact factor: 3.109

2.  PepNovo: de novo peptide sequencing via probabilistic network modeling.

Authors:  Ari Frank; Pavel Pevzner
Journal:  Anal Chem       Date:  2005-02-15       Impact factor: 6.986

Review 3.  Assigning significance to peptides identified by tandem mass spectrometry using decoy databases.

Authors:  Lukas Käll; John D Storey; Michael J MacCoss; William Stafford Noble
Journal:  J Proteome Res       Date:  2007-12-08       Impact factor: 4.466

Review 4.  Analysis of protein complexes using mass spectrometry.

Authors:  Anne-Claude Gingras; Matthias Gstaiger; Brian Raught; Ruedi Aebersold
Journal:  Nat Rev Mol Cell Biol       Date:  2007-08       Impact factor: 94.444

5.  The problem with peptide presumption and low Mascot scoring.

Authors:  Bret Cooper
Journal:  J Proteome Res       Date:  2011-01-26       Impact factor: 4.466

6.  Bias in False Discovery Rate Estimation in Mass-Spectrometry-Based Peptide Identification.

Authors:  Yulia Danilova; Anastasia Voronkova; Pavel Sulimov; Attila Kertész-Farkas
Journal:  J Proteome Res       Date:  2019-04-18       Impact factor: 4.466

Review 7.  Gentle Introduction to the Statistical Foundations of False Discovery Rate in Quantitative Proteomics.

Authors:  Thomas Burger
Journal:  J Proteome Res       Date:  2017-11-14       Impact factor: 4.466

8.  Nanoliter-Scale Oil-Air-Droplet Chip-Based Single Cell Proteomic Analysis.

Authors:  Zi-Yi Li; Min Huang; Xiu-Kun Wang; Ying Zhu; Jin-Song Li; Catherine C L Wong; Qun Fang
Journal:  Anal Chem       Date:  2018-03-27       Impact factor: 6.986

9.  Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database.

Authors:  J R Yates; J K Eng; A L McCormack; D Schieltz
Journal:  Anal Chem       Date:  1995-04-15       Impact factor: 6.986

10.  SCoPE-MS: mass spectrometry of single mammalian cells quantifies proteome heterogeneity during cell differentiation.

Authors:  Bogdan Budnik; Ezra Levy; Guillaume Harmange; Nikolai Slavov
Journal:  Genome Biol       Date:  2018-10-22       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.