Literature DB >> 32009418

Machine Learning Strategy That Leverages Large Data sets to Boost Statistical Power in Small-Scale Experiments.

William E Fondrie1, William S Noble1,2.   

Abstract

Machine learning methods have proven invaluable for increasing the sensitivity of peptide detection in proteomics experiments. Most modern tools, such as Percolator and PeptideProphet, use semisupervised algorithms to learn models directly from the data sets that they analyze. Although these methods are effective for many proteomics experiments, we suspected that they may be suboptimal for experiments of smaller scale. In this work, we found that the power and consistency of Percolator results were reduced as the size of the experiment was decreased. As an alternative, we propose a different operating mode for Percolator: learn a model with Percolator from a large data set and use the learned model to evaluate the small-scale experiment. We call this a "static modeling" approach, in contrast to Percolator's usual "dynamic model" that is trained anew for each data set. We applied this static modeling approach to two settings: small, gel-based experiments and single-cell proteomics. In both cases, static models increased the yield of detected peptides and eliminated the model-induced variability of the standard dynamic approach. These results suggest that static models are a powerful tool for bringing the full benefits of Percolator and other semisupervised algorithms to small-scale experiments.

Entities:  

Keywords:  SVM; bioinformatics; confidence estimation; machine learning; peptide identification; percolator; proteomics; single-cell mass spectrometry; support vector machine; tandem mass spectrometry

Mesh:

Year:  2020        PMID: 32009418     DOI: 10.1021/acs.jproteome.9b00780

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  5 in total

1.  Optimization of skeletal protein preparation for LC-MS/MS sequencing yields additional coral skeletal proteins in Stylophora pistillata.

Authors:  Yanai Peled; Jeana L Drake; Assaf Malik; Ricardo Almuly; Maya Lalzar; David Morgenstern; Tali Mass
Journal:  BMC Mater       Date:  2020-07-16

2.  Driving Single Cell Proteomics Forward with Innovation.

Authors:  Nikolai Slavov
Journal:  J Proteome Res       Date:  2021-10-01       Impact factor: 4.466

3.  mokapot: Fast and Flexible Semisupervised Learning for Peptide Detection.

Authors:  William E Fondrie; William S Noble
Journal:  J Proteome Res       Date:  2021-02-17       Impact factor: 5.370

4.  Deep learning boosts sensitivity of mass spectrometry-based immunopeptidomics.

Authors:  Mathias Wilhelm; Daniel P Zolg; Michael Graber; Siegfried Gessulat; Tobias Schmidt; Karsten Schnatbaum; Celina Schwencke-Westphal; Philipp Seifert; Niklas de Andrade Krätzig; Johannes Zerweck; Tobias Knaute; Eva Bräunlein; Patroklos Samaras; Ludwig Lautenbacher; Susan Klaeger; Holger Wenschuh; Roland Rad; Bernard Delanghe; Andreas Huhmer; Steven A Carr; Karl R Clauser; Angela M Krackhardt; Ulf Reimer; Bernhard Kuster
Journal:  Nat Commun       Date:  2021-06-07       Impact factor: 14.919

Review 5.  Proteome Discoverer-A Community Enhanced Data Processing Suite for Protein Informatics.

Authors:  Benjamin C Orsburn
Journal:  Proteomes       Date:  2021-03-23
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.