Literature DB >> 29607643

A Matter of Time: Faster Percolator Analysis via Efficient SVM Learning for Large-Scale Proteomics.

John T Halloran1, David M Rocke2.   

Abstract

Percolator is an important tool for greatly improving the results of a database search and subsequent downstream analysis. Using support vector machines (SVMs), Percolator recalibrates peptide-spectrum matches based on the learned decision boundary between targets and decoys. To improve analysis time for large-scale data sets, we update Percolator's SVM learning engine through software and algorithmic optimizations rather than heuristic approaches that necessitate the careful study of their impact on learned parameters across different search settings and data sets. We show that by optimizing Percolator's original learning algorithm, l2-SVM-MFN, large-scale SVM learning requires nearly only a third of the original runtime. Furthermore, we show that by employing the widely used Trust Region Newton (TRON) algorithm instead of l2-SVM-MFN, large-scale Percolator SVM learning is reduced to nearly only a fifth of the original runtime. Importantly, these speedups only affect the speed at which Percolator converges to a global solution and do not alter recalibration performance. The upgraded versions of both l2-SVM-MFN and TRON are optimized within the Percolator codebase for multithreaded and single-thread use and are available under Apache license at bitbucket.org/jthalloran/percolator_upgrade .

Entities:  

Keywords:  TRON; machine learning; percolator; support vector machine; tandem mass spectrometry

Mesh:

Year:  2018        PMID: 29607643      PMCID: PMC6420878          DOI: 10.1021/acs.jproteome.7b00767

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  3 in total

1.  Speeding Up Percolator.

Authors:  John T Halloran; Hantian Zhang; Kaan Kara; Cédric Renggli; Matthew The; Ce Zhang; David M Rocke; Lukas Käll; William Stafford Noble
Journal:  J Proteome Res       Date:  2019-08-23       Impact factor: 4.466

2.  Diagnosis, clustering, and immune cell infiltration analysis of m6A-related genes in patients with acute myocardial infarction-a bioinformatics analysis.

Authors:  Changzai Liang; Shen Wang; Meng Zhang; Tianzhu Li
Journal:  J Thorac Dis       Date:  2022-05       Impact factor: 3.005

3.  A cost-sensitive online learning method for peptide identification.

Authors:  Xijun Liang; Zhonghang Xia; Ling Jian; Yongxiang Wang; Xinnan Niu; Andrew J Link
Journal:  BMC Genomics       Date:  2020-04-25       Impact factor: 3.969

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.