Literature DB >> 31523183

SWPepNovo: An Efficient De Novo Peptide Sequencing Tool for Large-scale MS/MS Spectra Analysis.

Chuang Li1, Kenli Li2, Keqin Li3, Xianghui Xie4, Feng Lin5.   

Abstract

Tandem mass spectrometry (MS/MS)-based de novo peptide sequencing is a powerful method for high-throughput protein analysis. However, the explosively increasing size of MS/MS spectra dataset inevitably and exponentially raises the computational demand of existing de novo peptide sequencing methods, which is an issue urgently to be solved in computational biology. This paper introduces an efficient tool based on SW26010 many-core processor, namely SWPepNovo, to process the large-scale peptide MS/MS spectra using a parallel peptide spectrum matches (PSMs) algorithm. Our design employs a two-level parallelization mechanism: (1) the task-level parallelism between MPEs using MPI based on a data transformation method and a dynamic feedback task scheduling algorithm, (2) the thread-level parallelism across CPEs using asynchronous task transfer and multithreading. Moreover, three optimization strategies, including vectorization, double buffering and memory access optimizations, have been employed to overcome both the compute-bound and the memory-bound bottlenecks in the parallel PSMs algorithm. The results of experiments conducted on multiple spectra datasets demonstrate the performance of SWPepNovo against three state-of-the-art tools for peptide sequencing, including PepNovo+, PEAKS and DeepNovo-DIA. The SWPepNovo also shows high scalability in experiments on extremely large datasets sized up to 11.22 GB. The software and the parameter settings are available at https://github.com/ChuangLi99/SWPepNovo.

Entities:  

Keywords:  Large-scale MS/MS spectra analysis; SW26010; de novo peptide sequencing; high performance computing

Year:  2019        PMID: 31523183      PMCID: PMC6743289          DOI: 10.7150/ijbs.32142

Source DB:  PubMed          Journal:  Int J Biol Sci        ISSN: 1449-2288            Impact factor:   6.580


  23 in total

1.  TANDEM: matching proteins with tandem mass spectra.

Authors:  Robertson Craig; Ronald C Beavis
Journal:  Bioinformatics       Date:  2004-02-19       Impact factor: 6.937

Review 2.  Algorithms for the de novo sequencing of peptides from tandem mass spectra.

Authors:  Jens Allmer
Journal:  Expert Rev Proteomics       Date:  2011-10       Impact factor: 3.940

3.  pFind: a novel database-searching software system for automated peptide and protein identification via tandem mass spectrometry.

Authors:  Dequan Li; Yan Fu; Ruixiang Sun; Charles X Ling; Yonggang Wei; Hu Zhou; Rong Zeng; Qiang Yang; Simin He; Wen Gao
Journal:  Bioinformatics       Date:  2005-04-07       Impact factor: 6.937

4.  Discovering known and unanticipated protein modifications using MS/MS database searching.

Authors:  Wilfred H Tang; Benjamin R Halpern; Ignat V Shilov; Sean L Seymour; Sean P Keating; Alex Loboda; Alpesh A Patel; Daniel A Schaeffer; Lydia M Nuwaysir
Journal:  Anal Chem       Date:  2005-07-01       Impact factor: 6.986

5.  Highly accelerated feature detection in proteomics data sets using modern graphics processing units.

Authors:  Rene Hussong; Barbara Gregorius; Andreas Tholey; Andreas Hildebrandt
Journal:  Bioinformatics       Date:  2009-05-14       Impact factor: 6.937

6.  pNovo: de novo peptide sequencing and identification using HCD spectra.

Authors:  Hao Chi; Rui-Xiang Sun; Bing Yang; Chun-Qing Song; Le-Heng Wang; Chao Liu; Yan Fu; Zuo-Fei Yuan; Hai-Peng Wang; Si-Min He; Meng-Qiu Dong
Journal:  J Proteome Res       Date:  2010-05-07       Impact factor: 4.466

7.  Speeding up tandem mass spectrometry based database searching by peptide and spectrum indexing.

Authors:  You Li; Hao Chi; Le-Heng Wang; Hai-Peng Wang; Yan Fu; Zuo-Fei Yuan; Su-Jun Li; Yan-Sheng Liu; Rui-Xiang Sun; Rong Zeng; Si-Min He
Journal:  Rapid Commun Mass Spectrom       Date:  2010-03       Impact factor: 2.419

8.  Parallel Implementation of MAFFT on CUDA-Enabled Graphics Hardware.

Authors:  Xiangyuan Zhu; Kenli Li; Ahmad Salah; Lin Shi; Keqin Li
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2015 Jan-Feb       Impact factor: 3.710

9.  A ranking-based scoring function for peptide-spectrum matches.

Authors:  Ari M Frank
Journal:  J Proteome Res       Date:  2009-05       Impact factor: 4.466

10.  Special issue on Computational Resources and Methods in Biological Sciences.

Authors:  Hao Lin; Shaoliang Peng; Jian Huang
Journal:  Int J Biol Sci       Date:  2018-07-01       Impact factor: 6.580

View more
  2 in total

1.  Communication Lower-Bounds for Distributed-Memory Computations for Mass Spectrometry based Omics Data.

Authors:  Fahad Saeed; Muhammad Haseeb; S S Iyengar
Journal:  J Parallel Distrib Comput       Date:  2021-11-17       Impact factor: 3.734

Review 2.  Bioinformatics Methods for Mass Spectrometry-Based Proteomics Data Analysis.

Authors:  Chen Chen; Jie Hou; John J Tanner; Jianlin Cheng
Journal:  Int J Mol Sci       Date:  2020-04-20       Impact factor: 5.923

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.