Literature DB >> 25322837

An efficient algorithm for the blocked pattern matching problem.

Fei Deng1, Lusheng Wang1, Xiaowen Liu2.   

Abstract

MOTIVATION: Tandem mass spectrometry (MS) has become the method of choice for protein identification and quantification. In the era of big data biology, tandem mass spectra are often searched against huge protein databases generated from genomes or RNA-Seq data for peptide identification. However, most existing tools for MS-based peptide identification compare a tandem mass spectrum against all peptides in a database whose molecular masses are similar to the precursor mass of the spectrum, making mass spectral data analysis slow for huge databases. Tag-based methods extract peptide sequence tags from a tandem mass spectrum and use them as a filter to reduce the number of candidate peptides, thus speeding up the database search. Recently, gapped tags have been introduced into mass spectral data analysis because they improve the sensitivity of peptide identification compared with sequence tags. However, the blocked pattern matching (BPM) problem, which is an essential step in gapped tag-based peptide identification, has not been fully solved.
RESULTS: In this article, we propose a fast and memory-efficient algorithm for the BPM problem. Experiments on both simulated and real datasets showed that the proposed algorithm achieved high speed and high sensitivity for peptide filtration in peptide identification by database search. CONTACT: cswangl@cityu.edu.hk or xwliu@iupui.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Substances:

Year:  2014        PMID: 25322837     DOI: 10.1093/bioinformatics/btu678

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  3 in total

1.  Systematic Evaluation of Protein Sequence Filtering Algorithms for Proteoform Identification Using Top-Down Mass Spectrometry.

Authors:  Qiang Kou; Si Wu; Xiaowen Liu
Journal:  Proteomics       Date:  2018-02-06       Impact factor: 3.984

2.  A Spectrum Graph-Based Protein Sequence Filtering Algorithm for Proteoform Identification by Top-Down Mass Spectrometry.

Authors:  Runmin Yang; Daming Zhu; Qiang Kou; Poomima Bhat-Nakshatri; Harikrishna Nakshatri; Si Wu; Xiaowen Liu
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2017-12-18

3.  A graph-based filtering method for top-down mass spectral identification.

Authors:  Runmin Yang; Daming Zhu
Journal:  BMC Genomics       Date:  2018-09-24       Impact factor: 3.969

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.