Literature DB >> 25400485

A Space-Bounded Anytime Algorithm for the Multiple Longest Common Subsequence Problem.

Jiaoyun Yang1, Yun Xu2, Yi Shang3, Guoliang Chen2.   

Abstract

The multiple longest common subsequence (MLCS) problem, related to the identification of sequence similarity, is an important problem in many fields. As an NP-hard problem, its exact algorithms have difficulty in handling large-scale data and time- and space-efficient algorithms are required in real-world applications. To deal with time constraints, anytime algorithms have been proposed to generate good solutions with a reasonable time. However, there exists little work on space-efficient MLCS algorithms. In this paper, we formulate the MLCS problem into a graph search problem and present two space-efficient anytime MLCS algorithms, SA-MLCS and SLA-MLCS. SA-MLCS uses an iterative beam widening search strategy to reduce space usage during the iterative process of finding better solutions. Based on SA-MLCS, SLA-MLCS, a space-bounded algorithm, is developed to avoid space usage from exceeding available memory. SLA-MLCS uses a replacing strategy when SA-MLCS reaches a given space bound. Experimental results show SA-MLCS and SLA-MLCS use an order of magnitude less space and time than the state-of-the-art approximate algorithm MLCS-APP while finding better solutions. Compared to the state-of-the-art anytime algorithm Pro-MLCS, SA-MLCS and SLA-MLCS can solve an order of magnitude larger size instances. Furthermore, SLA-MLCS can find much better solutions than SA-MLCS on large size instances.

Entities:  

Keywords:  Heuristic search; anytime algorithm; multiple longest common subsequence (MLCS); space bounded

Year:  2014        PMID: 25400485      PMCID: PMC4231498          DOI: 10.1109/TKDE.2014.2304464

Source DB:  PubMed          Journal:  IEEE Trans Knowl Data Eng        ISSN: 1041-4347            Impact factor:   6.977


  6 in total

1.  MAWA∗-a memory-bounded anytime heuristic-search algorithm.

Authors:  Satya Gautam Vadlamudi; Sandip Aine; Partha Pratim Chakrabarti
Journal:  IEEE Trans Syst Man Cybern B Cybern       Date:  2010-11-18

Review 2.  Next-generation sequencing transforms today's biology.

Authors:  Stephan C Schuster
Journal:  Nat Methods       Date:  2007-12-19       Impact factor: 28.547

3.  Matching sequences under deletion-insertion constraints.

Authors:  D Sankoff
Journal:  Proc Natl Acad Sci U S A       Date:  1972-01       Impact factor: 11.205

4.  Fingerprinting G-protein-coupled receptors.

Authors:  T K Attwood; J B Findlay
Journal:  Protein Eng       Date:  1994-02

5.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

6.  A fast parallel algorithm for finding the longest common sequence of multiple biosequences.

Authors:  Yixin Chen; Andrew Wan; Wei Liu
Journal:  BMC Bioinformatics       Date:  2006-12-12       Impact factor: 3.169

  6 in total
  3 in total

1.  A new algorithm for "the LCS problem" with application in compressing genome resequencing data.

Authors:  Richard Beal; Tazin Afrin; Aliya Farheen; Donald Adjeroh
Journal:  BMC Genomics       Date:  2016-08-18       Impact factor: 3.969

2.  A Novel Efficient Graph Model for the Multiple Longest Common Subsequences (MLCS) Problem.

Authors:  Zhan Peng; Yuping Wang
Journal:  Front Genet       Date:  2017-08-09       Impact factor: 4.599

3.  A fast and efficient path elimination algorithm for large-scale multiple common longest sequence problems.

Authors:  Changyong Yu; Pengxi Lin; Yuhai Zhao; Tianmei Ren; Guoren Wang
Journal:  BMC Bioinformatics       Date:  2022-09-07       Impact factor: 3.307

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.