
Swiftly computing center strings.

Franziska Hufsky, Léon Kuchenbecker, Katharina Jahn, Jens Stoye, Sebastian Böcker.

Abstract

BACKGROUND: The center string (or closest string) problem is a classic computer science problem with important applications in computational biology. Given k input strings and a distance threshold d, we search for a string within Hamming distance at most d of each input string. This problem is NP-complete.
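To make the problem statement concrete, here is a minimal Python sketch of the decision problem. It is not the authors' method: it is a naive exhaustive search, exponential in the string length, and the function names (`hamming`, `is_center`, `brute_force_center`) are hypothetical. It assumes all input strings have the same length and that candidates only need characters occurring in the inputs.

```python
from itertools import product

def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("strings must have equal length")
    return sum(x != y for x, y in zip(a, b))

def is_center(candidate, strings, d):
    """True if candidate is within Hamming distance d of every input string."""
    return all(hamming(candidate, s) <= d for s in strings)

def brute_force_center(strings, d):
    """Try every candidate string; for illustration only (exponential time)."""
    # Assumption: characters absent from all inputs never help, so skip them.
    alphabet = sorted(set("".join(strings)))
    for chars in product(alphabet, repeat=len(strings[0])):
        candidate = "".join(chars)
        if is_center(candidate, strings, d):
            return candidate
    return None  # no center string exists for this threshold
```

For example, `brute_force_center(["AAC", "ACA", "CAA"], 1)` finds a string that differs from each input in at most one position, while the same call with `d = 0` returns `None`.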
RESULTS: In this paper, we focus on exact methods for the problem that are also swift in application. We first introduce data reduction techniques that allow us to infer that certain instances have no solution, or that a center string must satisfy certain conditions. We describe how to use this information to speed up two previously published search tree algorithms. Then, we describe a novel iterative search strategy that is efficient in practice, where some of our reduction techniques can also be applied. Finally, we present results of an evaluation study for two different data sets from a biological application.
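The abstract does not spell out the reduction rules, so the following Python sketch only illustrates the two kinds of inference it mentions, using two simple rules chosen as assumptions. First, by the triangle inequality, an instance has no solution if any two input strings differ in more than 2d positions. Second, a position at which all input strings agree can be fixed to that character. The function name `reduce_instance` and its return format are hypothetical.

```python
from itertools import combinations

def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))

def reduce_instance(strings, d):
    """Return None if the instance is provably unsolvable,
    otherwise (fixed, open_columns)."""
    # Rule 1 (assumed): two strings more than 2d apart cannot share a center.
    for s, t in combinations(strings, 2):
        if hamming(s, t) > 2 * d:
            return None
    # Rule 2 (assumed): columns where all strings agree are trivially solved.
    fixed = {}
    open_columns = []
    for i, column in enumerate(zip(*strings)):
        if len(set(column)) == 1:
            fixed[i] = column[0]
        else:
            open_columns.append(i)
    return fixed, open_columns
```

A search algorithm would then only need to branch on the positions in `open_columns`.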
CONCLUSIONS: We find that the running time for computing the optimal center string is dominated by the subroutine calls for d = d_opt - 1 and d = d_opt. Our data reduction is very effective for both, either rejecting unsolvable instances or solving trivial positions. We find that this speeds up computations considerably.
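The remark about the calls for d = d_opt - 1 and d = d_opt suggests an outer loop that raises d until the decision subroutine succeeds. The Python sketch below illustrates that loop under this assumption. The subroutine `decide` is a naive exhaustive stand-in, not one of the search tree algorithms from the paper, and all names are hypothetical.

```python
from itertools import product

def hamming(a, b):
    """Number of positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))

def decide(strings, d):
    """Naive decision subroutine: return a center for threshold d, or None."""
    alphabet = sorted(set("".join(strings)))
    for chars in product(alphabet, repeat=len(strings[0])):
        if all(hamming(chars, s) <= d for s in strings):
            return "".join(chars)
    return None

def optimal_center(strings, decide=decide):
    """Increase d from 0 until the instance becomes solvable.

    The last failing call is d_opt - 1 and the first succeeding call is d_opt.
    """
    for d in range(len(strings[0]) + 1):
        center = decide(strings, d)
        if center is not None:
            return d, center
```

The loop always terminates, because any string of the right length is a center once d equals the string length.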

Year:  2011        PMID: 21504573      PMCID: PMC3108310          DOI: 10.1186/1471-2105-12-106

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  4 in total

1.  Fast and practical algorithms for planted (l, d) motif search.

Authors:  Jaime Davila; Sudha Balla; Sanguthevar Rajasekaran
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2007 Oct-Dec       Impact factor: 3.710

2.  Computation of median gene clusters.

Authors:  Sebastian Böcker; Katharina Jahn; Julia Mixtacki; Jens Stoye
Journal:  J Comput Biol       Date:  2009-08       Impact factor: 1.479

3.  Degenerated primer design to amplify the heavy chain variable region from immunoglobulin cDNA.

Authors:  Ying Wang; Wei Chen; Xu Li; Bing Cheng
Journal:  BMC Bioinformatics       Date:  2006-12-12       Impact factor: 3.169

4.  The society of genes: networks of functional links between genes from comparative genomics.

Authors:  Itai Yanai; Charles DeLisi
Journal:  Genome Biol       Date:  2002-10-25       Impact factor: 13.583

