Literature DB >> 26357225

An Efficient Exact Algorithm for the Motif Stem Search Problem over Large Alphabets.

Qiang Yu, Hongwei Huo, Jeffrey Scott Vitter, Jun Huan, Yakov Nekrich.   

Abstract

In recent years, there has been an increasing interest in planted (l, d) motif search (PMS) with applications to discovering significant segments in biological sequences. However, there has been little discussion about PMS over large alphabets. This paper focuses on motif stem search (MSS), which is recently introduced to search motifs on large-alphabet inputs. A motif stem is an l-length string with some wildcards. The goal of the MSS problem is to find a set of stems that represents a superset of all (l , d) motifs present in the input sequences, and the superset is expected to be as small as possible. The three main contributions of this paper are as follows: (1) We build motif stem representation more precisely by using regular expressions. (2) We give a method for generating all possible motif stems without redundant wildcards. (3) We propose an efficient exact algorithm, called StemFinder, for solving the MSS problem. Compared with the previous MSS algorithms, StemFinder runs much faster and reports fewer stems which represent a smaller superset of all (l, d) motifs. StemFinder is freely available at http://sites.google.com/site/feqond/stemfinder.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26357225     DOI: 10.1109/TCBB.2014.2361668

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  2 in total

1.  WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data.

Authors:  Hongbo Zhang; Lin Zhu; De-Shuang Huang
Journal:  Sci Rep       Date:  2017-06-12       Impact factor: 4.379

2.  RefSelect: a reference sequence selection algorithm for planted (l, d) motif search.

Authors:  Qiang Yu; Hongwei Huo; Ruixing Zhao; Dazheng Feng; Jeffrey Scott Vitter; Jun Huan
Journal:  BMC Bioinformatics       Date:  2016-07-19       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.