Literature DB >> 16278946

Randomized algorithms for motif detection.

Lusheng Wang1, Liang Dong.   

Abstract

MOTIVATION: Motif detection for DNA sequences has many important applications in biological studies, e.g. locating binding sites regulatory signals, designing genetic probes etc. In this paper, we propose a randomized algorithm, design an improved EM algorithm and combine them to form a software tool.
RESULTS: (1) We design a randomized algorithm for consensus pattern problem. We can show that with high probability, our randomized algorithm finds a pattern in polynomial time with cost error at most x l for each string, where l is the length of the motif and can be any positive number given by the user. (2) We design an improved EM algorithm that outperforms the original EM algorithm. (3) We develop a software tool, MotifDetector, that uses our randomized algorithm to find good seeds and uses the improved EM algorithm to do local search. We compare MotifDetector with Buhler and Tompa's PROJECTION which is considered to be the best known software for motif detection. Simulations show that MotifDetector is slower than PROJECTION when the pattern length is relatively small, and outperforms PROJECTION when the pattern length becomes large. AVAILABILITY: It is available for free at http://www.cs.cityu.edu.hk/~lwang/software/motif/index.html, subject to copyright restrictions.

Mesh:

Substances:

Year:  2005        PMID: 16278946     DOI: 10.1142/s0219720005001508

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  2 in total

1.  An efficient rank based approach for closest string and closest substring.

Authors:  Liviu P Dinu; Radu Ionescu
Journal:  PLoS One       Date:  2012-06-04       Impact factor: 3.240

2.  Dataset of microbial community structure in alcohol sprayed banana associated with ripening process.

Authors:  Fenny Martha Dwivany; Fidya Syam; Husna Nugrahapraja; Ocky Karna Radjasa; Maelita Ramdani Moeis; Susumu Uchiyama
Journal:  Data Brief       Date:  2020-02-05
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.