Literature DB >> 23959399

PMS6: A Fast Algorithm for Motif Discovery.

Shibdas Bandyopadhyay1, Sartaj Sahni, Sanguthevar Rajasekaran.   

Abstract

We propose a new algorithm, PMS6, for the (l, d)-motif discovery problem in which we are to find all strings of length l that appear in every string of a given set of strings with at most d mismatches. The run time ratio PMS5/PMS6, where PMS5 is the fastest previously known algorithm for motif discovery in large instances, ranges from a high of 2.20 for the (21,8) challenge instances to a low of 1.69 for the (17,6) challenge instances. Both PMS5 and PMS6 require some amount of preprocessing. The preprocessing time for PMS6 is 34 times faster than that for PMS5 for (23,9) instances. When preprocessing time is factored in, the run time ratio PMS5/PMS6 is as high as 2.75 for (13,4) instances and as low as 1.95 for (17,6) instances.

Entities:  

Keywords:  Planted motif search; string algorithms

Year:  2012        PMID: 23959399      PMCID: PMC3744182          DOI: 10.1109/ICCABS.2012.6182627

Source DB:  PubMed          Journal:  IEEE Int Conf Comput Adv Bio Med Sci        ISSN: 2164-229X


  12 in total

1.  Identifying DNA and protein patterns with statistically significant alignments of multiple sequences.

Authors:  G Z Hertz; G D Stormo
Journal:  Bioinformatics       Date:  1999 Jul-Aug       Impact factor: 6.937

2.  Algorithms for extracting structured motifs using a suffix tree with an application to promoter and regulatory site consensus identification.

Authors:  L Marsan; M F Sagot
Journal:  J Comput Biol       Date:  2000       Impact factor: 1.479

3.  Finding motifs in the twilight zone.

Authors:  U Keich; P A Pevzner
Journal:  Bioinformatics       Date:  2002-10       Impact factor: 6.937

4.  Finding composite regulatory patterns in DNA sequences.

Authors:  Eleazar Eskin; Pavel A Pevzner
Journal:  Bioinformatics       Date:  2002       Impact factor: 6.937

5.  Exact algorithms for planted motif problems.

Authors:  S Rajasekaran; S Balla; C-H Huang
Journal:  J Comput Biol       Date:  2005-10       Impact factor: 1.479

6.  Fast and practical algorithms for planted (l, d) motif search.

Authors:  Jaime Davila; Sudha Balla; Sanguthevar Rajasekaran
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2007 Oct-Dec       Impact factor: 3.710

7.  Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment.

Authors:  C E Lawrence; S F Altschul; M S Boguski; J S Liu; A F Neuwald; J C Wootton
Journal:  Science       Date:  1993-10-08       Impact factor: 47.728

8.  PMS5: an efficient exact algorithm for the (ℓ, d)-motif finding problem.

Authors:  Hieu Dinh; Sanguthevar Rajasekaran; Vamsi K Kundeti
Journal:  BMC Bioinformatics       Date:  2011-10-24       Impact factor: 3.169

9.  Efficient motif finding algorithms for large-alphabet inputs.

Authors:  Pavel P Kuksa; Vladimir Pavlovic
Journal:  BMC Bioinformatics       Date:  2010-10-26       Impact factor: 3.169

10.  A speedup technique for (l, d)-motif finding algorithms.

Authors:  Sanguthevar Rajasekaran; Hieu Dinh
Journal:  BMC Res Notes       Date:  2011-03-08
View more
  2 in total

1.  PMS6MC: A Multicore Algorithm for Motif Discovery.

Authors:  Shibdas Bandyopadhyay; Sartaj Sahni; Sanguthevar Rajasekaran
Journal:  Algorithms       Date:  2013-11-18

2.  Efficient algorithms for biological stems search.

Authors:  Tian Mi; Sanguthevar Rajasekaran
Journal:  BMC Bioinformatics       Date:  2013-05-16       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.