Literature DB >> 17109757

EXMOTIF: efficient structured motif extraction.

Yongqiang Zhang1, Mohammed J Zaki.   

Abstract

BACKGROUND: Extracting motifs from sequences is a mainstay of bioinformatics. We look at the problem of mining structured motifs, which allow variable length gaps between simple motif components. We propose an efficient algorithm, called EXMOTIF, that given some sequence(s), and a structured motif template, extracts all frequent structured motifs that have quorum q. Potential applications of our method include the extraction of single/composite regulatory binding sites in DNA sequences.
RESULTS: EXMOTIF is efficient in terms of both time and space and is shown empirically to outperform RISO, a state-of-the-art algorithm. It is also successful in finding potential single/composite transcription factor binding sites.
CONCLUSION: EXMOTIF is a useful and efficient tool in discovering structured motifs, especially in DNA sequences. The algorithm is available as open-source at: http://www.cs.rpi.edu/~zaki/software/exMotif/.

Entities:  

Year:  2006        PMID: 17109757      PMCID: PMC1698483          DOI: 10.1186/1748-7188-1-21

Source DB:  PubMed          Journal:  Algorithms Mol Biol        ISSN: 1748-7188            Impact factor:   1.405


  14 in total

1.  Identifying target sites for cooperatively binding factors.

Authors:  D GuhaThakurta; G D Stormo
Journal:  Bioinformatics       Date:  2001-07       Impact factor: 6.937

2.  Genome-wide analysis of bacterial promoter regions.

Authors:  Eleazar Eskin; Uri Keich; Mikhail S Gelfand; Pavel A Pevzner
Journal:  Pac Symp Biocomput       Date:  2003

3.  Finding composite regulatory patterns in DNA sequences.

Authors:  Eleazar Eskin; Pavel A Pevzner
Journal:  Bioinformatics       Date:  2002       Impact factor: 6.937

4.  YMF: A program for discovery of novel transcription factor binding sites by statistical overrepresentation.

Authors:  Saurabh Sinha; Martin Tompa
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

5.  Conservative extraction of over-represented extensible motifs.

Authors:  Alberto Apostolico; Matteo Comin; Laxmi Parida
Journal:  Bioinformatics       Date:  2005-06       Impact factor: 6.937

6.  Tandem repeats finder: a program to analyze DNA sequences.

Authors:  G Benson
Journal:  Nucleic Acids Res       Date:  1999-01-15       Impact factor: 16.971

7.  The value of prior knowledge in discovering motifs with MEME.

Authors:  T L Bailey; C Elkan
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1995

8.  SCPD: a promoter database of the yeast Saccharomyces cerevisiae.

Authors:  J Zhu; M Q Zhang
Journal:  Bioinformatics       Date:  1999 Jul-Aug       Impact factor: 6.937

9.  Discovery of novel transcription factor binding sites by statistical overrepresentation.

Authors:  Saurabh Sinha; Martin Tompa
Journal:  Nucleic Acids Res       Date:  2002-12-15       Impact factor: 16.971

10.  Scoring functions for transcription factor binding site prediction.

Authors:  Markus Friberg; Peter von Rohr; Gaston Gonnet
Journal:  BMC Bioinformatics       Date:  2005-04-04       Impact factor: 3.169

View more
  6 in total

1.  SMOTIF: efficient structured pattern and profile motif search.

Authors:  Yongqiang Zhang; Mohammed J Zaki
Journal:  Algorithms Mol Biol       Date:  2006-11-21       Impact factor: 1.405

2.  Protein sequences classification by means of feature extraction with substitution matrices.

Authors:  Rabie Saidi; Mondher Maddouri; Engelbert Mephu Nguifo
Journal:  BMC Bioinformatics       Date:  2010-04-08       Impact factor: 3.169

3.  Data Mining in Bioinformatics (BIOKDD).

Authors:  Mohammed J Zaki; George Karypis; Jiong Yang
Journal:  Algorithms Mol Biol       Date:  2007-04-11       Impact factor: 1.405

4.  libFLASM: a software library for fixed-length approximate string matching.

Authors:  Lorraine A K Ayad; Solon P P Pissis; Ahmad Retha
Journal:  BMC Bioinformatics       Date:  2016-11-10       Impact factor: 3.169

Review 5.  A Review on Planted (l, d) Motif Discovery Algorithms for Medical Diagnose.

Authors:  Satarupa Mohanty; Prasant Kumar Pattnaik; Ahmed Abdulhakim Al-Absi; Dae-Ki Kang
Journal:  Sensors (Basel)       Date:  2022-02-05       Impact factor: 3.576

6.  Direct vs 2-stage approaches to structured motif finding.

Authors:  Maria Federico; Mauro Leoncini; Manuela Montangero; Paolo Valente
Journal:  Algorithms Mol Biol       Date:  2012-08-21       Impact factor: 1.405

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.