Literature DB >> 11108467

Algorithms for extracting structured motifs using a suffix tree with an application to promoter and regulatory site consensus identification.

L Marsan1, M F Sagot.   

Abstract

This paper introduces two exact algorithms for extracting conserved structured motifs from a set of DNA sequences. Structured motifs may be described as an ordered collection of p > or = 1 "boxes" (each box corresponding to one part of the structured motif), p substitution rates (one for each box) and p - 1 intervals of distance (one for each pair of successive boxes in the collection). The contents of the boxes--that is, the motifs themselves--are unknown at the start of the algorithm. This is precisely what the algorithms are meant to find. A suffix tree is used for finding such motifs. The algorithms are efficient enough to be able to infer site consensi, such as, for instance, promoter sequences or regulatory sites, from a set of unaligned sequences corresponding to the noncoding regions upstream from all genes of a genome. In particular, both algorithms time complexity scales linearly with N2n where n is the average length of the sequences and N their number. An application to the identification of promoter and regulatory consensus sequences in bacterial genomes is shown.

Mesh:

Substances:

Year:  2000        PMID: 11108467     DOI: 10.1089/106652700750050826

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  53 in total

1.  Gibbs Recursive Sampler: finding transcription factor binding sites.

Authors:  William Thompson; Eric C Rouchka; Charles E Lawrence
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

Review 2.  Computational approaches to identify promoters and cis-regulatory elements in plant genomes.

Authors:  Stephane Rombauts; Kobe Florquin; Magali Lescot; Kathleen Marchal; Pierre Rouzé; Yves van de Peer
Journal:  Plant Physiol       Date:  2003-07       Impact factor: 8.340

3.  Weeder Web: discovery of transcription factor binding sites in a set of sequences from co-regulated genes.

Authors:  Giulio Pavesi; Paolo Mereghetti; Giancarlo Mauri; Graziano Pesole
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

4.  Decoding human regulatory circuits.

Authors:  William Thompson; Michael J Palumbo; Wyeth W Wasserman; Jun S Liu; Charles E Lawrence
Journal:  Genome Res       Date:  2004-10       Impact factor: 9.043

5.  SMOTIF: efficient structured pattern and profile motif search.

Authors:  Yongqiang Zhang; Mohammed J Zaki
Journal:  Algorithms Mol Biol       Date:  2006-11-21       Impact factor: 1.405

Review 6.  Comparative genomic reconstruction of transcriptional regulatory networks in bacteria.

Authors:  Dmitry A Rodionov
Journal:  Chem Rev       Date:  2007-07-18       Impact factor: 60.622

7.  PIDA:A new algorithm for pattern identification.

Authors:  C Putonti; Bm Pettitt; Jg Reid; Y Fofanov
Journal:  Online J Bioinform       Date:  2007-01-01

8.  An algorithm for identifying novel targets of transcription factor families: application to hypoxia-inducible factor 1 targets.

Authors:  Yue Jiang; Bojan Cukic; Donald A Adjeroh; Heath D Skinner; Jie Lin; Qingxi J Shen; Bing-Hua Jiang
Journal:  Cancer Inform       Date:  2009-03-04

9.  New scoring schema for finding motifs in DNA Sequences.

Authors:  Fatemeh Zare-Mirakabad; Hayedeh Ahrabian; Mehdei Sadeghi; Abbas Nowzari-Dalini; Bahram Goliaei
Journal:  BMC Bioinformatics       Date:  2009-03-20       Impact factor: 3.169

10.  Compo: composite motif discovery using discrete models.

Authors:  Geir Kjetil Sandve; Osman Abul; Finn Drabløs
Journal:  BMC Bioinformatics       Date:  2008-12-08       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.