Literature DB >> 1731067

Expectation maximization algorithm for identifying protein-binding sites with variable lengths from unaligned DNA fragments.

L R Cardon1, G D Stormo.   

Abstract

An Expectation Maximization algorithm for identification of DNA binding sites is presented. The approach predicts the location of binding regions while allowing variable length spacers within the sites. In addition to predicting the most likely spacer length for a set of DNA fragments, the method identifies individual sites that differ in spacer size. No alignment of DNA sequences is necessary. The method is illustrated by application to 231 Escherichia coli DNA fragments known to contain promoters with variable spacings between their consensus regions. Maximum-likelihood tests of the differences between the spacing classes indicate that the consensus regions of the spacing classes are not distinct. Further tests suggest that several positions within the spacing region may contribute to promoter specificity.

Entities:  

Mesh:

Substances:

Year:  1992        PMID: 1731067     DOI: 10.1016/0022-2836(92)90723-w

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  34 in total

1.  Discovering regulatory elements in non-coding sequences by analysis of spaced dyads.

Authors:  J van Helden; A F Rios; J Collado-Vides
Journal:  Nucleic Acids Res       Date:  2000-04-15       Impact factor: 16.971

2.  PlantProm: a database of plant promoter sequences.

Authors:  Ilham A Shahmuradov; Alex J Gammerman; John M Hancock; Peter M Bramley; Victor V Solovyev
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

Review 3.  Computational approaches to identify promoters and cis-regulatory elements in plant genomes.

Authors:  Stephane Rombauts; Kobe Florquin; Magali Lescot; Kathleen Marchal; Pierre Rouzé; Yves van de Peer
Journal:  Plant Physiol       Date:  2003-07       Impact factor: 8.340

4.  Bipartite pattern discovery by entropy minimization-based multiple local alignment.

Authors:  Chengpeng Bi; Peter K Rogan
Journal:  Nucleic Acids Res       Date:  2004-09-23       Impact factor: 16.971

5.  Novel sequence-based method for identifying transcription factor binding sites in prokaryotic genomes.

Authors:  Gurmukh Sahota; Gary D Stormo
Journal:  Bioinformatics       Date:  2010-08-31       Impact factor: 6.937

Review 6.  Identifying regulatory elements in eukaryotic genomes.

Authors:  Leelavati Narlikar; Ivan Ovcharenko
Journal:  Brief Funct Genomic Proteomic       Date:  2009-06-04

Review 7.  Mechanisms and evolution of control logic in prokaryotic transcriptional regulation.

Authors:  Sacha A F T van Hijum; Marnix H Medema; Oscar P Kuipers
Journal:  Microbiol Mol Biol Rev       Date:  2009-09       Impact factor: 11.056

8.  A hidden Markov model that finds genes in E. coli DNA.

Authors:  A Krogh; I S Mian; D Haussler
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

9.  Hidden Markov models of biological primary sequence information.

Authors:  P Baldi; Y Chauvin; T Hunkapiller; M A McClure
Journal:  Proc Natl Acad Sci U S A       Date:  1994-02-01       Impact factor: 11.205

10.  DNA sequence-dependent deformability deduced from protein-DNA crystal complexes.

Authors:  W K Olson; A A Gorin; X J Lu; L M Hock; V B Zhurkin
Journal:  Proc Natl Acad Sci U S A       Date:  1998-09-15       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.