Literature DB >> 2666673

Consensus methods for finding and ranking DNA binding sites. Application to Escherichia coli promoters.

M C O'Neill1.   

Abstract

There have been many different approaches employed to define the "consensus" sequence of various DNA binding sites and to use the definition obtained to locate and rank members of a given sequence family. The analysis presented here enlists two of these approaches, each in modified form, to develop a highly efficient search protocol for Escherichia coli promoters and to provide a relative ranking of these sites showing good agreement with in vitro measurements of promoter strength. Schneider et al. have applied Shannon's index of information content to evaluate the significance of each position within the consensus of a family of aligned sequences. In a formal sense, this index is only applicable to a group of sequences, providing at each position a negative entropy value between zero (random) and two bits (total conservation of a single base) for sequences in which all bases are equally represented. A method for evaluating how well an individual sequence conforms to the information content pattern of the consensus is described. A function is derived, by analogy to the information content of the sequence family, for application to individual sequences. Since this function is a measure of conformity, it can be used in a search protocol to identify new members of the family represented by the consensus. A protocol for locating E. coli promoters is presented. The Berg-von Hippel statistical-mechanical function is also tested in a similar application. While the information content function provides a superior search protocol, the Berg-von Hippel function, when scaled at each position by the information content, does well at ranking promoters according to their strength as measured in vitro.

Entities:  

Mesh:

Substances:

Year:  1989        PMID: 2666673     DOI: 10.1016/0022-2836(89)90256-8

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  23 in total

1.  Novel group of virulence activators within the AraC family that are not restricted to upstream binding sites.

Authors:  G P Munson; L G Holcomb; J R Scott
Journal:  Infect Immun       Date:  2001-01       Impact factor: 3.441

2.  An assessment of neural network and statistical approaches for prediction of E. coli promoter sites.

Authors:  P B Horton; M Kanehisa
Journal:  Nucleic Acids Res       Date:  1992-08-25       Impact factor: 16.971

3.  Training back-propagation neural networks to define and detect DNA-binding sites.

Authors:  M C O'Neill
Journal:  Nucleic Acids Res       Date:  1991-01-25       Impact factor: 16.971

4.  Molecular characterization of mutations affecting expression level and growth rate-dependent regulation of the Escherichia coli zwf gene.

Authors:  D L Rowley; W P Fawcett; R E Wolf
Journal:  J Bacteriol       Date:  1992-01       Impact factor: 3.490

5.  Molecular characterization of the Escherichia coli K-12 zwf gene encoding glucose 6-phosphate dehydrogenase.

Authors:  D L Rowley; R E Wolf
Journal:  J Bacteriol       Date:  1991-02       Impact factor: 3.490

6.  SQUIRREL: Sequence QUery, Information Retrieval and REporting Library. A program package for analyzing signals in nucleic acid sequences for the VAX.

Authors:  C J Gartmann; U Grob
Journal:  Nucleic Acids Res       Date:  1991-11-11       Impact factor: 16.971

7.  Isolation of DNA damage-inducible promoters in Escherichia coli: regulation of polB (dinA), dinG, and dinH by LexA repressor.

Authors:  L K Lewis; M E Jenkins; D W Mount
Journal:  J Bacteriol       Date:  1992-05       Impact factor: 3.490

8.  A general procedure for locating and analyzing protein-binding sequence motifs in nucleic acids.

Authors:  M C O'Neill
Journal:  Proc Natl Acad Sci U S A       Date:  1998-09-01       Impact factor: 11.205

9.  Promoter region of the Escherichia coli O7-specific lipopolysaccharide gene cluster: structural and functional characterization of an upstream untranslated mRNA sequence.

Authors:  C L Marolda; M A Valvano
Journal:  J Bacteriol       Date:  1998-06       Impact factor: 3.490

10.  Genetic analysis of the dTDP-rhamnose biosynthesis region of the Escherichia coli VW187 (O7:K1) rfb gene cluster: identification of functional homologs of rfbB and rfbA in the rff cluster and correct location of the rffE gene.

Authors:  C L Marolda; M A Valvano
Journal:  J Bacteriol       Date:  1995-10       Impact factor: 3.490

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.