Literature DB >> 3612791

Selection of DNA binding sites by regulatory proteins. Statistical-mechanical theory and application to operators and promoters.

O G Berg, P H von Hippel.   

Abstract

We present a statistical-mechanical selection theory for the sequence analysis of a set of specific DNA regulatory sites that makes it possible to predict the relationship between individual base-pair choices in the site and specific activity (affinity). The theory is based on the assumption that specific DNA sequences have been selected to conform to some requirement for protein binding (or activity), and that all sequences that can fulfil this requirement are equally likely to occur. In most cases, the number of specific DNA sequences that are known for a certain DNA-binding protein is very small, and we discuss in detail the small-sample uncertainties that this leads to. When applied to the binding sites for cro repressor in phage lambda, the theory can predict, from the sequence statistics alone, their rank order binding affinities in reasonable agreement with measured values. However, the statistical uncertainty generated by such a small sample (only 6 sites known) limits the result to order-of-magnitude comparisons. When applied to the much larger sample of Escherichia coli promoter sequences, the theory predicts the correlation between in vitro activity (k2KB values) and homology score (closeness to the consensus sequence) observed by Mulligan et al. (1984). The analysis of base-pair frequencies in the promoter sample is consistent with the assumption that base-pairs at different positions in the sites contribute independently to the specific activity, except in a few marginal cases that are discussed. When the promoter sites are ordered according to predicted activities, they seem to conform to the Gaussian distribution that results from a requirement for maximal sequence variability within the constraint of providing a certain average activity. The theory allows us to compare the number of specific sites with a certain activity to the number that would be expected from random occurrence in the genome. While strong promoters are "overspecified", in the sense that their probability of random occurrence is very low, random sequences with weak promoter-like properties are expected to occur in very large numbers. This leads to the conclusion that functional specificity is based on other properties in addition to primary sequence recognition; some possibilities are discussed. Finally, we show that the sequence information, as defined by Schneider et al. (1986), can be used directly (at least in the case of equilibrium binding sites) to estimate the number of protein molecules that are specifically bound at random "pseudosites" in the genome.(ABSTRACT TRUNCATED AT 400 WORDS)

Entities:  

Mesh:

Substances:

Year:  1987        PMID: 3612791     DOI: 10.1016/0022-2836(87)90354-8

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  261 in total

1.  In silico detection of control signals: mRNA 3'-end-processing sequences in diverse species.

Authors:  J H Graber; C R Cantor; S C Mohr; T F Smith
Journal:  Proc Natl Acad Sci U S A       Date:  1999-11-23       Impact factor: 11.205

2.  ACTIVITY: a database on DNA/RNA sites activity adapted to apply sequence-activity relationships from one system to another.

Authors:  J V Ponomarenko; D P Furman; A S Frolov; N L Podkolodny; G V Orlova; M P Ponomarenko; N A Kolchanov; A Sarai
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

3.  Novel group of virulence activators within the AraC family that are not restricted to upstream binding sites.

Authors:  G P Munson; L G Holcomb; J R Scott
Journal:  Infect Immun       Date:  2001-01       Impact factor: 3.441

4.  Analysis of the spacing between the two palindromes of activation sequence-1 with respect to binding to different TGA factors and transcriptional activation potential.

Authors:  Stefanie Krawczyk; Corinna Thurow; Ricarda Niggeweg; Christiane Gatz
Journal:  Nucleic Acids Res       Date:  2002-02-01       Impact factor: 16.971

5.  Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome.

Authors:  Benjamin P Berman; Yutaka Nibu; Barret D Pfeiffer; Pavel Tomancak; Susan E Celniker; Michael Levine; Gerald M Rubin; Michael B Eisen
Journal:  Proc Natl Acad Sci U S A       Date:  2002-01-22       Impact factor: 11.205

6.  Non-independence of Mnt repressor-operator interaction determined by a new quantitative multiple fluorescence relative affinity (QuMFRA) assay.

Authors:  T K Man; G D Stormo
Journal:  Nucleic Acids Res       Date:  2001-06-15       Impact factor: 16.971

7.  Homotypic regulatory clusters in Drosophila.

Authors:  Alexander P Lifanov; Vsevolod J Makeev; Anna G Nazina; Dmitri A Papatsenko
Journal:  Genome Res       Date:  2003-04       Impact factor: 9.043

8.  Identification of the binding sites of regulatory proteins in bacterial genomes.

Authors:  Hao Li; Virgil Rhodius; Carol Gross; Eric D Siggia
Journal:  Proc Natl Acad Sci U S A       Date:  2002-08-14       Impact factor: 11.205

9.  Energetic funnel facilitates facilitated diffusion.

Authors:  Massimo Cencini; Simone Pigolotti
Journal:  Nucleic Acids Res       Date:  2018-01-25       Impact factor: 16.971

10.  Consensus DNA site for the Escherichia coli catabolite gene activator protein (CAP): CAP exhibits a 450-fold higher affinity for the consensus DNA site than for the E. coli lac DNA site.

Authors:  R H Ebright; Y W Ebright; A Gunasekera
Journal:  Nucleic Acids Res       Date:  1989-12-25       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.