Literature DB >> 2919167

Identifying protein-binding sites from unaligned DNA fragments.

G D Stormo1, G W Hartzell.   

Abstract

The ability to determine important features within DNA sequences from the sequences alone is becoming essential as large-scale sequencing projects are being undertaken. We present a method that can be applied to the problem of identifying the recognition pattern for a DNA-binding protein given only a collection of sequenced DNA fragments, each known to contain somewhere within it a binding site for that protein. Information about the position or orientation of the binding sites within those fragments is not needed. The method compares the "information content" of a large number of possible binding site alignments to arrive at a matrix representation of the binding site pattern. The specificity of the protein is represented as a matrix, rather than a consensus sequence, allowing patterns that are typical of regulatory protein-binding sites to be identified. The reliability of the method improves as the number of sequences increases, but the time required increases only linearly with the number of sequences. An example, using known cAMP receptor protein-binding sites, illustrates the method.

Entities:  

Mesh:

Substances:

Year:  1989        PMID: 2919167      PMCID: PMC286650          DOI: 10.1073/pnas.86.4.1183

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  19 in total

1.  Molecular cloning and expression of the biodegradative threonine dehydratase gene (tdc) of Escherichia coli K12.

Authors:  T J Goss; P Datta
Journal:  Mol Gen Genet       Date:  1985

Review 2.  Computer methods for analyzing sequence recognition of nucleic acids.

Authors:  G D Stormo
Journal:  Annu Rev Biophys Biophys Chem       Date:  1988

3.  Information content of binding sites on nucleotide sequences.

Authors:  T D Schneider; G D Stormo; L Gold; A Ehrenfeucht
Journal:  J Mol Biol       Date:  1986-04-05       Impact factor: 5.469

4.  Rigorous pattern-recognition methods for DNA sequences. Analysis of promoter sequences from Escherichia coli.

Authors:  D J Galas; M Eggert; M S Waterman
Journal:  J Mol Biol       Date:  1985-11-05       Impact factor: 5.469

Review 5.  Cyclic AMP receptor protein: role in transcription activation.

Authors:  B de Crombrugghe; S Busby; H Buc
Journal:  Science       Date:  1984-05-25       Impact factor: 47.728

6.  Computer methods to locate signals in nucleic acid sequences.

Authors:  R Staden
Journal:  Nucleic Acids Res       Date:  1984-01-11       Impact factor: 16.971

7.  Escherichia coli promoter sequences predict in vitro RNA polymerase selectivity.

Authors:  M E Mulligan; D K Hawley; R Entriken; W R McClure
Journal:  Nucleic Acids Res       Date:  1984-01-11       Impact factor: 16.971

Review 8.  Compilation and analysis of Escherichia coli promoter DNA sequences.

Authors:  D K Hawley; W R McClure
Journal:  Nucleic Acids Res       Date:  1983-04-25       Impact factor: 16.971

9.  A perfectly symmetric lac operator binds the lac repressor very tightly.

Authors:  J R Sadler; H Sasmor; J L Betz
Journal:  Proc Natl Acad Sci U S A       Date:  1983-11       Impact factor: 11.205

10.  The catabolite-sensitive promoter for the chloramphenicol acetyl transferase gene is preceded by two binding sites for the catabolite gene activator protein.

Authors:  S F Le Grice; H Matzura; R Marcoli; S Iida; T A Bickle
Journal:  J Bacteriol       Date:  1982-04       Impact factor: 3.490

View more
  127 in total

1.  Discovering regulatory elements in non-coding sequences by analysis of spaced dyads.

Authors:  J van Helden; A F Rios; J Collado-Vides
Journal:  Nucleic Acids Res       Date:  2000-04-15       Impact factor: 16.971

2.  Splicing enhancement in the yeast rp51b intron.

Authors:  D Libri; A Lescure; M Rosbash
Journal:  RNA       Date:  2000-03       Impact factor: 4.942

3.  Reevaluation of the determinants of tyrosine sulfation.

Authors:  H B Nicholas; S S Chan; G L Rosenquist
Journal:  Endocrine       Date:  1999-12       Impact factor: 3.633

4.  Building a dictionary for genomes: identification of presumptive regulatory sites by statistical analysis.

Authors:  H J Bussemaker; H Li; E D Siggia
Journal:  Proc Natl Acad Sci U S A       Date:  2000-08-29       Impact factor: 11.205

5.  The evolution of DNA regulatory regions for proteo-gamma bacteria by interspecies comparisons.

Authors:  Nikolaus Rajewsky; Nicholas D Socci; Martin Zapotocky; Eric D Siggia
Journal:  Genome Res       Date:  2002-02       Impact factor: 9.043

6.  Discovering common stem-loop motifs in unaligned RNA sequences.

Authors:  J Gorodkin; S L Stricklin; G D Stormo
Journal:  Nucleic Acids Res       Date:  2001-05-15       Impact factor: 16.971

7.  Additivity in protein-DNA interactions: how good an approximation is it?

Authors:  Panayiotis V Benos; Martha L Bulyk; Gary D Stormo
Journal:  Nucleic Acids Res       Date:  2002-10-15       Impact factor: 16.971

8.  Finding important sites in protein sequences.

Authors:  Peter J Bickel; Katherina J Kechris; Philip C Spector; Gary J Wedemayer; Alexander N Glazer
Journal:  Proc Natl Acad Sci U S A       Date:  2002-11-04       Impact factor: 11.205

9.  Identification of the binding sites of regulatory proteins in bacterial genomes.

Authors:  Hao Li; Virgil Rhodius; Carol Gross; Eric D Siggia
Journal:  Proc Natl Acad Sci U S A       Date:  2002-08-14       Impact factor: 11.205

10.  Consensus DNA site for the Escherichia coli catabolite gene activator protein (CAP): CAP exhibits a 450-fold higher affinity for the consensus DNA site than for the E. coli lac DNA site.

Authors:  R H Ebright; Y W Ebright; A Gunasekera
Journal:  Nucleic Acids Res       Date:  1989-12-25       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.