Literature DB >> 12419259

Probabilistic code for DNA recognition by proteins of the EGR family.

Panayiotis V Benos1, Alan S Lapedes, Gary D Stormo.   

Abstract

A recognition code for protein-DNA interactions would allow for the prediction of binding sites based on protein sequence, and the identification of binding proteins for specific DNA targets. Crystallographic studies of protein-DNA complexes showed that a simple, deterministic recognition code does not exist. Here, we present a probabilistic recognition code (P-code) that assigns energies to all possible base-pair-amino acid interactions for the early growth response factor (EGR) family of zinc-finger transcription factors. The specific energy values are determined by a maximum likelihood method using examples from in vitro randomisation experiments (namely, SELEX and phage display) reported in the literature. The accuracy of the model is tested in several ways, including the ability to predict in vivo binding sites of EGR proteins and other non-EGR zinc-finger proteins, and the correlation between predicted and measured binding affinities of various EGR proteins to several different DNA sites. We also show that this model improves significantly upon the prediction capabilities of previous qualitative and quantitative models. The probabilistic code we develop uses information about the interacting positions between the protein and DNA, but we show that such information is not necessary, although it reduces the number of parameters to be determined. We also employ the assumption that the total binding energy is the sum of the energies of the individual contacts, but we describe how that assumption can be relaxed at the cost of additional parameters.

Mesh:

Substances:

Year:  2002        PMID: 12419259     DOI: 10.1016/s0022-2836(02)00917-8

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  62 in total

1.  Additivity in protein-DNA interactions: how good an approximation is it?

Authors:  Panayiotis V Benos; Martha L Bulyk; Gary D Stormo
Journal:  Nucleic Acids Res       Date:  2002-10-15       Impact factor: 16.971

2.  Quantitative modeling of DNA-protein interactions: effects of amino acid substitutions on binding specificity of the Mnt repressor.

Authors:  Tsz-Kwong Man; Joshua SungWoo Yang; Gary D Stormo
Journal:  Nucleic Acids Res       Date:  2004-08-02       Impact factor: 16.971

Review 3.  Determining the specificity of protein-DNA interactions.

Authors:  Gary D Stormo; Yue Zhao
Journal:  Nat Rev Genet       Date:  2010-09-28       Impact factor: 53.242

4.  Percolation of the phd repressor-operator interface.

Authors:  Xueyan Zhao; Roy David Magnuson
Journal:  J Bacteriol       Date:  2005-03       Impact factor: 3.490

5.  Linking DNA-binding proteins to their recognition sequences by using protein microarrays.

Authors:  Su-Wen Ho; Ghil Jona; Christina T L Chen; Mark Johnston; Michael Snyder
Journal:  Proc Natl Acad Sci U S A       Date:  2006-06-19       Impact factor: 11.205

6.  Connecting protein structure with predictions of regulatory sites.

Authors:  Alexandre V Morozov; Eric D Siggia
Journal:  Proc Natl Acad Sci U S A       Date:  2007-04-16       Impact factor: 11.205

7.  Predicting DNA recognition by Cys2His2 zinc finger proteins.

Authors:  Anton V Persikov; Robert Osada; Mona Singh
Journal:  Bioinformatics       Date:  2008-11-13       Impact factor: 6.937

8.  Analysis of homeodomain specificities allows the family-wide prediction of preferred recognition sites.

Authors:  Marcus B Noyes; Ryan G Christensen; Atsuya Wakabayashi; Gary D Stormo; Michael H Brodsky; Scot A Wolfe
Journal:  Cell       Date:  2008-06-27       Impact factor: 41.582

9.  Context-dependent DNA recognition code for C2H2 zinc-finger transcription factors.

Authors:  Jiajian Liu; Gary D Stormo
Journal:  Bioinformatics       Date:  2008-06-27       Impact factor: 6.937

10.  De novo prediction of DNA-binding specificities for Cys2His2 zinc finger proteins.

Authors:  Anton V Persikov; Mona Singh
Journal:  Nucleic Acids Res       Date:  2013-10-03       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.