| Literature DB >> 36148013 |
Arumay Pal1, Pinak Chakrabarti2, Sucharita Dey3.
Abstract
Protein-DNA interactions play a crucial role in gene expression and regulation. Identifying the DNA binding surface of proteins has long been a challenge-in comparison to protein-protein interactions, limited progress has been made in the development of efficient DNA binding site prediction and protein-DNA docking methods. Here we present ProDFace, a web tool that characterizes the binding region of a protein-DNA complex based on amino acid propensity, hydrogen bond (HB) donor capacity (number of solvent accessible HB donor groups), sequence conservation at the interface core and rim region, and geometry. The program takes as input the structure of a protein-DNA complex in PDB (Protein Data Bank) format, and outputs various physicochemical and geometric parameters of the interface, as well as conservation of the interface residues in the protein component. Values are provided for the whole interface, and after dissecting it into core and rim regions. Details of water mediated HBs between protein and DNA, potential HB donor groups present at the binding surface of protein, and conserved interface residues are also provided as downloadable text files. These parameters can be useful in evaluating and validating protein-DNA docking solutions, structures derived from simulation as well as solutions from the available prediction tools, and facilitate the development of more efficient prediction methods. The web-tool is freely available at structbioinfo.iitj.ac.in/resources/bioinfo/pd_interface .Entities:
Keywords: Protein-DNA complexes; docking; hydrogen bond; interface core and rim; protein-DNA interactions; sequence conservation
Year: 2022 PMID: 36148013 PMCID: PMC9486321 DOI: 10.3389/fmolb.2022.978310
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
FIGURE 1Workflow of ProDFace. The user uploads a query structure in PDB format. The protein-DNA binding region i.e. the interface and the rest of the surface region are identified for the whole complex, as well as for the protein and the DNA components. Structural, geometrical and physico-chemical properties of the interface region are calculated. Also, the interface hydration is analyzed. The interface is dissected into core and rim regions depending on their solvent accessibility (Guharoy and Chakrabarti, 2005) and the sequence conservation within these areas is calculated. All these properties have been described previously (Dey et al., 2012). The conserved residues are displayed in a separate plot.
Parameters for the interface in the human NF-kappaB p52 homodimer-DNA complex (PDB code, 1a3q).
| Parameters observed | Protein component | DNA component | Total |
|---|---|---|---|
| Interface area (Å2) | 1538 | 1481 | 3020 |
| Interface area/Surface area | 0.05 | 0.35 | 0.08 |
| Number of atoms | 161 | 163 | 324 |
| Number of residues (or nucleotides) | 50 | 22 | 72 |
| Fraction of non-polar atoms | 0.6 | 0.44 | 0.52 |
| Non-polar interface area (Å2) | 766 | 469 | 1235 |
| Fraction of fully buried atoms | 0.19 | 0.23 | 0.21 |
| Residue propensity score | −0.03 | — | — |
| Local density | 33 | 52 | — |
| Number of Potential hydrogen-bond Donors (Dp) | 28 | — | — |
| Number of direct hydrogen-bonds (HBs) | — | — | 22 |
| Number of DNA-HOH-Protein H-bond interactions (bridging HB) | — | — | 14 |
| Number of HBs involving water and only protein (or DNA) | 27 | 15 | 42 |
| Number of interface waters | — | — | 57 |
| Number of bridging waters | — | — | 11 |
| Number of waters H-bonded with only protein or DNA | 21 | 14 | 35 |
| Number of interface waters H-bonded to other interface waters only | — | — | 8 |
| Number of polypeptide segments | 7,8 | — | 15 |
Sequence entropy data for the interface residues in a subunit of human NF-kappaB p52 (PDB code, 1a3q).
| Number of homologs | Number of conserved residues | Mean entropy of core | Mean entropy of rim | Mean sequence entropy of protein |
|---|---|---|---|---|
| 512 | 13 | 0.23 | 0.59 | 0.78 |
Homologous sequences with 30% or more sequence identity were used in multiple sequence alignment obtained from the HSSP database.
FIGURE 2Result page of ProDFace. (A) Plot showing relative positions of the residues from one subunit (whose sequence entropies are shown color-coded, and also as circles and squares, depending on their location in core and rim, respectively) and nucleotides (separated into base, sugar and phosphate, shown in three distinct colors, and also distinguished into core and rim) in the file 1a3q.pdb (human NF-kappaB p52 homodimer-DNA complex as input). (B) Cartoon representation of human NF-kappaB p52 homodimer-DNA complex (PDB code, 1a3q, only one subunit shown), approximately in the orientation used in Figure 2A for displaying interface residues and nucleotides. Interface Cα atoms are shown as spheres and colored according to sequence entropy. DNA backbone is shown in red (phosphate in sphere), whereas base and sugar are shown in blue and gray sticks, respectively. Figure made using pymol (http://www.pymol.org).