| Literature DB >> 17526523 |
Clifford E Felder1, Jaime Prilusky, Israel Silman, Joel L Sussman.
Abstract
An Internet server at http://bip.weizmann.ac.il/dipol calculates the net charge, dipole moment and mean radius of any 3D protein structure or its constituent peptide chains, and displays the dipole vector superimposed on a ribbon backbone of the protein. The server can also display the angle between the dipole and a selected list of amino acid residues in the protein. When the net charges and dipole moments of approximately 12 000 non-homologous PDB biological units (PISCES set), and their unique chains of length 50 residues or longer, were examined, the great majority of both charges and dipoles fell into a very narrow range of values, with long extended tails containing a few extreme outliers. In general, there is no obvious relation between a protein's charge or dipole moment and its structure or function, so that its electrostatic properties are highly specific to the particular protein, except that the majority of chains with very large positive charges or dipoles bind to ribosomes or interact with nucleic acids.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17526523 PMCID: PMC1933167 DOI: 10.1093/nar/gkm307
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Benchmark comparison of calculated dipole moments (Debye) with and without hydrogens, using Parse 3 charges
| PDB_id or name | Including hydrogens | Without hydrogens | Angle | Measured | Description |
|---|---|---|---|---|---|
| Snake AChE( | 1647 | 1615 | 3 | 1000( | Snake AChE homology model |
| 1p0i | 1718 | 1636 | 1 | Human butyrylcholinesterase | |
| Glutactin | 105 | 108 | 8 | ||
| 5cha | 522 | 494 | 6 | 540( | Bovine α-chymotrypsin |
| 2gct | 453 | 492 | 7 | 465( | Bovine γ-chymotrypsin |
| 2hhb | 101 | 48 | 30 | 45( | Human deoxy-hemoglobin |
| 1hho | 210 | 196 | 4 | 271( | Human oxy-hemoglobin |
aAngle in degrees between the calculated dipole vectors with and without hydrogens.
Database statistics for 14,053 unique chains
| Minimum | Maximum | Mean | Median | SD | Skewness | Kurtosis | |
|---|---|---|---|---|---|---|---|
| Number of atoms | 242 | 12773 | 1761 | 1491 | 1171 | 1.94 | 6.59 |
| Number of residues | 50 | 1610 | 224 | 188 | 149 | 1.92 | 6.51 |
| Mean radius (Å) | 59 | 1601 | 261 | 228 | 149 | 1.92 | 6.78 |
| Net charge | –76 | 38 | –3.19 | –3 | 7.68 | –0.83 | 4.38 |
| Dipole moment (Debye) | 5 | 5563 | 543 | 436 | 420 | 3.09 | 18.29 |
| Charge per atom | –0.038 | 0.036 | –0.001 | –0.002 | 0.006 | 0.55 | 4.46 |
| Dipole per atom | 0.034 | 4.53 | 0.370 | 0.305 | 0.285 | 3.46 | 24,66 |
aThe symmetry of the distribution. The skewness of a normal Gaussian distribution is 0.
bThe sharpness or flatness of the distribution. The kurtosis of a normal Gaussian distribution is 0.
Database statistics for 11,981 biological unit proteins
| Minimum | Maximum | Mean | Median | SD | Skewness | Kurtosis | |
|---|---|---|---|---|---|---|---|
| Number of atoms | 321 | 62495 | 3004 | 2994 | 3431 | 5.19 | 48.67 |
| Number of residues | 50 | 8290 | 383 | 265 | 441 | 5.27 | 50.71 |
| Mean radius (Å) | 59 | 5946 | 400 | 298 | 380 | 4.37 | 33.88 |
| Net charge | –246 | 446 | –5.68 | –4 | 17.27 | 4.03 | 141.42 |
| Dipole moment (Debye) | 4 | 34460 | 639 | 452 | 977 | 14.36 | 333.26 |
| Charge per atom | –0.038 | 0.031 | –0.002 | –0.002 | 0.005 | 0.28 | 3.90 |
| Dipole per atom | 0.000 | 4.486 | 0.300 | 0.246 | 0.248 | 2.82 | 20.16 |
aThe symmetry of the distribution. The skewness of a normal Gaussian distribution is 0.
bThe sharpness or flatness of the distribution. The kurtosis of a normal Gaussian distribution is 0.
Figure 1.Histograms of net charges and dipole moments for unique protein chains in the PDB. (A) Histogram of the distribution of net charges for 14 053 unique chains, prepared using KalaidaGraph (23). The distribution for 11 981 biological units is very similar. (B) Histogram of the distribution of dipole moments for 14 053 unique chains. For clarity, the histogram had to be truncated at 4000 D, even though the distribution continues to almost 5600 D. Only 14 chains have dipoles larger than 4000 D. The distribution for 11 981 biological units is very similar. (C) Histogram of the distribution of the net charge per atom for unique chains. The distribution for biological units is very similar. (D) Histogram of the distribution of net dipole moment per atom for unique chains. For clarity, the histogram had to be truncated at 2 D, even though the distribution continues almost up to 25 D. The distribution for biological units is very similar.
Figure 2.Comparison of dipole moments calculated using (I) GRASP (2), displaying the dipole vector in green and –12 kT/e (red) and +12 kT/e (blue) isopotential surfaces; (II) Protein Dipole Moment Server (PDMS), showing the dipole moment vector in black, with a dark blue arrowhead, and the mass moment vector without an arrowhead. The same molecular orientations and modified Parse 3 partial atom charges were used, without hydrogens. (A) 1gd2: BZIP transcription factor PAP1 bound to DNA; 2 chains, 1051 atoms, 129 residues, mean radius 300 Å, charge +10, dipole moments 3328 D (I, GRASP) and 3032 (II, PDMS), with an angle of 2° between their vectors. This pattern is typical of several DNA-binding proteins, including 2dgc, yeast GCN4 Leu zipper, and 1dh3, mouse CREB BZIP-CRE complex, all of which are highly elongated and project a large region of negative potential into the region binding the positively charged DNA. (B) 1gd2, Chain E: 530 atoms, 65 residues, mean radius 212 Å, charge +4, dipoles 1747 D (I, GRASP) and 1591 D (II, PDMS) D, making an angle of 1°. (C) 1ynj: TAQ RNA polymerase. 6 chains, 24 309 atoms, 2821 residues, mean radius 2821 Å, charge –52, dipoles 6532 D (I, GRASP) and 6541 D (II, PDMS), making an angle of 3°. The putative DNA binding region is near the lower left. (D) 1ynj Chain D. 9602 atoms, 1238 residues, mean radius 1924 Å, charge –13, dipoles 8311 D (I, GRASP) and 9414 D (II, PDMS), making an angle of 6°. (E) 1flc: Hemagglutinin-esterase fusion glycoprotein of influenza virus. 6 chains, 13 700 atoms, 1767 residues, mean radius 1718 Å, charge –21, dipoles 7725 D (I, GRASP) and 7971 D (II, PDMS), making an angle of 1°. The acetylsialic acid binding domain is at the lower left. (F) 1flc Chain A. 3339 atoms, 427 residues, mean radius 598 Å, charge +5, dipoles 411 D (I, GRASP) and 411 D (II, PDMS), making an angle of 1°.