| Literature DB >> 17727706 |
Hong-Fang Ji1, De-Xin Kong, Liang Shen, Ling-Ling Chen, Bin-Guang Ma, Hong-Yu Zhang.
Abstract
BACKGROUND: Extant life depends greatly on the binding of small molecules (such as ligands) with macromolecules (such as proteins), and one ligand can bind multiple proteins. However, little is known about the global patterns of ligand-protein mapping.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17727706 PMCID: PMC2375006 DOI: 10.1186/gb-2007-8-8-r176
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Power-law behaviors of ligand-protein binding. The number of ligands (N) decays with an increase in the number (L) of (a) domains and (b) folds that bind the ligand and follows the equation N = aL-. The figure illustrates that a few ligands cover tens of protein domains or folds, while most ligands bind only one domain or fold.
Descriptors of chemical space consisting of sc-PDB-derived ligands and ACD-SC-derived ordinary molecules and corresponding loadings (Varimax normalized) for the first two factors*
| Descriptors | Characterization | Factor loadings | Software | |
| 1 | 2 | |||
| AREA | Total molecular surface area | 0.103 | Sybyl | |
| PSA | Polar molecular surface area | 0.255 | ||
| PV | Polar molecular volume | 0.501 | 0.741 | |
| VOL | Total molecular volume | 0.062 | ||
| MOLWEIGHT | Molecular weight | 0.206 | ||
| Acceptor | H-bond acceptor counts | 0.464 | ||
| Donor | H-bond donor counts | 0.376 | ||
| BondCount | Total bond counts | 0.060 | ||
| Chiral | Counts of chiral center | 0.367 | 0.617 | |
| Hydrophobe | Hydrophobic fragment counts | 0.767 | -0.417 | |
| RingCount | Ring counts | 0.686 | -0.069 | |
| RotBonds | Number of rotatable bonds | 0.630 | 0.428 | |
| HeavyAtoms | Number of non-H atoms | 0.149 | ||
| Carbons | Number of carbons atoms | -0.228 | Perl | |
| Oxygens | Number of oxygen atoms | 0.425 | 0.793 | |
| Nitrogens | Number of nitrogen atoms | 0.475 | 0.324 | |
| Sulfurs | Number of sulfur atoms | 0.141 | -0.009 | |
| Phosphorus | Number of phosphorus atoms | 0.162 | 0.617 | |
| Halides | Number of halide atoms | 0.076 | -0.170 | |
| DoubleBonds | Number of double bonds | 0.527 | 0.378 | |
| TripleBonds | Number of triple bonds | -0.009 | -0.109 | |
| RadOfGyration | Radius of gyration | 0.888 | 0.004 | Cerius 2 |
| ShadowXY | Surface area projections | 0.076 | ||
| ShadowXZ | 0.053 | |||
| ShadowYZ | 0.877 | 0.093 | ||
| ShadowXYfrac | -0.610 | -0.027 | ||
| ShadowXZfrac | -0.421 | -0.002 | ||
| ShadowYZfrac | -0.289 | 0.039 | ||
| Shadownu | 0.268 | -0.117 | ||
| ShadowXlength | 0.849 | -0.008 | ||
| ShadowYlength | 0.798 | 0.075 | ||
| ShadowZlength | 0.756 | 0.059 | ||
| Density | Density | -0.089 | 0.354 | |
| PMImag | Principal moment of inertia | 0.819 | 0.134 | |
| AlogP | Log of the partition coefficient using Ghose and Crippen's method. | 0.425 | -0.727 | |
| AlogP98 | Log of the partition coefficient, atom-type value, using latest parameters. | 0.365 | ||
| Fh2o | Desolvation free energy for water | -0.479 | -0.762 | |
| Foct | Desolvation free energy for octanol | -0.578 | -0.617 | |
| LogP | Log of the partition coefficient. | -0.022 | ||
| MR | Molar refractivity using Hopfinger's method. | 0.835 | -0.110 | |
| MolRef | Molar refractivity using linear additive method based on AlogP atom types | -0.033 | ||
| JX | Balaban indices | -0.567 | 0.027 | |
| Kappa1 | Kappa topological indices | 0.189 | ||
| Kappa2 | 0.026 | |||
| Kappa3 | 0.691 | 0.033 | ||
| Kappa1AM | 0.220 | |||
| Kappa2AM | 0.050 | |||
| Kappa3AM | 0.630 | 0.046 | ||
| PHI | Molecular flexibility index | 0.800 | 0.078 | |
| SC0 | Subgraph topological counts | 0.147 | ||
| SC1 | 0.125 | |||
| SC2 | 0.186 | |||
| SC3P | 0.141 | |||
| SC3C | 0.749 | 0.389 | ||
| SC3CH | 0.016 | -0.086 | ||
| CHI0 | Kier and Hall Chi connectivity indices | 0.190 | ||
| CHI1 | 0.115 | |||
| CHI2 | 0.210 | |||
| CHI3P | 0.136 | |||
| CHI3C | 0.655 | 0.484 | ||
| CHI3CH | 0.015 | -0.087 | ||
| CHIV0 | 0.076 | |||
| CHIV1 | 0.120 | |||
| CHIV2 | 0.137 | |||
| CHIV3P | 0.838 | 0.096 | ||
| CHIV3C | 0.476 | 0.148 | ||
| CHIV3CH | 0.016 | -0.088 | ||
| Wiener | Wiener topological index | 0.854 | 0.186 | |
| logZ | Logarithm of Hosoya topological index | -0.220 | -0.131 | |
| Zagreb | Zagreb topological index | 0.162 | ||
*The first factor explains 52.8% of the variance and the second explains 12.7%. Factors with high loadings (>0.9 for first factors and >0.8 for second factors) are shown in bold.
Figure 2Chemical space consisting of ligands (derived from sc-PDB) and ordinary molecules (randomly selected from ACD-SC), defined by the first two factors derived from 70 descriptors. The figure illustrates that ligands (in red) occupy the relatively upper part of the space, while ordinary molecules (in blue) occupy the relatively lower part, which means that it is the second factor that discriminates ligands from ordinary molecules. From the loadings of the second factor, it can be deduced that polar molecular surface area, H-bond donor counts, H-bond acceptor counts and partition coefficient are likely responsible for the differences between ligands and ordinary molecules, which is supported by the different average values of the four kinds of parameters for ligands and ordinary molecules (Table 2).
Figure 3Chemical space consisting of sc-PDB-derived ligands, defined by the first two factors derived from 70 descriptors. The figure illustrates that super ligands (shared by 3+ folds; in blue), especially those that are shared by 10+ folds (in red), concentrate in the relatively upper part of the space (the area of the circle is directly proportional to the number of folds that bind the ligand), which suggests that polar molecular surface area, H-bond donor counts, H-bond acceptor counts and partition coefficient are responsible for the strong protein-binding potential of the super ligands, which is supported by the different average values of the four kinds of parameters for ligands with different protein-binding potentials (Table 2).
Average values of descriptors characterizing polar molecular surface area, H-bond donors, H-bond acceptors, partition coefficient and rotatable bonds for ordinary molecules, ligands and ligands with different protein-binding potentials
| Descriptor* | Small molecules† | Average values | Standard error | Number of molecules |
| PSA | Molecules | 111.81 | 1.79 | 2,184 |
| Ligands | 230.59 | 2.79 | 2,176 | |
| Ligands (≤ 3) | 225.71 | 2.80 | 2,072 | |
| Ligands (4-9) | 304.28 | 15.67 | 80 | |
| Ligands (≥ 10) | 406.83 | 33.10 | 24 | |
| Donor | Molecules | 1.51 | 0.04 | 2,184 |
| Ligands | 3.97 | 0.07 | 2,176 | |
| Ligands (≤ 3) | 3.87 | 0.07 | 2,072 | |
| Ligands (4-9) | 5.24 | 0.43 | 80 | |
| Ligands (≥ 10) | 8.21 | 0.90 | 24 | |
| Acceptor | Molecules | 3.35 | 0.05 | 2,184 |
| Ligands | 5.87 | 0.09 | 2,176 | |
| Ligands (≤ 3) | 5.74 | 0.09 | 2,072 | |
| Ligands (4-9) | 7.69 | 0.53 | 80 | |
| Ligands (≥ 10) | 11.00 | 1.18 | 24 | |
| AlogP98 | Molecules | 2.87 | 0.05 | 2,184 |
| Ligands | 0.81 | 0.06 | 2,176 | |
| Ligands (≤ 3) | 0.92 | 0.06 | 2,072 | |
| Ligands (4-9) | -1.33 | 0.25 | 80 | |
| Ligands (≥ 10) | -1.80 | 0.38 | 24 | |
| LogP | Molecules | 0.77 | 0.08 | 2,184 |
| Ligands | -2.27 | 0.10 | 2,176 | |
| Ligands (≤ 3) | -2.10 | 0.10 | 2,072 | |
| Ligands (4-9) | -5.06 | 0.50 | 80 | |
| Ligands (≥ 10) | -8.11 | 0.96 | 24 | |
| RotBond | Molecules | 4.88 | 0.09 | 2,184 |
| Ligands | 7.49 | 0.11 | 2,176 | |
| Ligands (≤ 3) | 7.43 | 0.11 | 2072 | |
| Ligands (4-9) | 8.00 | 0.50 | 80 | |
| Ligands (≥ 10) | 11.33 | 1.19 | 24 |
* PSA, polar molecular surface area; Donor, H-bond donor counts; Acceptor, H-bond acceptor counts; AlogP98, log of the partition coefficient, atom-type value, using latest parameters; LogP, log of the partition coefficient; RotBond, number of rotatable bonds. †Molecules, ACD-SC-derived ordinary molecules; Ligands, sc-PDB-derived ligands; Ligands (≤ 3), ligands covering ≤ 3 folds; Ligands (4-9), ligands covering 4-9 folds; Ligands (≥ 10), ligands covering ≥ 10 folds.
The most prevalent bio-ligands in the fold universe (shared by 15+ folds) and the most common folds used by host proteins of each ligand
| Ligands | Number of folds | Most common folds |
| Adenosine-5'-triphosphate (ATP) | 35 | P-loop containing nucleoside triphosphate hydrolases (c.37) |
| Adenosine-5'-diphosphate (ADP) | 31 | P-loop containing nucleoside triphosphate hydrolases (c.37) |
| Guanosine-5'-diphosphate (GDP) | 29 | P-loop containing nucleoside triphosphate hydrolases (c.37) |
| Nicotinamide-adenine-dinucleotide (NAD) | 27 | NAD(P)-binding Rossmann-fold domains (c.2) |
| Flavin-adenine dinucleotide (FAD) | 21 | FAD/NAD(P)-binding domain (c.3) |
| Dihydro-nicotinamide-adenine-dinucleotide phosphate (NDP) | 18 | NAD(P)-binding Rossmann-fold domains (c.2) |
| Nicotinamide-adenine-dinucleotide phosphate (NAP) | 16 | NAD(P)-binding Rossmann-fold domains (c.2) |
| Flavin mononucleotide (FMN) | 16 | |
| Adenosine monophosphate (AMP) | 15 | Adenine nucleotide alpha hydrolase-like (c.26) |