| Literature DB >> 21151463 |
José R F Marques1, Rute R da Fonseca, Brett Drury, André Melo.
Abstract
Disulfide bonds provide an inexhaustible source of information on molecular evolution and biological specificity. In this work, we described the amino acid composition around disulfide bonds in a set of disulfide-rich proteins using appropriate descriptors, based on ANOVA (for all twenty natural amino acids or classes of amino acids clustered according to their chemical similarities) and Scheffé (for the disulfide-rich proteins superfamilies) statistics. We found that weakly hydrophilic and aromatic amino acids are quite abundant in the regions around disulfide bonds, contrary to aliphatic and hydrophobic amino acids. The density distributions (as a function of the distance to the center of the disulfide bonds) for all defined entities presented an overall unimodal behavior: the densities are null at short distances, have maxima at intermediate distances and decrease for long distances. In the end, the amino acid environment around the disulfide bonds was found to be different for different superfamilies, allowing the clustering of proteins in a biologically relevant way, suggesting that this type of chemical information might be used as a tool to assess the relationship between very divergent sets of disulfide-rich proteins.Entities:
Keywords: classification; disulfide bond; diversity; frequency; neighborhood
Mesh:
Substances:
Year: 2010 PMID: 21151463 PMCID: PMC3000107 DOI: 10.3390/ijms11114673
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
The amino acid classes assembled using various physicochemical criteria were clustered into two classification groups.
| ALA, ILE, GLY, PRO, VAL, LEU | aliphatic side chain | ||
| TYR, PHE, TRP | aromatic side chain (absorbs UV) | ||
| CYS, MET | side chain containing a sulfur atom | ||
| SER, THR, ASN, GLN | polar side chain | ||
| ASP, GLU, HIS, LYS, ARG | charged side chain | ||
| SER, THR, ASN, GLN, ASP, GLU, HIS, LYS, ARG | hydrophilic | ||
| ALA, VAL, LEU, ILE, MET, PHE, TRP | hydrophobic | ||
| GLY, CYS, TYR | weakly hydrophilic | ||
| PRO | weakly hydrophobic |
Set of superfamilies under study. The statistical analyses included all the disulfide bonds identified in this protein set. The values in the last three columns were calculated as sums over all the PDB structures of each superfamily (see PDB ids in Table 1 in Supplementary Material).
| Crisp | Small proteins | Crisp domain-like | α | 5.3% | 6 | 54 | 1367 |
| Cystine-Knot | Small proteins | Cystine-Knot cykotines | β | 3.7% | 13 | 112 | 3131 |
| Defensin-like | Small proteins | Defensin-like | β | 7.4% | 15 | 47 | 730 |
| EGF-Laminin | Small proteins | Knottins | β | 6.4% | 27 | 121 | 2253 |
| Omega toxins | Small proteins | Knottins | β | 8.9% | 28 | 88 | 992 |
| Plant lectins | Small proteins | Knottins | β | 9.9% | 8 | 100 | 1045 |
| Small snake toxins | Small proteins | Snake toxins-like | β | 6.5% | 40 | 209 | 3279 |
| Scorpion-like toxins | Small proteins | Knottins | β | 7.9% | 70 | 247 | 3303 |
| BBI | Small proteins | Knottins | β | 9.6% | 5 | 33 | 371 |
| BPTI-like | Small proteins | BPTI-like | α + β | 5.1% | 12 | 42 | 814 |
| Kringle-like | Small proteins | Kringle-like | β | 3.7% | 12 | 53 | 1771 |
| Thioredoxin-like | Alpha and beta proteins | Thioredoxin | α/β | 0.8% | 43 | 66 | 10616 |
| Most frequent motif | Small proteins | Knottins | β | [6.7%, 7.3%] | - | - | - |
Calculated by equation 11;
Confidence interval, at a 95% level, for the disulfide bonds propensity of SDP structures; EGF: Epidermal growth factor; BBI: Bowman Birk Inhibitors; BPTI: basic pancreatic trypsin inhibitor.
Figure 1.Relative frequencies around disulfide bonds of (A) the natural amino acids, (B) classes in classification group 1, and (C) classes in classification group 2. The black columns represent the relative frequencies for the sample. The other columns represent the relative frequencies for each superfamily. The values of the statistical parameter F associated with the one-way ANOVA test are presented in parenthesis.
Figure 2.Densities for the twenty natural amino acids and the various classes in the different spherical shells. The following color notation is adopted: green means a density 50% smaller than a uniform density; yellow represents a density between 50% and 150% this density; and orange corresponds to 150% larger than the same reference.
Figure 3.Projected 3D-Cartesian representation of the twelve superfamilies under study, inferred from the Scheffé distances calculated on the original 20-dimensional space of the natural amino acid.
Figure 4.Representative amino acid disulfide environments (top: all side-chains; bottom: only the side-chains of the cysteines involved in disulfide-bonds are depicted). (A) thioredoxin-like (PDB id 1bed); (B) SDP’s superfamilies (plant defensin, PDB id 1q9b). A cutoff 10 Å around the disulfide bonds was considered.