| Literature DB >> 22272083 |
Cheng-Fang Tsai1, Kuei-Jen Lee.
Abstract
In this paper, the second-order hydrophobic moment for fifteen globular proteins in 150 nonhomologous protein chains was performed in a comparative study involving two sets of hydrophobicity: one selected from the consensus scale and the other derived from the CHARMM partial atomic charges. These proteins were divided into three groups, based on their number of residues (N) and the asphericity (δ). Proteins in Group I were spherical and those in Groups II and III were prolate. The size of the proteins is represented by the mean radius of gyration (R(g) ), which follows the Flory scaling law, R(g) ∝ N(ν). The mean value of v was 0.35, which is similar to a polymer chain in a poor solvent. The spatial distributions of the second-order moment for each of the proteins, obtained from the two sets of hydrophobicity, were compared using the Pearson correlation coefficient; the results reveal that there is a strong correlation between the two data sets for each protein structure when the CHARMM partial atomic charges, |q(i)| ≥ 0.3, assigned for polar atoms, are used. The locations at which these distributions vanish and approach a negative value are at approximately 50% of the percentage of solvent accessibility, indicating that there is a transition point from hydrophobic interior to hydrophilic exterior in the proteins. This may suggest that there is a position for the proteins to determine the residues at exposed sites beyond this range.Entities:
Keywords: CHARMM partial atomic charges; Secondorder hydrophobic moment; hydrophobicity consensus scale
Mesh:
Substances:
Year: 2011 PMID: 22272083 PMCID: PMC3257080 DOI: 10.3390/ijms12128449
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1The asphericity (δ) as a function of protein indices.
List of proteins used in this work.
| Protein | Protein index | PDB code | No. residues |
|---|---|---|---|
| I | |||
| Photoactive yellow protein | 4 | 3PYP | 125 |
| Exonuclease III | 125 | 1AKO | 268 |
| Carboxypeptidase A | 39 | 2CTC | 307 |
| Cellulase cela | 96 | 1CEM | 363 |
| Myrosinase | 55 | 1E70 | 499 |
| II | |||
| Lysozyme | 2 | 3LTZ | 129 |
| Dienoyl-coa isomerase | 57 | 1DCIA | 275 |
| Sulfate-Binding protein | 139 | 1SBP | 309 |
| Maltodextrin-binding protein | 122 | 3MBP | 370 |
| Alpha-amylase | 118 | 1JAE | 470 |
| III | |||
| Cytochrome C′ | 108 | 1CPQ | 129 |
| Molybdate transport protein | 135 | 1AMF | 231 |
| L-arabinose-binding protein | 81 | 1ABE | 305 |
| Phosphate-binding protein | 133 | 2ABH | 321 |
| Leucine aminopeptidase | 83 | 1LAM | 484 |
Values of ASA (Å2) and the consensus scale for each amino-acid residue.
| Residue | ASA | Consensus scale ( |
|---|---|---|
| Ala | 116.5 | 0.25 |
| Arg | 249.7 | −1.80 |
| Asn | 163.2 | −0.64 |
| Asp | 156.2 | −0.72 |
| Cys | 140.2 | 0.04 |
| Gln | 188.9 | −0.69 |
| Glu | 183.8 | −0.62 |
| Gly | 84.9 | 0.16 |
| His | 204.3 | −0.40 |
| Ile | 183.6 | 0.73 |
| Leu | 193.5 | 0.53 |
| Lys | 211.3 | −1.10 |
| Met | 205.9 | 0.26 |
| Phe | 223.7 | 0.61 |
| Pro | 148.3 | −0.07 |
| Ser | 124.8 | −0.26 |
| Thr | 144.1 | −0.18 |
| Trp | 266.3 | 0.37 |
| Tyr | 236.0 | 0.02 |
| Val | 158.0 | 0.54 |
Solvent-accessible surface area (ASA) of each residue type X in the tripeptide (Gly-X-Gly) at the extended state, calculated using ASC [34];
The hydrophobicity consensus scale of Eisenburg [5].
The results of molecular shape and size.
| PDB code | 02 × | 103× | ||||||
|---|---|---|---|---|---|---|---|---|
| I | ||||||||
| 3PYP | 41.11 | 61.82 | 71.06 | 2.33 | −4.33 | 16.04 | 18.85 | 13.19 |
| 1AKO | 78.40 | 96.64 | 129.61 | 2.18 | 3.05 | 20.92 | 25.46 | 17.45 |
| 2CTC | 84.32 | 100.38 | 141.79 | 2.48 | 5.30 | 21.49 | 26.63 | 18.07 |
| 1CEM | 93.46 | 116.77 | 143.17 | 1.49 | 0.39 | 22.93 | 26.76 | 18.80 |
| 1E70 | 110.18 | 169.06 | 176.62 | 1.91 | −5.02 | 26.42 | 29.72 | 21.35 |
| II | ||||||||
| 3LTZ | 35.84 | 48.75 | 101.55 | 10.49 | 57.61 | 14.54 | 22.53 | 13.64 |
| 1DCIA | 72.74 | 96.28 | 223.74 | 12.84 | 83.38 | 20.56 | 33.45 | 19.82 |
| 1SBP | 56.12 | 105.62 | 206.44 | 12.79 | 51.45 | 20.17 | 32.13 | 19.21 |
| 3MBP | 74.47 | 115.72 | 242.46 | 12.28 | 65.10 | 21.81 | 34.82 | 20.80 |
| 1JAE | 88.54 | 105.91 | 312.60 | 18.13 | 151.02 | 22.05 | 39.53 | 22.52 |
| III | ||||||||
| 1CPQ | 26.97 | 39.53 | 178.83 | 35.41 | 410.89 | 12.89 | 29.90 | 15.66 |
| 1AMF | 35.21 | 75.05 | 208.33 | 24.30 | 188.55 | 16.60 | 32.27 | 17.85 |
| 1ABE | 53.21 | 89.04 | 256.31 | 22.25 | 183.73 | 18.86 | 35.80 | 19.96 |
| 2ABH | 65.13 | 68.09 | 262.66 | 24.53 | 242.78 | 18.25 | 36.24 | 19.90 |
| 1LAM | 44.08 | 141.19 | 388.73 | 28.90 | 211.84 | 21.46 | 44.09 | 23.94 |
The eigenvalues of gyration tensor, obtained by diagonalization of Equation (1). The values were sorted according to increasing magnitude;
The asphericity (δ), calculated according to Equation (3);
The shape parameter (S), calculated according to Equation (4);
Semiaxes (in Å) [32], calculated with a = [5(α1 +α2)/2]1/2 and b = (5α3)1/2;
The mean radius of gyrations (in Å), calculated according Equation (6).
Figure 2Plots of log (R) versus log (N), where R is the mean radius of gyration and N is the number of residues, for proteins in Group I (filled squares), Group II (open circles), and Group III (filled circles). The solid line (R = 0.99), the dashed line (R = 0.98), and the dotted line (R = 0.98) are the regression lines for proteins in Groups I, II, and III, respectively.
Figure 3The distance distribution W (d) as a function of d (Å) for the protein structures, as shown in Table 1: (a) Proteins in Group I; (b) proteins in Group II; and (c) proteins in Group III. The indices, as shown in the legends of these figures, correspond to those shown in column 2 of Table 1.
The details of the spatial distribution of the second-order moment for 3PYP.
| 0.00 | 13 | 51.14 | 47.0 | 88 | 14.18 |
| 0.40 | 17 | 60.63 | 49.0 | 90 | 7.79 |
| 0.60 | 18 | 53.46 | 51.0 | 92 | 1.01 |
| 1.00 | 21 | 55.51 | 53.0 | 98 | −8.13 |
| 3.00 | 24 | 60.15 | 55.0 | 99 | −11.95 |
| 5.00 | 30 | 62.87 | 57.0 | 102 | −12.64 |
| 7.00 | 30 | 62.87 | 59.0 | 103 | −17.51 |
| 9.00 | 33 | 61.17 | 61.0 | 104 | −16.08 |
| 11.0 | 39 | 64.58 | 63.0 | 106 | −12.87 |
| 13.0 | 42 | 66.37 | 65.0 | 109 | −15.63 |
| 15.0 | 44 | 72.18 | 67.0 | 111 | −19.49 |
| 17.0 | 48 | 62.50 | 69.0 | 111 | −19.49 |
| 19.0 | 50 | 60.74 | 71.0 | 114 | −19.64 |
| 21.0 | 54 | 46.35 | 73.0 | 114 | −19.64 |
| 23.0 | 57 | 45.69 | 75.0 | 114 | −19.64 |
| 25.0 | 63 | 41.04 | 77.0 | 117 | −19.73 |
| 27.0 | 64 | 41.57 | 79.0 | 119 | −22.16 |
| 29.0 | 66 | 33.47 | 81.0 | 120 | −20.88 |
| 31.0 | 68 | 32.77 | 83.0 | 121 | −18.87 |
| 33.0 | 70 | 32.65 | 85.0 | 121 | −18.87 |
| 35.0 | 73 | 37.21 | 87.0 | 123 | −24.80 |
| 37.0 | 76 | 30.78 | 89.0 | 124 | −28.33 |
| 39.0 | 77 | 30.14 | 91.0 | 125 | −29.56 |
| 41.0 | 81 | 25.23 | 93.0 | 125 | −29.56 |
| 43.0 | 82 | 26.02 | 95.0 | 125 | −29.56 |
| 45.0 | 86 | 17.47 | 97.0 | 125 | −29.56 |
Number of residues collected in the space defined by p (%);
The second-order hydrophobic moment per residue, calculated according Equation (10).
Figure 4The spatial distribution of the second-order hydrophobic moment per residue, H2(d), for the protein 3PYP, plotted as a function of the percentage of solvent accessibility, p (%). The second-order hydrophobic moment was calculated using h′ selected from the consensus scales (Table 2).
The Pearson correlation coefficient between two data sets: the spatial distribution of the second-order hydrophobic moment calculated with the consensus scale and that calculated with Equation (8) derived from the CHARMM partial atomic charges.
| PDB code | Correlation coefficient (R) | ||
|---|---|---|---|
| | | | | | | |
| I | |||
| 3PYP | 0.89 | 0.89 | 0.94 |
| 1AKO | 0.78 | 0.77 | 0.87 |
| 2CTC | 0.86 | 0.87 | 0.91 |
| 1CEM | 0.85 | 0.86 | 0.92 |
| 1E70 | 0.88 | 0.87 | 0.91 |
| II | |||
| 3LTZ | 0.63 | 0.63 | 0.85 |
| 1DCIA | 0.78 | 0.79 | 0.82 |
| 1SBP | 0.70 | 0.70 | 0.86 |
| 3MBP | 0.59 | 0.59 | 0.82 |
| 1JAE | 0.86 | 0.87 | 0.91 |
| III | |||
| 1CPQ | 0.81 | 0.81 | 0.96 |
| 1AMF | 0.70 | 0.70 | 0.88 |
| 1ABE | 0.72 | 0.74 | 0.81 |
| 2ABH | 0.62 | 0.62 | 0.77 |
| 1LAM | 0.87 | 0.87 | 0.90 |
The Pearson correlation coefficient, calculated according to Equation (11);
The values of the coefficient calculated using the CHARMM partial atomic charges, |q| ≥ 0.25, assigned for polar atoms in the protein structures;
The values of the coefficient calculated using the CHARMM partial atomic charges, |q| ≥ 0.27, assigned for polar atoms in the protein structures;
The values of the coefficient calculated using the CHARMM partial atomic charges, |q| ≥ 0.3, assigned for polar atoms in the protein structures.
Figure 5Comparative plots of the spatial distributions of the second-order hydrophobic moment per residue, H2(d), versus the percentage of solvent accessibility, p (%), for (a) protein IE70, (b) protein 1JAE, and (c) protein 1LAM. The solid line and dashed line represent, respectively, the second-order hydrophobic moments calculated using h′ selected from the consensus scales (Table 2) and the h′ calculated according to Equation (8) with the CHARMM partial atomic charges, |q| ≥ 0.3, assigned for polar atoms. The Pearson correlation coefficients between the spatial distributions of the second-order hydrophobic moments (solid and dashed lines) are 0.91, 0.91, and 0.90 for proteins IE70, 1JAE, and 1LAM, respectively.
The results of the spatial distributions of the second-order hydrophobic moment at p_ (%) or at d̄_ (Å).
| | | | | |||
|---|---|---|---|---|
| I | ||||
| 3PYP | 49 | 53 | 11.45 | 11.79 |
| 1AKO | 59 | 53 | 15.79 | 15.45 |
| 2CTC | 47 | 49 | 16.14 | 16.24 |
| 1CEM | 41 | 49 | 16.80 | 17.10 |
| 1E70 | 33 | 51 | 18.52 | 19.57 |
| II | ||||
| 3LTZ | 39 | 55 | 11.56 | 12.16 |
| 1DCIA | 49 | 57 | 16.66 | 17.33 |
| 1SBP | 61 | 61 | 17.43 | 17.43 |
| 3MBP | 51 | 61 | 18.72 | 19.07 |
| 1JAE | 49 | 53 | 20.56 | 20.65 |
| III | ||||
| 1CPQ | 53 | 81 | 13.92 | 14.70 |
| 1AMF | 49 | 59 | 15.49 | 15.77 |
| 1ABE | 41 | 71 | 17.23 | 18.39 |
| 2ABH | 37 | 61 | 17.00 | 17.95 |
| 1LAM | 43 | 55 | 20.67 | 21.20 |
The space defined by the percentage of solvent accessibility, p_(%), at which the spatial distribution of the second-order moment vanishes and approaches a negative value;
The average distance from the protein centroid to the centroids of the amino-acid residues in the same space defined by (%) p_ as shown in footnote (a);
The values of p_(%), calculated using h′ [Equation (9)] derived from the CHARMM partial atomic charges, |q| ≥ 0.3, assigned for polar atoms in the protein structures.
The values of p_ (%), calculated using the consensus scales.
The values of d̄_ (Å), calculated using h′ [ Equation (9)] derived from the CHARMM partial atomic charges, |q| ≥ 0.3, assigned for polar atoms in the protein structures.
The values of d̄_ (Å), calculated using the consensus scales.