| Literature DB >> 24278298 |
Matthew Z Tien1, Austin G Meyer, Dariya K Sydykova, Stephanie J Spielman, Claus O Wilke.
Abstract
The relative solvent accessibility (RSA) of a residue in a protein measures the extent of burial or exposure of that residue in the 3D structure. RSA is frequently used to describe a protein's biophysical or evolutionary properties. To calculate RSA, a residue's solvent accessibility (ASA) needs to be normalized by a suitable reference value for the given amino acid; several normalization scales have previously been proposed. However, these scales do not provide tight upper bounds on ASA values frequently observed in empirical crystal structures. Instead, they underestimate the largest allowed ASA values, by up to 20%. As a result, many empirical crystal structures contain residues that seem to have RSA values in excess of one. Here, we derive a new normalization scale that does provide a tight upper bound on observed ASA values. We pursue two complementary strategies, one based on extensive analysis of empirical structures and one based on systematic enumeration of biophysically allowed tripeptides. Both approaches yield congruent results that consistently exceed published values. We conclude that previously published ASA normalization values were too small, primarily because the conformations that maximize ASA had not been correctly identified. As an application of our results, we show that empirically derived hydrophobicity scales are sensitive to accurate RSA calculation, and we derive new hydrophobicity scales that show increased correlation with experimentally measured scales.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24278298 PMCID: PMC3836772 DOI: 10.1371/journal.pone.0080635
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Frequency of residues with in empirical protein structures.
Nearly all amino acids, and notably R, D, K, G, and P, show when RSA is calculated using the normalization values of either Rose et al. [2] or Miller et al. [3].
Figure 2Ramachandran plot for alanine residues in our empirical data set.
Coordinates which correspond to RSA values are shown in red and are clearly concentrated around coordinates . We therefore propose that this region contains the maximally exposed conformation of alanine and should be used for calculating maximum ASA.
Figure 3Ramachandran plots for empirical and theoretical maximum ASA values of alanine.
(A) Empirical maximum ASA values for each by bin. All bins in the ALLOWED region are shown. (B) Theoretical maximum ASA values, as determined by computational modeling, shown for non-empty bins in (A). Both the empirical and the theoretical approach find the largest ASA values in the -helix region around . By contrast, the extended conformation leads to relatively low maximum ASA.
Proposed values for ASA normalization (in Å), compared to previously used scales defined by Rose et al. [2] and Miller et al. [3].
| Residue | Theoretical | Empirical | Miller | Rose |
| Alanine | 129.0 | 121.0 | 113.0 | 118.1 |
| Arginine | 274.0 | 265.0 | 241.0 | 256.0 |
| Asparagine | 195.0 | 187.0 | 158.0 | 165.5 |
| Aspartate | 193.0 | 187.0 | 151.0 | 158.7 |
| Cysteine | 167.0 | 148.0 | 140.0 | 146.1 |
| Glutamate | 223.0 | 214.0 | 183.0 | 186.2 |
| Glutamine | 225.0 | 214.0 | 189.0 | 193.2 |
| Glycine | 104.0 | 97.0 | 85.0 | 88.1 |
| Histidine | 224.0 | 216.0 | 194.0 | 202.5 |
| Isoleucine | 197.0 | 195.0 | 182.0 | 181.0 |
| Leucine | 201.0 | 191.0 | 180.0 | 193.1 |
| Lysine | 236.0 | 230.0 | 211.0 | 225.8 |
| Methionine | 224.0 | 203.0 | 204.0 | 203.4 |
| Phenylalanine | 240.0 | 228.0 | 218.0 | 222.8 |
| Proline | 159.0 | 154.0 | 143.0 | 146.8 |
| Serine | 155.0 | 143.0 | 122.0 | 129.8 |
| Threonine | 172.0 | 163.0 | 146.0 | 152.5 |
| Tryptophan | 285.0 | 264.0 | 259.0 | 266.3 |
| Tyrosine | 263.0 | 255.0 | 229.0 | 236.8 |
| Valine | 174.0 | 165.0 | 160.0 | 164.5 |
Both the theoretical and the empirical scale were evaluated for the ALLOWED region. Corresponding scales evaluated for other regions are provided in Table S1 in File S1.
Figure 4Difference between theoretically and empirically determined maximum ASA values for alanine, across by bins.
As the amount of data per bin increases, the difference between theoretical and empirical maximum ASA approaches zero, demonstrating that our two methods converged with increasing amounts of data. Furthermore, the difference between values is frequently close to zero, even when little data is available for a bin. This observation indicates that our theoretically derived maximum ASA values provide a tight bound on the empirically observed ones.
Absolute value of correlation coefficients between empirically derived and experimentally derived hydrophobicity scales.
| Empirical scale | |||||
| Experimental scale | Mean RSA (Rose) | Mean RSA (theor) | Mean RSA (emp) | 100% buried | 95% buried |
| Wolfenden | 0.614 | 0.681 | 0.681 |
| 0.774 |
| Kyte & Doolittle | 0.841 | 0.879 | 0.881 |
| 0.948 |
| Radzicka & Wolfenden | 0.852 | 0.855 | 0.851 | 0.844 |
|
| Moon & Fleming | 0.704 | 0.748 | 0.752 | 0.678 |
|
| Fauchere & Pliska | 0.904 | 0.906 |
| 0.734 | 0.878 |
| Wimley | 0.463 | 0.464 |
| 0.323 | 0.417 |
| MacCallum | 0.27 | 0.265 | 0.285 | 0.116 | 0.227 |
The largest significant correlation in each row is highlighted in bold.
Mean RSA of residues in protein structures, as calculated by Rose et al. [2].
Mean RSA of residues in protein structures, as given in column 2 of Table S3 in File S1.
Mean RSA of residues in protein structures, as given in column 3 of Table S3 in File S1.
Fraction of 100% buried residues, as given in column 4 of Table S3 in File S1.
Fraction of 95% buried residues, as given in column 5 of Table S3 in File S1.
Transfer energy from vapor to water [26].
Hybrid scale based on transfer energy from vapor to water and on the percentages of 95% and 100% buried residues in protein structures [27].
Transfer energy from cyclohexane to water [29].
between the folded and unfolded state of a mutated membrane-inserted protein, outer membrane phospholipase A [31].
Transfer energy calculated from molecular-dynamic simulations of side-chain analogs within a bilayer [30].
Transfer energy between octanol and water [28].
Transfer energy of pentapeptides between octanol and water [32].
Correlation not statistically significant; all other correlations are significant at .