| Literature DB >> 24716445 |
Tsun-Tsao Huang, María Laura del Valle Marcos, Jenn-Kang Hwang, Julian Echave1.
Abstract
BACKGROUND: Protein sites evolve at different rates due to functional and biophysical constraints. It is usually considered that the main structural determinant of a site's rate of evolution is its Relative Solvent Accessibility (RSA). However, a recent comparative study has shown that the main structural determinant is the site's Local Packing Density (LPD). LPD is related with dynamical flexibility, which has also been shown to correlate with sequence variability. Our purpose is to investigate the mechanism that connects a site's LPD with its rate of evolution.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24716445 PMCID: PMC4101840 DOI: 10.1186/1471-2148-14-78
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Model comparison: protein-by-protein analysis
| pfANM | Stress | 190508 | 0.97 | 206 | -0.54 | -0.33 | ||
| | Flexibility | 198662 | 0.03 | 7 | 0.45 | -0.06 | ||
| ANM | Stress | 194154 | 0.98 | 209 | -0.52 | -0.39 | ||
| Flexibility | 207258 | 0.02 | 4 | 0.35 | -0.04 |
NOTE: Potential is the ENM potential, Model is either the Stress Model or Flexibility Model, y is in all cases the site-specific rate of evolution inferred using Rate4Site, x is the independent variable of each model. AIC is the Akaike Information Criterion summed over all proteins.
Model comparison: global analysis
| pfANM | Stress | 191424 | 1.00 | -0.55 | -0.32 | ||
| | Flexibility | 199645 | 0.00 | 0.47 | -0.04 | ||
| ANM | Stress | 194589 | 1.00 | -0.52 | -0.40 | ||
| Flexibility | 207993 | 0.00 | 0.36 | -0.02 |
NOTE: Results of global fits for all sites of the dataset. Potential is the ENM potential. Model is either the Stress Model or Flexibility Model. y is in all cases the site-specific rate of evolution inferred using Rate4Site, x is the independent variable of each model. AIC is the Akaike Information Criterion, that quantifies the goodness of fit of a model (the lower the better). w(AIC) is the AIC weight of evidence for each compared model (same ENM). R is Pearson’s correlation coefficient between variables y and x. pR is the partial correlation coefficient when controlling for the x variable of the contrasting model (same ENM). All variables were z-score normalized for each protein before fitting.
Figure 1Site-specific evolutionary rates vs. flexibility and stress. Top panels: parameter-free Anisotropic Network Model (pfANM). Bottom panels: Anisotropic Network Model. Black circles: site-specific rates inferred from the sequence alignments using Rate4Site. Vertical segments: 99% confidence interval of the circles. Solid line: Stress Model predictions. Dashed line: Flexibility Model predictions. For the rate vs. flexibility plots (left panels), sites were evenly split between 20 bins according to their Mean Square Fluctuation (MSF) dynamical flexibility measure, and then MSF and evolutionary rates were averaged within bins. For the rate vs. stress plots (right panels), sites were evenly split into bins according to their Mean Local mutational Stress (MLmS), and average MLmS and rates were obtained for each bin. All variables were z-normalized for each protein.