Literature DB >> 33230664

Site-Specific Amino Acid Distributions Follow a Universal Shape.

Mackenzie M Johnson1,2, Claus O Wilke3.   

Abstract

In many applications of evolutionary inference, a model of protein evolution needs to be fitted to the amino acid variation at individual sites in a multiple sequence alignment. Most existing models fall into one of two extremes: Either they provide a coarse-grained description that lacks biophysical realism (e.g., dN/dS models), or they require a large number of parameters to be fitted (e.g., mutation-selection models). Here, we ask whether a middle ground is possible: Can we obtain a realistic description of site-specific amino acid frequencies while severely restricting the number of free parameters in the model? We show that a distribution with a single free parameter can accurately capture the variation in amino acid frequency at most sites in an alignment, as long as we are willing to restrict our analysis to predicting amino acid frequencies by rank rather than by amino acid identity. This result holds equally well both in alignments of empirical protein sequences and of sequences evolved under a biophysically realistic all-atom force field. Our analysis reveals a near universal shape of the frequency distributions of amino acids. This insight has the potential to lead to new models of evolution that have both increased realism and a limited number of free parameters.

Entities:  

Keywords:  Amino-acid distributions; Evolutionary modeling; Protein site variability

Year:  2020        PMID: 33230664      PMCID: PMC7717668          DOI: 10.1007/s00239-020-09976-8

Source DB:  PubMed          Journal:  J Mol Evol        ISSN: 0022-2844            Impact factor:   2.395


  40 in total

1.  Prediction of site-specific amino acid distributions and limits of divergent evolutionary changes in protein sequences.

Authors:  Markus Porto; H Eduardo Roman; Michele Vendruscolo; Ugo Bastolla
Journal:  Mol Biol Evol       Date:  2004-11-10       Impact factor: 16.240

2.  Not so different after all: a comparison of methods for detecting amino acid sites under selection.

Authors:  Sergei L Kosakovsky Pond; Simon D W Frost
Journal:  Mol Biol Evol       Date:  2005-02-09       Impact factor: 16.240

3.  Mutation-selection models of codon substitution and their use to estimate selective strengths on codon usage.

Authors:  Ziheng Yang; Rasmus Nielsen
Journal:  Mol Biol Evol       Date:  2008-01-03       Impact factor: 16.240

4.  The Shannon information entropy of protein sequences.

Authors:  B J Strait; T G Dewey
Journal:  Biophys J       Date:  1996-07       Impact factor: 4.033

5.  The Influence of Protein Stability on Sequence Evolution: Applications to Phylogenetic Inference.

Authors:  Ugo Bastolla; Miguel Arenas
Journal:  Methods Mol Biol       Date:  2019

6.  Relationship between protein thermodynamic constraints and variation of evolutionary rates among sites.

Authors:  Julian Echave; Eleisha L Jackson; Claus O Wilke
Journal:  Phys Biol       Date:  2015-03-19       Impact factor: 2.583

7.  A codon-based model of nucleotide substitution for protein-coding DNA sequences.

Authors:  N Goldman; Z Yang
Journal:  Mol Biol Evol       Date:  1994-09       Impact factor: 16.240

8.  The population genetics of dN/dS.

Authors:  Sergey Kryazhimskiy; Joshua B Plotkin
Journal:  PLoS Genet       Date:  2008-12-12       Impact factor: 5.917

9.  Relative evolutionary rate inference in HyPhy with LEISR.

Authors:  Stephanie J Spielman; Sergei L Kosakovsky Pond
Journal:  PeerJ       Date:  2018-02-05       Impact factor: 2.984

10.  Statistical methods for detecting molecular adaptation.

Authors: 
Journal:  Trends Ecol Evol       Date:  2000-12-01       Impact factor: 17.712

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.