Literature DB >> 30298399

The Influence of Protein Stability on Sequence Evolution: Applications to Phylogenetic Inference.

Ugo Bastolla1, Miguel Arenas2.   

Abstract

Phylogenetic inference from protein data is traditionally based on empirical substitution models of evolution that assume that protein sites evolve independently of each other and under the same substitution process. However, it is well known that the structural properties of a protein site in the native state affect its evolution, in particular the sequence entropy and the substitution rate. Starting from the seminal proposal by Halpern and Bruno, where structural properties are incorporated in the evolutionary model through site-specific amino acid frequencies, several models have been developed to tackle the influence of protein structure on sequence evolution. Here we describe stability-constrained substitution (SCS) models that explicitly consider the stability of the native state against both unfolded and misfolded states. One of them, the mean-field model, provides an independent sites approximation that can be readily incorporated in maximum likelihood methods of phylogenetic inference, including ancestral sequence reconstruction. Next, we describe its validation with simulated and real proteins and its limitations and advantages with respect to empirical models that lack site specificity. We finally provide guidelines and recommendations to analyze protein data accounting for stability constraints, including computer simulations and inferences of protein evolution based on maximum likelihood. Some practical examples are included to illustrate these procedures.

Keywords:  Ancestral protein reconstruction; Mean-field substitution model; Protein evolution; Protein folding stability; Stability-constrained substitution models

Mesh:

Substances:

Year:  2019        PMID: 30298399     DOI: 10.1007/978-1-4939-8736-8_11

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  3 in total

1.  Site-Specific Amino Acid Distributions Follow a Universal Shape.

Authors:  Mackenzie M Johnson; Claus O Wilke
Journal:  J Mol Evol       Date:  2020-11-24       Impact factor: 2.395

2.  ProteinEvolverABC: Coestimation of Recombination and Substitution Rates in Protein Sequences by approximate Bayesian computation.

Authors:  Miguel Arenas
Journal:  Bioinformatics       Date:  2021-08-27       Impact factor: 6.937

3.  HIV Protease and Integrase Empirical Substitution Models of Evolution: Protein-Specific Models Outperform Generalist Models.

Authors:  Roberto Del Amparo; Miguel Arenas
Journal:  Genes (Basel)       Date:  2021-12-27       Impact factor: 4.096

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.