Literature DB >> 33960831

Parameter Identifiability for a Profile Mixture Model of Protein Evolution.

Samaneh Yourdkhani1, Elizabeth S Allman1, John A Rhodes1.   

Abstract

A profile mixture (PM) model is a model of protein evolution, describing sequence data in which sites are assumed to follow many related substitution processes on a single evolutionary tree. The processes depend, in part, on different amino acid distributions, or profiles, varying over sites in aligned sequences. A fundamental question for any stochastic model, which must be answered positively to justify model-based inference, is whether the parameters are identifiable from the probability distribution they determine. Here, using algebraic methods, we show that a PM model has identifiable parameters under circumstances in which it is likely to be used for empirical analyses. In particular, for a tree relating 9 or more taxa, both the tree topology and all numerical parameters are generically identifiable when the number of profiles is less than 74.

Entities:  

Keywords:  parameter identifiability; phylogenetic trees; profile mixture model

Mesh:

Substances:

Year:  2021        PMID: 33960831      PMCID: PMC8219185          DOI: 10.1089/cmb.2020.0315

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.549


  22 in total

1.  A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach.

Authors:  S Whelan; N Goldman
Journal:  Mol Biol Evol       Date:  2001-05       Impact factor: 16.240

2.  Identifiability of parameters in MCMC Bayesian inference of phylogeny.

Authors:  Bruce Rannala
Journal:  Syst Biol       Date:  2002-10       Impact factor: 15.683

3.  A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process.

Authors:  Nicolas Lartillot; Hervé Philippe
Journal:  Mol Biol Evol       Date:  2004-03-10       Impact factor: 16.240

4.  Estimating trees from filtered data: identifiability of models for morphological phylogenetics.

Authors:  Elizabeth S Allman; Mark T Holder; John A Rhodes
Journal:  J Theor Biol       Date:  2009-12-11       Impact factor: 2.691

5.  PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating.

Authors:  Nicolas Lartillot; Thomas Lepage; Samuel Blanquart
Journal:  Bioinformatics       Date:  2009-06-17       Impact factor: 6.937

6.  On Rogers' proof of identifiability for the GTR + Γ + I model.

Authors:  Juanjuan Chai; Elizabeth A Housworth
Journal:  Syst Biol       Date:  2011-03-28       Impact factor: 15.683

7.  PhyloBayes MPI: phylogenetic reconstruction with infinite mixtures of profiles in a parallel environment.

Authors:  Nicolas Lartillot; Nicolas Rodrigue; Daniel Stubbs; Jacques Richer
Journal:  Syst Biol       Date:  2013-04-05       Impact factor: 15.683

8.  Phylogenetic mixtures and linear invariants for equal input models.

Authors:  Marta Casanellas; Mike Steel
Journal:  J Math Biol       Date:  2016-09-07       Impact factor: 2.259

9.  Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods.

Authors:  Z Yang
Journal:  J Mol Evol       Date:  1994-09       Impact factor: 2.395

10.  A class frequency mixture model that adjusts for site-specific amino acid frequencies and improves inference of protein phylogeny.

Authors:  Huai-Chun Wang; Karen Li; Edward Susko; Andrew J Roger
Journal:  BMC Evol Biol       Date:  2008-12-16       Impact factor: 3.260

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.