Literature DB >> 22508720

Fitting nonstationary general-time-reversible models to obtain edge-lengths and frequencies for the barry-hartigan model.

Liwen Zou1, Edward Susko, Chris Field, Andrew J Roger.   

Abstract

Among models of nucleotide evolution, the Barry and Hartigan (BH) model (also known as the General Markov Model) is very flexible as it allows separate arbitrary substitution matrices along edges. For a given tree, the estimates of the BH model are a set of joint probability matrices, each giving the pairwise frequencies of nucleotides at the ends of the edge. We have previously shown that, due to an identifiability problem, these cannot be expected to consistently estimate the actual pairwise frequencies. A further consequence is that internal node frequency estimates are likely to be incorrect. Here we define a nonstationary GTR model for each edge that we refer to as the NSGTR model. We fit the NSGTR model by minimizing the sums of squares between the estimates of transition probabilities under the NSGTR model and the estimates provided by a fitted BH model. This NSGTR model provides estimates that avoid the identifiability difficulties of the BH model while closely fitting it. With the best-fitting NSGTR estimates, we are able to get interpretable frequency vectors at internal nodes as well as edge length estimates that are otherwise not yielded by the BH model. These edge lengths are interpretable as the expected number of substitutions along an edge for the model. We also show that for a nonstationary continuous-time model these are not the same as the edge length parameters for conventional substitution matrices that are output by nonstationary model phylogenetic estimation programs such as nhPhyML.

Entities:  

Mesh:

Year:  2012        PMID: 22508720      PMCID: PMC3888140          DOI: 10.1093/sysbio/sys046

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  19 in total

1.  Compositional bias may affect both DNA-based and protein-based phylogenetic reconstructions.

Authors:  P G Foster; D A Hickey
Journal:  J Mol Evol       Date:  1999-03       Impact factor: 2.395

2.  Modeling compositional heterogeneity.

Authors:  Peter G Foster
Journal:  Syst Biol       Date:  2004-06       Impact factor: 15.683

3.  Estimation of the number of nucleotide substitutions when there are strong transition-transversion and G+C-content biases.

Authors:  K Tamura
Journal:  Mol Biol Evol       Date:  1992-07       Impact factor: 16.240

4.  Saturation and base composition bias explain phylogenomic conflict in Plasmodium.

Authors:  Liliana M Dávalos; Susan L Perkins
Journal:  Genomics       Date:  2008-03-04       Impact factor: 5.736

5.  Fast, accurate and simulation-free stochastic mapping.

Authors:  Vladimir N Minin; Marc A Suchard
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-12-27       Impact factor: 6.237

6.  Full reconstruction of Markov models on evolutionary trees: identifiability and consistency.

Authors:  J T Chang
Journal:  Math Biosci       Date:  1996-10-01       Impact factor: 2.144

7.  Inferring pattern and process: maximum-likelihood implementation of a nonhomogeneous model of DNA sequence evolution for phylogenetic analysis.

Authors:  N Galtier; M Gouy
Journal:  Mol Biol Evol       Date:  1998-07       Impact factor: 16.240

8.  On the use of nucleic acid sequences to infer early branchings in the tree of life.

Authors:  Z Yang; D Roberts
Journal:  Mol Biol Evol       Date:  1995-05       Impact factor: 16.240

9.  Comparison of methods for estimating the nucleotide substitution matrix.

Authors:  Maribeth Oscamou; Daniel McDonald; Von Bing Yap; Gavin A Huttley; Manuel E Lladser; Rob Knight
Journal:  BMC Bioinformatics       Date:  2008-12-01       Impact factor: 3.169

10.  Estimation of phylogeny using a general Markov model.

Authors:  Vivek Jayaswal; Lars S Jermiin; John Robinson
Journal:  Evol Bioinform Online       Date:  2007-02-25       Impact factor: 1.625

View more
  8 in total

1.  Evaluation of Ancestral Sequence Reconstruction Methods to Infer Nonstationary Patterns of Nucleotide Substitution.

Authors:  Tomotaka Matsumoto; Hiroshi Akashi; Ziheng Yang
Journal:  Genetics       Date:  2015-05-06       Impact factor: 4.562

2.  Genetic distance for a general non-stationary markov substitution process.

Authors:  Benjamin D Kaehler; Von Bing Yap; Rongli Zhang; Gavin A Huttley
Journal:  Syst Biol       Date:  2014-12-09       Impact factor: 15.683

3.  A Nonstationary Markov Model Detects Directional Evolution in Hymenopteran Morphology.

Authors:  Seraina Klopfstein; Lars Vilhelmsen; Fredrik Ronquist
Journal:  Syst Biol       Date:  2015-08-12       Impact factor: 15.683

4.  Hot and Cold Spot Areas of Household Tuberculosis Transmission in Southern China: Effects of Socio-Economic Status and Mycobacterium tuberculosis Genotypes.

Authors:  Zhezhe Cui; Dingwen Lin; Virasakdi Chongsuvivatwong; Edward A Graviss; Angkana Chaiprasert; Prasit Palittapongarnpim; Mei Lin; Jing Ou; Jinming Zhao
Journal:  Int J Environ Res Public Health       Date:  2019-05-27       Impact factor: 3.390

5.  The Prevalence and Impact of Model Violations in Phylogenetic Analysis.

Authors:  Suha Naser-Khdour; Bui Quang Minh; Wenqi Zhang; Eric A Stone; Robert Lanfear
Journal:  Genome Biol Evol       Date:  2019-12-01       Impact factor: 3.416

6.  Interrogating Phylogenetic Discordance Resolves Deep Splits in the Rapid Radiation of Old World Fruit Bats (Chiroptera: Pteropodidae).

Authors:  Nicolas Nesi; Georgia Tsagkogeorga; Susan M Tsang; Violaine Nicolas; Aude Lalis; Annette T Scanlon; Silke A Riesle-Sbarbaro; Sigit Wiantoro; Alan T Hitch; Javier Juste; Corinna A Pinzari; Frank J Bonaccorso; Christopher M Todd; Burton K Lim; Nancy B Simmons; Michael R McGowen; Stephen J Rossiter
Journal:  Syst Biol       Date:  2021-10-13       Impact factor: 15.683

7.  A branch-heterogeneous model of protein evolution for efficient inference of ancestral sequences.

Authors:  M Groussin; B Boussau; M Gouy
Journal:  Syst Biol       Date:  2013-03-07       Impact factor: 15.683

Review 8.  Using the Mutation-Selection Framework to Characterize Selection on Protein Sequences.

Authors:  Ashley I Teufel; Andrew M Ritchie; Claus O Wilke; David A Liberles
Journal:  Genes (Basel)       Date:  2018-08-13       Impact factor: 4.096

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.