Literature DB >> 26416981

Substitution Model Adequacy and Assessing the Reliability of Estimates of Virus Evolutionary Rates and Time Scales.

Sebastián Duchêne1, Francesca Di Giallonardo1, Edward C Holmes2.   

Abstract

Determining the time scale of virus evolution is central to understanding their origins and emergence. The phylogenetic methods commonly used for this purpose can be misleading if the substitution model makes incorrect assumptions about the data. Empirical studies consider a pool of models and select that with the highest statistical fit. However, this does not allow the rejection of all models, even if they poorly describe the data. An alternative is to use model adequacy methods that evaluate the ability of a model to predict hypothetical future observations. This can be done by comparing the empirical data with data generated under the model in question. We conducted simulations to evaluate the sensitivity of such methods with nucleotide, amino acid, and codon data. These effectively detected underparameterized models, but failed to detect mutational saturation and some instances of nonstationary base composition, which can lead to biases in estimates of tree topology and length. To test the applicability of these methods with real data, we analyzed nucleotide and amino acid data sets from the genus Flavivirus of RNA viruses. In most cases these models were inadequate, with the exception of a data set of relatively closely related sequences of Dengue virus, for which the GTR+Γ nucleotide and LG+Γ amino acid substitution models were adequate. Our results partly explain the lack of consensus over estimates of the long-term evolutionary time scale of these viruses, and indicate that assessing the adequacy of substitution models should be routinely used to determine whether estimates are reliable.
© The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  Bayesian model averaging; model adequacy; parametric bootstrap; posterior predictive simulation; substitution model; virus evolution

Mesh:

Substances:

Year:  2015        PMID: 26416981     DOI: 10.1093/molbev/msv207

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  12 in total

1.  Discovery of Novel Crustacean and Cephalopod Flaviviruses: Insights into the Evolution and Circulation of Flaviviruses between Marine Invertebrate and Vertebrate Hosts.

Authors:  Rhys Parry; Sassan Asgari
Journal:  J Virol       Date:  2019-06-28       Impact factor: 5.103

Review 2.  Methodologies for Microbial Ancestral Sequence Reconstruction.

Authors:  Miguel Arenas
Journal:  Methods Mol Biol       Date:  2022

3.  Consequences of Substitution Model Selection on Protein Ancestral Sequence Reconstruction.

Authors:  Roberto Del Amparo; Miguel Arenas
Journal:  Mol Biol Evol       Date:  2022-07-02       Impact factor: 8.800

4.  Cross-validation to select Bayesian hierarchical models in phylogenetics.

Authors:  Sebastián Duchêne; David A Duchêne; Francesca Di Giallonardo; John-Sebastian Eden; Jemma L Geoghegan; Kathryn E Holt; Simon Y W Ho; Edward C Holmes
Journal:  BMC Evol Biol       Date:  2016-05-26       Impact factor: 3.260

5.  Comparative analysis estimates the relative frequencies of co-divergence and cross-species transmission within viral families.

Authors:  Jemma L Geoghegan; Sebastián Duchêne; Edward C Holmes
Journal:  PLoS Pathog       Date:  2017-02-08       Impact factor: 6.823

6.  Differences in Performance among Test Statistics for Assessing Phylogenomic Model Adequacy.

Authors:  David A Duchêne; Sebastian Duchêne; Simon Y W Ho
Journal:  Genome Biol Evol       Date:  2018-06-01       Impact factor: 3.416

7.  Modeling site-specific amino-acid preferences deepens phylogenetic estimates of viral sequence divergence.

Authors:  Sarah K Hilton; Jesse D Bloom
Journal:  Virus Evol       Date:  2018-11-06

8.  Impact of the tree prior on estimating clock rates during epidemic outbreaks.

Authors:  Simon Möller; Louis du Plessis; Tanja Stadler
Journal:  Proc Natl Acad Sci U S A       Date:  2018-04-02       Impact factor: 11.205

9.  Relative Model Fit Does Not Predict Topological Accuracy in Single-Gene Protein Phylogenetics.

Authors:  Stephanie J Spielman
Journal:  Mol Biol Evol       Date:  2020-07-01       Impact factor: 16.240

10.  PoSE: visualization of patterns of sequence evolution using PAML and MATLAB.

Authors:  Kun Zhao; Elizabeth Henderson; Kelley Bullard; M Steven Oberste; Cara C Burns; Jaume Jorba
Journal:  BMC Bioinformatics       Date:  2018-10-22       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.