Literature DB >> 33381826

Using a GTR+Γ substitution model for dating sequence divergence when stationarity and time-reversibility assumptions are violated.

Jose Barba-Montoya1,2, Qiqing Tao1,2, Sudhir Kumar1,2,3.   

Abstract

MOTIVATION: As the number and diversity of species and genes grow in contemporary datasets, two common assumptions made in all molecular dating methods, namely the time-reversibility and stationarity of the substitution process, become untenable. No software tools for molecular dating allow researchers to relax these two assumptions in their data analyses. Frequently the same General Time Reversible (GTR) model across lineages along with a gamma (+Γ) distributed rates across sites is used in relaxed clock analyses, which assumes time-reversibility and stationarity of the substitution process. Many reports have quantified the impact of violations of these underlying assumptions on molecular phylogeny, but none have systematically analyzed their impact on divergence time estimates.
RESULTS: We quantified the bias on time estimates that resulted from using the GTR + Γ model for the analysis of computer-simulated nucleotide sequence alignments that were evolved with non-stationary (NS) and non-reversible (NR) substitution models. We tested Bayesian and RelTime approaches that do not require a molecular clock for estimating divergence times. Divergence times obtained using a GTR + Γ model differed only slightly (∼3% on average) from the expected times for NR datasets, but the difference was larger for NS datasets (∼10% on average). The use of only a few calibrations reduced these biases considerably (∼5%). Confidence and credibility intervals from GTR + Γ analysis usually contained correct times. Therefore, the bias introduced by the use of the GTR + Γ model to analyze datasets, in which the time-reversibility and stationarity assumptions are violated, is likely not large and can be reduced by applying multiple calibrations.
AVAILABILITY AND IMPLEMENTATION: All datasets are deposited in Figshare: https://doi.org/10.6084/m9.figshare.12594638.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2020        PMID: 33381826      PMCID: PMC7773479          DOI: 10.1093/bioinformatics/btaa820

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  30 in total

1.  Evolutionary distance estimation under heterogeneous substitution pattern among lineages.

Authors:  Koichiro Tamura; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2002-10       Impact factor: 16.240

2.  Sources of error inherent in species-tree estimation: impact of mutational and coalescent effects on accuracy and implications for choosing among different methods.

Authors:  Huateng Huang; Qixin He; Laura S Kubatko; L Lacey Knowles
Journal:  Syst Biol       Date:  2010-09-10       Impact factor: 15.683

3.  Modeling compositional heterogeneity.

Authors:  Peter G Foster
Journal:  Syst Biol       Date:  2004-06       Impact factor: 15.683

4.  Estimating the rate of evolution of the rate of molecular evolution.

Authors:  J L Thorne; H Kishino; I S Painter
Journal:  Mol Biol Evol       Date:  1998-12       Impact factor: 16.240

5.  MEGA-CC: computing core of molecular evolutionary genetics analysis program for automated and iterative data analysis.

Authors:  Sudhir Kumar; Glen Stecher; Daniel Peterson; Koichiro Tamura
Journal:  Bioinformatics       Date:  2012-08-24       Impact factor: 6.937

6.  A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences.

Authors:  M Kimura
Journal:  J Mol Evol       Date:  1980-12       Impact factor: 2.395

7.  Strong evidence for lineage and sequence specificity of substitution rates and patterns in Drosophila.

Authors:  Nadia D Singh; Peter F Arndt; Andrew G Clark; Charles F Aquadro
Journal:  Mol Biol Evol       Date:  2009-04-07       Impact factor: 16.240

8.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA.

Authors:  M Hasegawa; H Kishino; T Yano
Journal:  J Mol Evol       Date:  1985       Impact factor: 2.395

9.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2014-01-21       Impact factor: 6.937

10.  A Machine Learning Method for Detecting Autocorrelation of Evolutionary Rates in Large Phylogenies.

Authors:  Qiqing Tao; Koichiro Tamura; Fabia U Battistuzzi; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2019-04-01       Impact factor: 16.240

View more
  3 in total

1.  Assessing Rapid Relaxed-Clock Methods for Phylogenomic Dating.

Authors:  Jose Barba-Montoya; Qiqing Tao; Sudhir Kumar
Journal:  Genome Biol Evol       Date:  2021-11-05       Impact factor: 3.416

2.  Embracing Green Computing in Molecular Phylogenetics.

Authors:  Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2022-03-02       Impact factor: 16.240

3.  First Detection of NADC34-like PRRSV as a Main Epidemic Strain on a Large Farm in China.

Authors:  Chao Li; Bangjun Gong; Qi Sun; Hu Xu; Jing Zhao; Lirun Xiang; Yan-Dong Tang; Chaoliang Leng; Wansheng Li; Zhenyang Guo; Jun Fu; Jinmei Peng; Qian Wang; Guohui Zhou; Ying Yu; Fandan Meng; Tongqing An; Xuehui Cai; Zhi-Jun Tian; Hongliang Zhang
Journal:  Pathogens       Date:  2021-12-29
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.