Literature DB >> 29029343

An Evaluation of Different Partitioning Strategies for Bayesian Estimation of Species Divergence Times.

Konstantinos Angelis1, Sandra Álvarez-Carretero2, Mario Dos Reis1,2, Ziheng Yang1.   

Abstract

The explosive growth of molecular sequence data has made it possible to estimate species divergence times under relaxed-clock models using genome-scale data sets with many gene loci. In order to improve both model realism and to best extract information about relative divergence times in the sequence data, it is important to account for the heterogeneity in the evolutionary process across genes or genomic regions. Partitioning is a commonly used approach to achieve those goals. We group sites that have similar evolutionary characteristics into the same partition and those with different characteristics into different partitions, and then use different models or different values of model parameters for different partitions to account for the among-partition heterogeneity. However, how to partition data in practical phylogenetic analysis, and in particular in relaxed-clock dating analysis, is more art than science. Here, we use computer simulation and real data analysis to study the impact of the partition scheme on divergence time estimation. The partition schemes had relatively minor effects on the accuracy of posterior time estimates when the prior assumptions were correct and the clock was not seriously violated, but showed large differences when the clock was seriously violated, when the fossil calibrations were in conflict or incorrect, or when the rate prior was mis-specified. Concatenation produced the widest posterior intervals with the least precision. Use of many partitions increased the precision, as predicted by the infinite-sites theory, but the posterior intervals might fail to include the true ages because of the conflicting fossil calibrations or mis-specified rate priors. We analyzed a data set of 78 plastid genes from 15 plant species with serious clock violation and showed that time estimates differed significantly among partition schemes, irrespective of the rate drift model used. Multiple and precise fossil calibrations reduced the differences among partition schemes and were important to improving the precision of divergence time estimates. While the use of many partitions is an important approach to reducing the uncertainty in posterior time estimates, we do not recommend its general use for the present, given the limitations of current models of rate drift for partitioned data and the challenges of interpreting the fossil evidence to construct accurate and informative calibrations.
© The Author(s) 2017. Published by Oxford University Press on behalf of the Systematic Biologists.

Entities:  

Keywords:  Bayesian inference; genomic data; infinite-sites theory; molecular clock dating; partition analysis

Mesh:

Year:  2018        PMID: 29029343      PMCID: PMC5790132          DOI: 10.1093/sysbio/syx061

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  50 in total

1.  Codon-substitution models for heterogeneous selection pressure at amino acid sites.

Authors:  Z Yang; R Nielsen; N Goldman; A M Pedersen
Journal:  Genetics       Date:  2000-05       Impact factor: 4.562

2.  The impact of the representation of fossil calibrations on Bayesian estimation of species divergence times.

Authors:  Jun Inoue; Philip C J Donoghue; Ziheng Yang
Journal:  Syst Biol       Date:  2009-11-25       Impact factor: 15.683

Review 3.  Implementing and testing the multispecies coalescent model: A valuable paradigm for phylogenomics.

Authors:  Scott V Edwards; Zhenxiang Xi; Axel Janke; Brant C Faircloth; John E McCormack; Travis C Glenn; Bojian Zhong; Shaoyuan Wu; Emily Moriarty Lemmon; Alan R Lemmon; Adam D Leaché; Liang Liu; Charles C Davis
Journal:  Mol Phylogenet Evol       Date:  2015-10-27       Impact factor: 4.286

4.  Bayesian estimation of species divergence times under a molecular clock using multiple fossil calibrations with soft bounds.

Authors:  Ziheng Yang; Bruce Rannala
Journal:  Mol Biol Evol       Date:  2005-09-21       Impact factor: 16.240

5.  Molecular phylogeny of coleoid cephalopods (Mollusca: Cephalopoda) using a multigene approach; the effect of data partitioning on resolving phylogenies in a Bayesian framework.

Authors:  Jan Strugnell; Mark Norman; Jennifer Jackson; Alexei J Drummond; Alan Cooper
Journal:  Mol Phylogenet Evol       Date:  2005-06-02       Impact factor: 4.286

6.  Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites.

Authors:  Z Yang
Journal:  Mol Biol Evol       Date:  1993-11       Impact factor: 16.240

7.  Phylogenomic datasets provide both precision and accuracy in estimating the timescale of placental mammal phylogeny.

Authors:  Mario dos Reis; Jun Inoue; Masami Hasegawa; Robert J Asher; Philip C J Donoghue; Ziheng Yang
Journal:  Proc Biol Sci       Date:  2012-05-23       Impact factor: 5.349

Review 8.  Bayesian molecular clock dating of species divergences in the genomics era.

Authors:  Mario dos Reis; Philip C J Donoghue; Ziheng Yang
Journal:  Nat Rev Genet       Date:  2015-12-21       Impact factor: 53.242

9.  From algae to angiosperms-inferring the phylogeny of green plants (Viridiplantae) from 360 plastid genomes.

Authors:  Brad R Ruhfel; Matthew A Gitzendanner; Pamela S Soltis; Douglas E Soltis; J Gordon Burleigh
Journal:  BMC Evol Biol       Date:  2014-02-17       Impact factor: 3.260

10.  Selecting optimal partitioning schemes for phylogenomic datasets.

Authors:  Robert Lanfear; Brett Calcott; David Kainer; Christoph Mayer; Alexandros Stamatakis
Journal:  BMC Evol Biol       Date:  2014-04-17       Impact factor: 3.260

View more
  6 in total

1.  Macroevolutionary trends and diversification dynamics in Atripliceae (Amaranthaceae s.l., Chenopodioideae): a first approach.

Authors:  Nicolás F Brignone; Raúl Pozner; Silvia S Denham
Journal:  Ann Bot       Date:  2022-09-06       Impact factor: 5.040

2.  Strategies for Partitioning Clock Models in Phylogenomic Dating: Application to the Angiosperm Evolutionary Timescale.

Authors:  Charles S P Foster; Simon Y W Ho
Journal:  Genome Biol Evol       Date:  2017-10-01       Impact factor: 3.416

3.  Sphenodontian phylogeny and the impact of model choice in Bayesian morphological clock estimates of divergence times and evolutionary rates.

Authors:  Tiago R Simões; Michael W Caldwell; Stephanie E Pierce
Journal:  BMC Biol       Date:  2020-12-07       Impact factor: 7.431

4.  Localized Phylogenetic Discordance Among Nuclear Loci Due to Incomplete Lineage Sorting and Introgression in the Family of Cotton and Cacao (Malvaceae).

Authors:  Rebeca Hernández-Gutiérrez; Cássio van den Berg; Carolina Granados Mendoza; Marcia Peñafiel Cevallos; Efraín Freire M; Emily Moriarty Lemmon; Alan R Lemmon; Susana Magallón
Journal:  Front Plant Sci       Date:  2022-04-13       Impact factor: 5.753

5.  Global Rate Variation in Bony Vertebrates.

Authors:  Naoko Takezaki
Journal:  Genome Biol Evol       Date:  2018-07-01       Impact factor: 3.416

6.  Integrated likelihood for phylogenomics under a no-common-mechanism model.

Authors:  Hunter Tidwell; Luay Nakhleh
Journal:  BMC Genomics       Date:  2020-04-16       Impact factor: 3.969

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.