Literature DB >> 28595363

More on the Best Evolutionary Rate for Phylogenetic Analysis.

Seraina Klopfstein1,2, Tim Massingham3, Nick Goldman3.   

Abstract

The accumulation of genome-scale molecular data sets for nonmodel taxa brings us ever closer to resolving the tree of life of all living organisms. However, despite the depth of data available, a number of studies that each used thousands of genes have reported conflicting results. The focus of phylogenomic projects must thus shift to more careful experimental design. Even though we still have a limited understanding of what are the best predictors of the phylogenetic informativeness of a gene, there is wide agreement that one key factor is its evolutionary rate; but there is no consensus as to whether the rates derived as optimal in various analytical, empirical, and simulation approaches have any general applicability. We here use simulations to infer optimal rates in a set of realistic phylogenetic scenarios with varying tree sizes, numbers of terminals, and tree shapes. Furthermore, we study the relationship between the optimal rate and rate variation among sites and among lineages. Finally, we examine how well the predictions made by a range of experimental design methods correlate with the observed performance in our simulations.We find that the optimal level of divergence is surprisingly robust to differences in taxon sampling and even to among-site and among-lineage rate variation as often encountered in empirical data sets. This finding encourages the use of methods that rely on a single optimal rate to predict a gene's utility. Focusing on correct recovery either of the most basal node in the phylogeny or of the entire topology, the optimal rate is about 0.45 substitutions from root to tip in average Yule trees and about 0.2 in difficult trees with short basal and long-apical branches, but all rates leading to divergence levels between about 0.1 and 0.5 perform reasonably well.Testing the performance of six methods that can be used to predict a gene's utility against our simulation results, we find that the probability of resolution, signal-noise analysis, and Fisher information are good predictors of phylogenetic informativeness, but they require specification of at least part of a model tree. Likelihood quartet mapping also shows very good performance but only requires sequence alignments and is thus applicable without making assumptions about the phylogeny. Despite them being the most commonly used methods for experimental design, geometric quartet mapping and the integration of phylogenetic informativeness curves perform rather poorly in our comparison. Instead of derived predictors of phylogenetic informativeness, we suggest that the number of sites in a gene that evolve at near-optimal rates (as inferred here) could be used directly to prioritize genes for phylogenetic inference. In combination with measures of model fit, especially with respect to compositional biases and among-site and among-lineage rate variation, such an approach has the potential to greatly improve marker choice and should be tested on empirical data.
© The Author(s) 2017. Published by Oxford University Press on behalf of the Systematic Biology.

Entities:  

Keywords:  Experimental design; phylogenomics

Mesh:

Year:  2017        PMID: 28595363      PMCID: PMC5790136          DOI: 10.1093/sysbio/syx051

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  61 in total

1.  The biasing effect of compositional heterogeneity on phylogenetic estimates may be underestimated.

Authors:  Lars Jermiin; Simon Y Ho; Faisal Ababneh; John Robinson; Anthony W Larkum
Journal:  Syst Biol       Date:  2004-08       Impact factor: 15.683

2.  Phylogenetic signal and noise: predicting the power of a data set to resolve phylogeny.

Authors:  Jeffrey P Townsend; Zhuo Su; Yonas I Tekle
Journal:  Syst Biol       Date:  2012-03-03       Impact factor: 15.683

3.  An evaluation of phylogenetic informativeness profiles and the molecular phylogeny of diplazontinae (Hymenoptera, Ichneumonidae).

Authors:  Seraina Klopfstein; Christian Kropf; Donald L J Quicke
Journal:  Syst Biol       Date:  2010-01-13       Impact factor: 15.683

4.  Phylogenomics revives traditional views on deep animal relationships.

Authors:  Hervé Philippe; Romain Derelle; Philippe Lopez; Kerstin Pick; Carole Borchiellini; Nicole Boury-Esnault; Jean Vacelet; Emmanuelle Renard; Evelyn Houliston; Eric Quéinnec; Corinne Da Silva; Patrick Wincker; Hervé Le Guyader; Sally Leys; Daniel J Jackson; Fabian Schreiber; Dirk Erpenbeck; Burkhard Morgenstern; Gert Wörheide; Michaël Manuel
Journal:  Curr Biol       Date:  2009-04-02       Impact factor: 10.834

5.  Combining data in phylogenetic analysis.

Authors:  J P Huelsenbeck; J J Bull; C W Cunningham
Journal:  Trends Ecol Evol       Date:  1996-04       Impact factor: 17.712

6.  ACCURACY OF ESTIMATED PHYLOGENIES: EFFECTS OF TREE TOPOLOGY AND EVOLUTIONARY MODEL.

Authors:  F James Rohlf; W S Chang; R R Sokal; Junhyong Kim
Journal:  Evolution       Date:  1990-09       Impact factor: 3.694

7.  Resolving conflict in eutherian mammal phylogeny using phylogenomics and the multispecies coalescent model.

Authors:  Sen Song; Liang Liu; Scott V Edwards; Shaoyuan Wu
Journal:  Proc Natl Acad Sci U S A       Date:  2012-08-28       Impact factor: 11.205

8.  A comprehensive phylogeny of birds (Aves) using targeted next-generation DNA sequencing.

Authors:  Richard O Prum; Jacob S Berv; Alex Dornburg; Daniel J Field; Jeffrey P Townsend; Emily Moriarty Lemmon; Alan R Lemmon
Journal:  Nature       Date:  2015-10-07       Impact factor: 49.962

9.  PhyDesign: an online application for profiling phylogenetic informativeness.

Authors:  Francesc López-Giráldez; Jeffrey P Townsend
Journal:  BMC Evol Biol       Date:  2011-05-31       Impact factor: 3.260

10.  A genome-scale mining strategy for recovering novel rapidly-evolving nuclear single-copy genes for addressing shallow-scale phylogenetics in Hydrangea.

Authors:  Carolina Granados Mendoza; Julia Naumann; Marie-Stéphanie Samain; Paul Goetghebeur; Yannick De Smet; Stefan Wanke
Journal:  BMC Evol Biol       Date:  2015-07-04       Impact factor: 3.260

View more
  7 in total

1.  Excluding Loci With Substitution Saturation Improves Inferences From Phylogenomic Data.

Authors:  David A Duchêne; Niklas Mather; Cara Van Der Wal; Simon Y W Ho
Journal:  Syst Biol       Date:  2022-04-19       Impact factor: 9.160

2.  Evolutionary Rate Variation among Lineages in Gene Trees has a Negative Impact on Species-Tree Inference.

Authors:  Mezzalina Vankan; Simon Y W Ho; David A Duchêne
Journal:  Syst Biol       Date:  2022-02-10       Impact factor: 15.683

3.  Differences in Performance among Test Statistics for Assessing Phylogenomic Model Adequacy.

Authors:  David A Duchêne; Sebastian Duchêne; Simon Y W Ho
Journal:  Genome Biol Evol       Date:  2018-06-01       Impact factor: 3.416

4.  Noise and biases in genomic data may underlie radically different hypotheses for the position of Iguania within Squamata.

Authors:  Nicolás Mongiardino Koch; Jacques A Gauthier
Journal:  PLoS One       Date:  2018-08-22       Impact factor: 3.240

5.  Resolving the Early Divergence Pattern of Teleost Fish Using Genome-Scale Data.

Authors:  Naoko Takezaki
Journal:  Genome Biol Evol       Date:  2021-05-07       Impact factor: 3.416

6.  Mind the Outgroup and Bare Branches in Total-Evidence Dating: a Case Study of Pimpliform Darwin Wasps (Hymenoptera, Ichneumonidae).

Authors:  Tamara Spasojevic; Gavin R Broad; Ilari E Sääksjärvi; Martin Schwarz; Masato Ito; Stanislav Korenko; Seraina Klopfstein
Journal:  Syst Biol       Date:  2021-02-10       Impact factor: 15.683

7.  Optimizing Phylogenomics with Rapidly Evolving Long Exons: Comparison with Anchored Hybrid Enrichment and Ultraconserved Elements.

Authors:  Benjamin R Karin; Tony Gamble; Todd R Jackman
Journal:  Mol Biol Evol       Date:  2020-03-01       Impact factor: 16.240

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.