Literature DB >> 34508605

Excluding Loci With Substitution Saturation Improves Inferences From Phylogenomic Data.

David A Duchêne1, Niklas Mather2, Cara Van Der Wal2, Simon Y W Ho2.   

Abstract

The historical signal in nucleotide sequences becomes eroded over time by substitutions occurring repeatedly at the same sites. This phenomenon, known as substitution saturation, is recognized as one of the primary obstacles to deep-time phylogenetic inference using genome-scale data sets. We present a new test of substitution saturation and demonstrate its performance in simulated and empirical data. For some of the 36 empirical phylogenomic data sets that we examined, we detect substitution saturation in around 50% of loci. We found that saturation tends to be flagged as problematic in loci with highly discordant phylogenetic signals across sites. Within each data set, the loci with smaller numbers of informative sites are more likely to be flagged as containing problematic levels of saturation. The entropy saturation test proposed here is sensitive to high evolutionary rates relative to the evolutionary timeframe, while also being sensitive to several factors known to mislead phylogenetic inference, including short internal branches relative to external branches, short nucleotide sequences, and tree imbalance. Our study demonstrates that excluding loci with substitution saturation can be an effective means of mitigating the negative impact of multiple substitutions on phylogenetic inferences. [Phylogenetic model performance; phylogenomics; substitution model; substitution saturation; test statistics.].
© The Author(s) 2021. Published by Oxford University Press on behalf of the Society of Systematic Biologists.

Entities:  

Mesh:

Year:  2022        PMID: 34508605      PMCID: PMC9016599          DOI: 10.1093/sysbio/syab075

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   9.160


  75 in total

1.  Constraints on protein evolution and the age of the eubacteria/eukaryote split.

Authors:  M M Miyamoto; W M Fitch
Journal:  Syst Biol       Date:  1996-12       Impact factor: 15.683

2.  An index of substitution saturation and its application.

Authors:  Xuhua Xia; Zheng Xie; Marco Salemi; Lu Chen; Yong Wang
Journal:  Mol Phylogenet Evol       Date:  2003-01       Impact factor: 4.286

3.  Bayesian model adequacy and choice in phylogenetics.

Authors:  Jonathan P Bollback
Journal:  Mol Biol Evol       Date:  2002-07       Impact factor: 16.240

4.  Phylogenetic signal and noise: predicting the power of a data set to resolve phylogeny.

Authors:  Jeffrey P Townsend; Zhuo Su; Yonas I Tekle
Journal:  Syst Biol       Date:  2012-03-03       Impact factor: 15.683

5.  Modeling compositional heterogeneity.

Authors:  Peter G Foster
Journal:  Syst Biol       Date:  2004-06       Impact factor: 15.683

6.  Saturation and base composition bias explain phylogenomic conflict in Plasmodium.

Authors:  Liliana M Dávalos; Susan L Perkins
Journal:  Genomics       Date:  2008-03-04       Impact factor: 5.736

7.  Why Do Phylogenomic Data Sets Yield Conflicting Trees? Data Type Influences the Avian Tree of Life more than Taxon Sampling.

Authors:  Sushma Reddy; Rebecca T Kimball; Akanksha Pandey; Peter A Hosner; Michael J Braun; Shannon J Hackett; Kin-Lan Han; John Harshman; Christopher J Huddleston; Sarah Kingston; Ben D Marks; Kathleen J Miglia; William S Moore; Frederick H Sheldon; Christopher C Witt; Tamaki Yuri; Edward L Braun
Journal:  Syst Biol       Date:  2017-09-01       Impact factor: 15.683

8.  Linking Branch Lengths across Sets of Loci Provides the Highest Statistical Support for Phylogenetic Inference.

Authors:  David A Duchêne; K Jun Tong; Charles S P Foster; Sebastián Duchêne; Robert Lanfear; Simon Y W Ho
Journal:  Mol Biol Evol       Date:  2020-04-01       Impact factor: 16.240

9.  Resolving difficult phylogenetic questions: why more sequences are not enough.

Authors:  Hervé Philippe; Henner Brinkmann; Dennis V Lavrov; D Timothy J Littlewood; Michael Manuel; Gert Wörheide; Denis Baurain
Journal:  PLoS Biol       Date:  2011-03-15       Impact factor: 8.029

10.  Expanding anchored hybrid enrichment to resolve both deep and shallow relationships within the spider tree of life.

Authors:  Chris A Hamilton; Alan R Lemmon; Emily Moriarty Lemmon; Jason E Bond
Journal:  BMC Evol Biol       Date:  2016-10-13       Impact factor: 3.260

View more
  1 in total

1.  Comparative genomics of the Western Hemisphere soft tick-borne relapsing fever borreliae highlights extensive plasmid diversity.

Authors:  Alexander R Kneubehl; Aparna Krishnavajhala; Sebastián Muñoz Leal; Adam J Replogle; Luke C Kingry; Sergio E Bermúdez; Marcelo B Labruna; Job E Lopez
Journal:  BMC Genomics       Date:  2022-05-31       Impact factor: 4.547

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.