Literature DB >> 30247732

Long-Branch Attraction in Species Tree Estimation: Inconsistency of Partitioned Likelihood and Topology-Based Summary Methods.

Sebastien Roch1, Michael Nute2, Tandy Warnow3.   

Abstract

With advances in sequencing technologies, there are now massive amounts of genomic data from across all life, leading to the possibility that a robust Tree of Life can be constructed. However, "gene tree heterogeneity", which is when different genomic regions can evolve differently, is a common phenomenon in multi-locus data sets, and reduces the accuracy of standard methods for species tree estimation that do not take this heterogeneity into account. New methods have been developed for species tree estimation that specifically address gene tree heterogeneity, and that have been proven to converge to the true species tree when the number of loci and number of sites per locus both increase (i.e., the methods are said to be "statistically consistent"). Yet, little is known about the biologically realistic condition where the number of sites per locus is bounded. We show that when the sequence length of each locus is bounded (by any arbitrarily chosen value), the most common approaches to species tree estimation that take heterogeneity into account (i.e., traditional fully partitioned concatenated maximum likelihood and newer approaches, called summary methods, that estimate the species tree by combining estimated gene trees) are not statistically consistent, even when the heterogeneity is extremely constrained. The main challenge is the presence of conditions such as long branch attraction that create biased tree estimation when the number of sites is restricted. Hence, our study uncovers a fundamental challenge to species tree estimation using both traditional and new methods.

Mesh:

Year:  2019        PMID: 30247732     DOI: 10.1093/sysbio/syy061

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  15 in total

1.  Species Tree Estimation and the Impact of Gene Loss Following Whole-Genome Duplication.

Authors:  Haifeng Xiong; Danying Wang; Chen Shao; Xuchen Yang; Jialin Yang; Tao Ma; Charles C Davis; Liang Liu; Zhenxiang Xi
Journal:  Syst Biol       Date:  2022-10-12       Impact factor: 9.160

2.  QuCo: quartet-based co-estimation of species trees and gene trees.

Authors:  Maryam Rabiee; Siavash Mirarab
Journal:  Bioinformatics       Date:  2022-06-24       Impact factor: 6.931

3.  A stochastic Farris transform for genetic data under the multispecies coalescent with applications to data requirements.

Authors:  Gautam Dasarathy; Elchanan Mossel; Robert Nowak; Sebastien Roch
Journal:  J Math Biol       Date:  2022-04-08       Impact factor: 2.164

4.  Paralogs and off-target sequences improve phylogenetic resolution in a densely-sampled study of the breadfruit genus (Artocarpus, Moraceae).

Authors:  Elliot M Gardner; Matthew G Johnson; Joan T Pereira; Aida Shafreena Ahmad Puad; Deby Arifiani; Norman J Wickett; Nyree J C Zerega
Journal:  Syst Biol       Date:  2020-09-24       Impact factor: 15.683

5.  PhyloWGA: chromosome-aware phylogenetic interrogation of whole genome alignments.

Authors:  Richard H Adams; Todd A Castoe; Michael DeGiorgio
Journal:  Bioinformatics       Date:  2021-07-27       Impact factor: 6.937

6.  TreeMerge: a new method for improving the scalability of species tree estimation methods.

Authors:  Erin K Molloy; Tandy Warnow
Journal:  Bioinformatics       Date:  2019-07-15       Impact factor: 6.937

7.  Statistically consistent divide-and-conquer pipelines for phylogeny estimation using NJMerge.

Authors:  Erin K Molloy; Tandy Warnow
Journal:  Algorithms Mol Biol       Date:  2019-07-19       Impact factor: 1.405

8.  Phylogenomics Yields New Insight Into Relationships Within Vernonieae (Asteraceae).

Authors:  Carolina M Siniscalchi; Benoit Loeuille; Vicki A Funk; Jennifer R Mandel; José R Pirani
Journal:  Front Plant Sci       Date:  2019-10-17       Impact factor: 5.753

9.  Supergene validation: A model-based protocol for assessing the accuracy of non-model-based supergene methods.

Authors:  Richard H Adams; Todd A Castoe
Journal:  MethodsX       Date:  2019-09-24

10.  Constrained incremental tree building: new absolute fast converging phylogeny estimation methods with improved scalability and accuracy.

Authors:  Qiuyi Zhang; Satish Rao; Tandy Warnow
Journal:  Algorithms Mol Biol       Date:  2019-02-06       Impact factor: 1.405

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.