Literature DB >> 32415974

Consistency of SVDQuartets and Maximum Likelihood for Coalescent-Based Species Tree Estimation.

Matthew Wascher1, Laura Kubatko1,2.   

Abstract

Numerous methods for inferring species-level phylogenies under the coalescent model have been proposed within the last 20 years, and debates continue about the relative strengths and weaknesses of these methods. One desirable property of a phylogenetic estimator is that of statistical consistency, which means intuitively that as more data are collected, the probability that the estimated tree has the same topology as the true tree goes to 1. To date, consistency results for species tree inference under the multispecies coalescent (MSC) have been derived only for summary statistics methods, such as ASTRAL and MP-EST. These methods have been found to be consistent given true gene trees but may be inconsistent when gene trees are estimated from data for loci of finite length. Here, we consider the question of statistical consistency for four taxa for SVDQuartets for general data types, as well as for the maximum likelihood (ML) method in the case in which the data are a collection of sites generated under the MSC model such that the sites are conditionally independent given the species tree (we call these data coalescent independent sites [CIS] data). We show that SVDQuartets is statistically consistent for all data types (i.e., for both CIS data and for multilocus data), and we derive its rate of convergence. We additionally show that ML is consistent for CIS data under the JC69 model and discuss why a proof for the more general multilocus case is difficult. Finally, we compare the performance of ML and SDVQuartets using simulation for both data types. [Consistency; gene tree; maximum likelihood; multilocus data; hylogenetic inference; species tree; SVDQuartets.].
© The Author(s) 2020. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Year:  2021        PMID: 32415974     DOI: 10.1093/sysbio/syaa039

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  3 in total

1.  Parameter Identifiability for a Profile Mixture Model of Protein Evolution.

Authors:  Samaneh Yourdkhani; Elizabeth S Allman; John A Rhodes
Journal:  J Comput Biol       Date:  2021-05-06       Impact factor: 1.549

2.  Genome-wide footprints in the carob tree (Ceratonia siliqua) unveil a new domestication pattern of a fruit tree in the Mediterranean.

Authors:  Alex Baumel; Gonzalo Nieto Feliner; Frédéric Médail; Stefano La Malfa; Mario Di Guardo; Magda Bou Dagher Kharrat; Fatma Lakhal-Mirleau; Valentine Frelon; Lahcen Ouahmane; Katia Diadema; Hervé Sanguin; Juan Viruel
Journal:  Mol Ecol       Date:  2022-06-30       Impact factor: 6.622

3.  Hypothesis Testing With Rank Conditions in Phylogenetics.

Authors:  Colby Long; Laura Kubatko
Journal:  Front Genet       Date:  2021-07-02       Impact factor: 4.599

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.