| Literature DB >> 26064786 |
Abstract
Incomplete lineage sorting (ILS), modelled by the multi-species coalescent, is a process that results in a gene tree being different from the species tree. Because ILS is expected to occur for at least some loci within genome-scale analyses, the evaluation of species tree estimation methods in the presence of ILS is of great interest. Performance on simulated and biological data have suggested that concatenation analyses can result in the wrong tree with high support under some conditions, and a recent theoretical result by Roch and Steel proved that concatenation using unpartitioned maximum likelihood analysis can be statistically inconsistent in the presence of ILS. In this study, we survey the major species tree estimation methods, including the newly proposed "statistical binning" methods, and discuss their theoretical properties. We also note that there are two interpretations of the term "statistical consistency", and discuss the theoretical results proven under both interpretations.Entities:
Year: 2015 PMID: 26064786 PMCID: PMC4450984 DOI: 10.1371/currents.tol.8d41ac0f13d1abedf4c4a59f5d17b1f7
Source DB: PubMed Journal: PLoS Curr ISSN: 2157-3999
We present the current status with respect to statistical consistency (of the first or second kind) of some standard phylogenomic estimation methods. The first column is for the first meaning of statistical consistency, which states that the species tree estimated by the method will converge to the true species tree as the number of loci and number of sites per locus both increase. The second column is for the second meaning, which states that the species tree estimated by the method will converge to the true species tree as the number of loci increases, even for bounded number of sites per locus. We also cite the paper in which the theoretical result is established.
| Consistency - first kind | Consistency - second kind | |
|---|---|---|
| MP-EST | YES | UNKNOWN |
| ASTRAL | YES | UNKNOWN |
| Unpartitioned concatenated maximum likelihood | NO ( | NO ( |
| Fully partitioned maximum likelihood | UNKNOWN | UNKNOWN |
| Unweighted statistical binning followed by consistent summary method (e.g., ASTRAL) | NO ( | NO ( |
| Weighted statistical binning followed by consistent summary method (e.g., ASTRAL) | YES ( | UNKNOWN |
| *BEAST | YES | UNKNOWN |