Literature DB >> 22697236

Estimating optimal species trees from incomplete gene trees under deep coalescence.

Md Shamsuzzoha Bayzid1, Tandy Warnow.   

Abstract

The estimation of species trees typically involves the estimation of trees and alignments on many different genes, so that the species tree can be based on many different parts of the genome. This kind of phylogenomic approach to species tree estimation has the potential to produce more accurate species tree estimates, especially when gene trees can differ from the species tree due to processes such as incomplete lineage sorting (ILS), gene duplication and loss, and horizontal gene transfer. Because ILS (also called "deep coalescence") is a frequent problem in systematics, many methods have been developed to estimate species trees from gene trees or alignments that specifically take ILS into consideration. In this paper we consider the problem of estimating species trees from gene trees and alignments for the general case where the gene trees and alignments can be incomplete, which means that not all the genes contain sequences for all the species. We formalize optimization problems for this context and prove theoretical results for these problems. We also present the results of a simulation study evaluating existing methods for estimating species trees from incomplete gene trees. Our simulation study shows that *BEAST, a statistical method for estimating species trees from gene sequence alignments, produces by far the most accurate species trees. However, *BEAST can only be run on small datasets. The second most accurate method, MRP (a standard supertree method), can analyze very large datasets and produces very good trees, making MRP a potentially acceptable alternative to *BEAST for large datasets.

Entities:  

Mesh:

Year:  2012        PMID: 22697236     DOI: 10.1089/cmb.2012.0037

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  9 in total

1.  Large-scale multiple sequence alignment and tree estimation using SATé.

Authors:  Kevin Liu; Tandy Warnow
Journal:  Methods Mol Biol       Date:  2014

2.  IDXL: Species Tree Inference Using Internode Distance and Excess Gene Leaf Count.

Authors:  Sourya Bhattacharyya; Jayanta Mukherjee
Journal:  J Mol Evol       Date:  2017-08-23       Impact factor: 2.395

3.  A Phylogenomic Perspective on the Radiation of Ray-Finned Fishes Based upon Targeted Sequencing of Ultraconserved Elements (UCEs).

Authors:  Brant C Faircloth; Laurie Sorenson; Francesco Santini; Michael E Alfaro
Journal:  PLoS One       Date:  2013-06-18       Impact factor: 3.240

Review 4.  Multilocus inference of species trees and DNA barcoding.

Authors:  Diego Mallo; David Posada
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2016-09-05       Impact factor: 6.237

5.  Rooting phylogenetic trees under the coalescent model using site pattern probabilities.

Authors:  Yuan Tian; Laura Kubatko
Journal:  BMC Evol Biol       Date:  2017-12-19       Impact factor: 3.260

6.  Machine learning based imputation techniques for estimating phylogenetic trees from incomplete distance matrices.

Authors:  Ananya Bhattacharjee; Md Shamsuzzoha Bayzid
Journal:  BMC Genomics       Date:  2020-07-20       Impact factor: 3.969

7.  STELAR: a statistically consistent coalescent-based species tree estimation method by maximizing triplet consistency.

Authors:  Mazharul Islam; Kowshika Sarker; Trisha Das; Rezwana Reaz; Md Shamsuzzoha Bayzid
Journal:  BMC Genomics       Date:  2020-02-10       Impact factor: 3.969

8.  Phylogenomics and Diversification of the Schistosomatidae Based on Targeted Sequence Capture of Ultra-Conserved Elements.

Authors:  Erika T Ebbs; Eric S Loker; Lijing Bu; Sean A Locke; Vasyl V Tkach; Ramesh Devkota; Veronica R Flores; Hudson A Pinto; Sara V Brant
Journal:  Pathogens       Date:  2022-07-05

9.  Comparing species tree estimation with large anchored phylogenomic and small Sanger-sequenced molecular datasets: an empirical study on Malagasy pseudoxyrhophiine snakes.

Authors:  Sara Ruane; Christopher J Raxworthy; Alan R Lemmon; Emily Moriarty Lemmon; Frank T Burbrink
Journal:  BMC Evol Biol       Date:  2015-10-12       Impact factor: 3.260

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.