Literature DB >> 24131054

Exact solutions for species tree inference from discordant gene trees.

Wen-Chieh Chang1, Paweł Górecki, Oliver Eulenstein.   

Abstract

Phylogenetic analysis has to overcome the grant challenge of inferring accurate species trees from evolutionary histories of gene families (gene trees) that are discordant with the species tree along whose branches they have evolved. Two well studied approaches to cope with this challenge are to solve either biologically informed gene tree parsimony (GTP) problems under gene duplication, gene loss, and deep coalescence, or the classic RF supertree problem that does not rely on any biological model. Despite the potential of these problems to infer credible species trees, they are NP-hard. Therefore, these problems are addressed by heuristics that typically lack any provable accuracy and precision. We describe fast dynamic programming algorithms that solve the GTP problems and the RF supertree problem exactly, and demonstrate that our algorithms can solve instances with data sets consisting of as many as 22 taxa. Extensions of our algorithms can also report the number of all optimal species trees, as well as the trees themselves. To better asses the quality of the resulting species trees that best fit the given gene trees, we also compute the worst case species trees, their numbers, and optimization score for each of the computational problems. Finally, we demonstrate the performance of our exact algorithms using empirical and simulated data sets, and analyze the quality of heuristic solutions for the studied problems by contrasting them with our exact solutions.

Entities:  

Mesh:

Year:  2013        PMID: 24131054     DOI: 10.1142/S0219720013420055

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  5 in total

1.  Are the duplication cost and Robinson-Foulds distance equivalent?

Authors:  Yu Zheng; Louxin Zhang
Journal:  J Comput Biol       Date:  2014-07-02       Impact factor: 1.479

2.  A multiple-dimension model for microbiota of patients with colorectal cancer from normal participants and other intestinal disorders.

Authors:  Jian Shen; Gulei Jin; Zhengliang Zhang; Jun Zhang; Yan Sun; Xiaoxiao Xie; Tingting Ma; Yongze Zhu; Yaoqiang Du; Yaofang Niu; Xinwei Shi
Journal:  Appl Microbiol Biotechnol       Date:  2022-02-26       Impact factor: 4.813

3.  Genomic duplication problems for unrooted gene trees.

Authors:  Jarosław Paszek; Paweł Górecki
Journal:  BMC Genomics       Date:  2016-01-11       Impact factor: 3.969

4.  Gene tree parsimony for incomplete gene trees: addressing true biological loss.

Authors:  Md Shamsuzzoha Bayzid; Tandy Warnow
Journal:  Algorithms Mol Biol       Date:  2018-01-19       Impact factor: 1.405

5.  Forcing external constraints on tree inference using ASTRAL.

Authors:  Maryam Rabiee; Siavash Mirarab
Journal:  BMC Genomics       Date:  2020-04-16       Impact factor: 3.969

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.