Literature DB >> 35590416

Embedding gene trees into phylogenetic networks by conflict resolution algorithms.

Marcin Wawerka1, Dawid Dąbkowski2, Natalia Rutecka2, Agnieszka Mykowiecka2, Paweł Górecki3.   

Abstract

BACKGROUND: Phylogenetic networks are mathematical models of evolutionary processes involving reticulate events such as hybridization, recombination, or horizontal gene transfer. One of the crucial notions in phylogenetic network modelling is displayed tree, which is obtained from a network by removing a set of reticulation edges. Displayed trees may represent an evolutionary history of a gene family if the evolution is shaped by reticulation events.
RESULTS: We address the problem of inferring an optimal tree displayed by a network, given a gene tree G and a tree-child network N, under the deep coalescence and duplication costs. We propose an O(mn)-time dynamic programming algorithm (DP) to compute a lower bound of the optimal displayed tree cost, where m and n are the sizes of G and N, respectively. In addition, our algorithm can verify whether the solution is exact. Moreover, it provides a set of reticulation edges corresponding to the obtained cost. If the cost is exact, the set induces an optimal displayed tree. Otherwise, the set contains pairs of conflicting edges, i.e., edges sharing a reticulation node. Next, we show a conflict resolution algorithm that requires [Formula: see text] invocations of DP in the worst case, where r is the number of reticulations. We propose a similar [Formula: see text]-time algorithm for level-k tree-child networks and a branch and bound solution to compute lower and upper bounds of optimal costs. We also extend the algorithms to a broader class of phylogenetic networks. Based on simulated data, the average runtime is [Formula: see text] under the deep-coalescence cost and [Formula: see text] under the duplication cost.
CONCLUSIONS: Despite exponential complexity in the worst case, our algorithms perform significantly well on empirical and simulated datasets, due to the strategy of resolving internal dissimilarities between gene trees and networks. Therefore, the algorithms are efficient alternatives to enumeration strategies commonly proposed in the literature and enable analyses of complex networks with dozens of reticulations.
© 2022. The Author(s).

Entities:  

Keywords:  Deep coalescence; Gene tree; Optimal displayed tree; Phylogenetic network; Reticulation; Species tree; Tree-child network

Year:  2022        PMID: 35590416      PMCID: PMC9119282          DOI: 10.1186/s13015-022-00218-8

Source DB:  PubMed          Journal:  Algorithms Mol Biol        ISSN: 1748-7188            Impact factor:   1.721


  44 in total

1.  Unified modeling of gene duplication, loss, and coalescence using a locus tree.

Authors:  Matthew D Rasmussen; Manolis Kellis
Journal:  Genome Res       Date:  2012-01-23       Impact factor: 9.043

2.  Maximizing Deep Coalescence Cost.

Authors:  Paweł Górecki; Oliver Eulenstein
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2014 Jan-Feb       Impact factor: 3.710

3.  Reconciling event-labeled gene trees with MUL-trees and species networks.

Authors:  Marc Hellmuth; Katharina T Huber; Vincent Moulton
Journal:  J Math Biol       Date:  2019-08-13       Impact factor: 2.259

4.  Improved Maximum Parsimony Models for Phylogenetic Networks.

Authors:  Leo Van Iersel; Mark Jones; Celine Scornavacca
Journal:  Syst Biol       Date:  2018-05-01       Impact factor: 15.683

5.  TreeKO: a duplication-aware algorithm for the comparison of phylogenetic trees.

Authors:  Marina Marcet-Houben; Toni Gabaldón
Journal:  Nucleic Acids Res       Date:  2011-02-18       Impact factor: 16.971

6.  Which Phylogenetic Networks are Merely Trees with Additional Arcs?

Authors:  Andrew R Francis; Mike Steel
Journal:  Syst Biol       Date:  2015-06-11       Impact factor: 15.683

7.  In the light of deep coalescence: revisiting trees within networks.

Authors:  Jiafan Zhu; Yun Yu; Luay Nakhleh
Journal:  BMC Bioinformatics       Date:  2016-11-11       Impact factor: 3.169

8.  GISAID: Global initiative on sharing all influenza data - from vision to reality.

Authors:  Yuelong Shu; John McCauley
Journal:  Euro Surveill       Date:  2017-03-30

9.  Genomic duplication problems for unrooted gene trees.

Authors:  Jarosław Paszek; Paweł Górecki
Journal:  BMC Genomics       Date:  2016-01-11       Impact factor: 3.969

10.  SimPhy: Phylogenomic Simulation of Gene, Locus, and Species Trees.

Authors:  Diego Mallo; Leonardo De Oliveira Martins; David Posada
Journal:  Syst Biol       Date:  2015-11-01       Impact factor: 15.683

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.