Literature DB >> 35385988

Identifiability of species network topologies from genomic sequences using the logDet distance.

Elizabeth S Allman1, Hector Baños2,3, John A Rhodes4.   

Abstract

Inference of network-like evolutionary relationships between species from genomic data must address the interwoven signals from both gene flow and incomplete lineage sorting. The heavy computational demands of standard approaches to this problem severely limit the size of datasets that may be analyzed, in both the number of species and the number of genetic loci. Here we provide a theoretical pointer to more efficient methods, by showing that logDet distances computed from genomic-scale sequences retain sufficient information to recover network relationships in the level-1 ultrametric case. This result is obtained under the Network Multispecies Coalescent model combined with a mixture of General Time-Reversible sequence evolution models across individual gene trees. It applies to both unlinked site data, such as for SNPs, and to sequence data in which many contiguous sites may have evolved on a common tree, such as concatenated gene sequences. Thus under standard stochastic models statistically justifiable inference of network relationships from sequences can be accomplished without consideration of individual genes or gene trees.
© 2022. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.

Entities:  

Keywords:  Identifiability; LogDet; Phylogenetic inference; Species network

Mesh:

Year:  2022        PMID: 35385988      PMCID: PMC9192096          DOI: 10.1007/s00285-022-01734-2

Source DB:  PubMed          Journal:  J Math Biol        ISSN: 0303-6812            Impact factor:   2.164


  23 in total

1.  All that glisters is not galled.

Authors:  Francesc Rosselló; Gabriel Valiente
Journal:  Math Biosci       Date:  2009-07-02       Impact factor: 2.144

2.  Defining Species When There is Gene Flow.

Authors:  Xiyun Jiao; Ziheng Yang
Journal:  Syst Biol       Date:  2020-07-03       Impact factor: 15.683

3.  Identifying Species Network Features from Gene Tree Quartets Under the Coalescent Model.

Authors:  Hector Baños
Journal:  Bull Math Biol       Date:  2018-08-09       Impact factor: 1.758

4.  Coestimating Reticulate Phylogenies and Gene Trees from Multilocus Sequence Data.

Authors:  Dingqiao Wen; Luay Nakhleh
Journal:  Syst Biol       Date:  2018-05-01       Impact factor: 15.683

5.  Topological Metrizations of Trees, and New Quartet Methods of Tree Inference.

Authors:  John A Rhodes
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2020-12-08       Impact factor: 3.710

6.  Gene Tree Discord, Simplex Plots, and Statistical Tests under the Coalescent.

Authors:  Elizabeth S Allman; Jonathan D Mitchell; John A Rhodes
Journal:  Syst Biol       Date:  2022-06-16       Impact factor: 9.160

7.  Bayesian Inference of Species Networks from Multilocus Sequence Data.

Authors:  Chi Zhang; Huw A Ogilvie; Alexei J Drummond; Tanja Stadler
Journal:  Mol Biol Evol       Date:  2018-02-01       Impact factor: 16.240

8.  NANUQ: a method for inferring species networks from gene trees under the coalescent model.

Authors:  Elizabeth S Allman; Hector Baños; John A Rhodes
Journal:  Algorithms Mol Biol       Date:  2019-12-06       Impact factor: 1.405

9.  A maximum pseudo-likelihood approach for phylogenetic networks.

Authors:  Yun Yu; Luay Nakhleh
Journal:  BMC Genomics       Date:  2015-10-02       Impact factor: 3.969

10.  Quarnet Inference Rules for Level-1 Networks.

Authors:  Katharina T Huber; Vincent Moulton; Charles Semple; Taoyang Wu
Journal:  Bull Math Biol       Date:  2018-06-04       Impact factor: 1.758

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.