Literature DB >> 35485976

DEPP: Deep Learning Enables Extending Species Trees using Single Genes.

Yueyu Jiang1, Metin Balaban2, Qiyun Zhu3, Siavash Mirarab1.   

Abstract

Placing new sequences onto reference phylogenies is increasingly used for analyzing environmental samples, especially microbiomes. Existing placement methods assume that query sequences have evolved under specific models directly on the reference phylogeny. For example, they assume single-gene data (e.g., 16S rRNA amplicons) have evolved under the GTR model on a gene tree. Placement, however, often has a more ambitious goal: extending a (genome-wide) species tree given data from individual genes without knowing the evolutionary model. Addressing this challenging problem requires new directions. Here, we introduce Deep-learning Enabled Phylogenetic Placement (DEPP), an algorithm that learns to extend species trees using single genes without pre-specified models. In simulations and on real data, we show that DEPP can match the accuracy of model-based methods without any prior knowledge of the model. We also show that DEPP can update the multi-locus microbial tree-of-life with single genes with high accuracy. We further demonstrate that DEPP can combine 16S and metagenomic data onto a single tree, enabling community structure analyses that take advantage of both sources of data.
© The Author(s) 2022. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  Deep learning; Gene tree discordance; Metagenomics; Microbiome analyses; Neural networks; Phylogenetic placement

Year:  2022        PMID: 35485976     DOI: 10.1093/sysbio/syac031

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   9.160


  3 in total

1.  Phylogeny-Aware Analysis of Metagenome Community Ecology Based on Matched Reference Genomes while Bypassing Taxonomy.

Authors:  Qiyun Zhu; Shi Huang; Antonio Gonzalez; Imran McGrath; Daniel McDonald; Niina Haiminen; George Armstrong; Yoshiki Vázquez-Baeza; Julian Yu; Justin Kuczynski; Gregory D Sepich-Poore; Austin D Swafford; Promi Das; Justin P Shaffer; Franck Lejzerowicz; Pedro Belda-Ferre; Aki S Havulinna; Guillaume Méric; Teemu Niiranen; Leo Lahti; Veikko Salomaa; Ho-Cheol Kim; Mohit Jain; Michael Inouye; Jack A Gilbert; Rob Knight
Journal:  mSystems       Date:  2022-04-04       Impact factor: 7.324

Review 2.  Recent progress on methods for estimating and updating large phylogenies.

Authors:  Paul Zaharias; Tandy Warnow
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2022-08-22       Impact factor: 6.671

3.  Distance-Based Phylogenetic Placement with Statistical Support.

Authors:  Navid Bin Hasan; Metin Balaban; Avijit Biswas; Md Shamsuzzoha Bayzid; Siavash Mirarab
Journal:  Biology (Basel)       Date:  2022-08-12
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.