Rapid alignment-free phylogenetic identification of metagenomic sequences.

Benjamin Linard, Krister Swenson, Fabio Pardi.

Abstract

MOTIVATION: Taxonomic classification is at the core of environmental DNA analysis. When a phylogenetic tree can be built as a prior hypothesis to such classification, phylogenetic placement (PP) provides the most informative type of classification because each query sequence is assigned to its putative origin in the tree. This is useful whenever precision is sought (e.g. in diagnostics). However, likelihood-based PP algorithms struggle to scale with the ever-increasing throughput of DNA sequencing.
RESULTS: We have developed RAPPAS (Rapid Alignment-free Phylogenetic Placement via Ancestral Sequences) which uses an alignment-free approach, removing the hurdle of query sequence alignment as a preliminary step to PP. Our approach relies on the precomputation of a database of k-mers that may be present with non-negligible probability in relatives of the reference sequences. The placement is performed by inspecting the stored phylogenetic origins of the k-mers in the query, and their probabilities. The database can be reused for the analysis of several different metagenomes. Experiments show that the first implementation of RAPPAS is already faster than competing likelihood-based PP algorithms, while keeping similar accuracy for short reads. RAPPAS scales PP for the era of routine metagenomic diagnostics.
AVAILABILITY AND IMPLEMENTATION: Program and sources freely available for download at https://github.com/blinard-BIOINFO/RAPPAS.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
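
The placement idea summarized under RESULTS (look up each k-mer of the query in a precomputed database, then combine the stored probabilities per phylogenetic origin) can be illustrated with a toy sketch. This is not the RAPPAS implementation: the database contents, the branch names, the fallback probability `DEFAULT_PROB`, and the rule of summing log-probabilities per branch are all assumptions made for illustration only.

```python
import math

# Hypothetical toy database: k-mer -> {branch id: probability}.
# In the approach described in the abstract this would be precomputed once
# from the reference tree; the values here are invented for illustration.
K = 4
TOY_DB = {
    "ACGT": {"branch_A": 0.9, "branch_B": 0.2},
    "CGTA": {"branch_A": 0.8},
    "GTAC": {"branch_B": 0.7},
}
# Assumed fallback probability for a k-mer that is not stored for a branch.
DEFAULT_PROB = 0.01


def kmers(seq, k):
    """Yield every overlapping k-mer of seq."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def place(query, db, k=K, default=DEFAULT_PROB):
    """Return (best_branch, scores); scores are summed log-probabilities."""
    query_kmers = list(kmers(query, k))
    # Only branches hit by at least one query k-mer are scored.
    branches = {b for km in query_kmers for b in db.get(km, {})}
    scores = {
        b: sum(math.log(db.get(km, {}).get(b, default)) for km in query_kmers)
        for b in branches
    }
    best = max(scores, key=scores.get) if scores else None
    return best, scores


best_branch, branch_scores = place("ACGTA", TOY_DB)
```

Because the lookup table is built before any query arrives, the same `TOY_DB` can be reused for every read, which mirrors the abstract's point that the database can serve several metagenomes. No alignment of the query to the references is performed at any step.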

Year:  2019        PMID: 30698645     DOI: 10.1093/bioinformatics/btz068

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  8 in total

1.  Phylogeny Estimation Given Sequence Length Heterogeneity.

Authors:  Vladimir Smirnov; Tandy Warnow
Journal:  Syst Biol       Date:  2021-02-10       Impact factor: 15.683

2.  Read-SpaM: assembly-free and alignment-free comparison of bacterial genomes with low sequencing coverage.

Authors:  Anna-Katharina Lau; Svenja Dörrer; Chris-André Leimeister; Christoph Bleidorn; Burkhard Morgenstern
Journal:  BMC Bioinformatics       Date:  2019-12-17       Impact factor: 3.169

3.  Genesis and Gappa: processing, analyzing and visualizing phylogenetic (placement) data.

Authors:  Lucas Czech; Pierre Barbera; Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2020-05-01       Impact factor: 6.937

4.  Rapid screening and detection of inter-type viral recombinants using Phylo-K-Mers.

Authors:  Guillaume E Scholz; Benjamin Linard; Nikolai Romashchenko; Eric Rivals; Fabio Pardi
Journal:  Bioinformatics       Date:  2020-12-17       Impact factor: 6.937

5.  Genome-wide alignment-free phylogenetic distance estimation under a no strand-bias model.

Authors:  Metin Balaban; Nishat Anjum Bristy; Ahnaf Faisal; Md Shamsuzzoha Bayzid; Siavash Mirarab
Journal:  Bioinform Adv       Date:  2022-08-12

6.  Recent progress on methods for estimating and updating large phylogenies. (Review)

Authors:  Paul Zaharias; Tandy Warnow
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2022-08-22       Impact factor: 6.671

7.  Distance-Based Phylogenetic Placement with Statistical Support.

Authors:  Navid Bin Hasan; Metin Balaban; Avijit Biswas; Md Shamsuzzoha Bayzid; Siavash Mirarab
Journal:  Biology (Basel)       Date:  2022-08-12

8.  The number of k-mer matches between two DNA sequences as a function of k and applications to estimate phylogenetic distances.

Authors:  Sophie Röhling; Alexander Linne; Jendrik Schellhorn; Morteza Hosseini; Thomas Dencker; Burkhard Morgenstern
Journal:  PLoS One       Date:  2020-02-10       Impact factor: 3.240
