Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 PhyloMagnet: fast and accurate screening of short-read meta-omics data using gene-centric phylogenetics.

Literature DB >> 31647547

PhyloMagnet: fast and accurate screening of short-read meta-omics data using gene-centric phylogenetics.

Max E Schön¹, Laura Eme^1,2, Thijs J G Ettema^1,3.

Abstract

MOTIVATION: Metagenomic and metatranscriptomic sequencing have become increasingly popular tools for producing massive amounts of short-read data, often used for the reconstruction of draft genomes or the detection of (active) genes in microbial communities. Unfortunately, sequence assemblies of such datasets generally remain a computationally challenging task. Frequently, researchers are only interested in a specific group of organisms or genes; yet, the assembly of multiple datasets only to identify candidate sequences for a specific question is sometimes prohibitively slow, forcing researchers to select a subset of available datasets to address their question. Here, we present PhyloMagnet, a workflow to screen meta-omics datasets for taxa and genes of interest using gene-centric assembly and phylogenetic placement of sequences.
RESULTS: Using PhyloMagnet, we could identify up to 87% of the genera in an in vitro mock community with variable abundances, while the false positive predictions per single gene tree ranged from 0 to 23%. When applied to a group of metagenomes for which a set of metagenome assembled genomes (MAGs) have been published, we could detect the majority of the taxonomic labels that the MAGs had been annotated with. In a metatranscriptomic setting, the phylogenetic placement of assembled contigs corresponds to that of transcripts obtained from transcriptome assembly.
AVAILABILITY AND IMPLEMENTATION: PhyloMagnet is built using Nextflow, available at github.com/maxemil/PhyloMagnet and is developed and tested on Linux. It is released under the open source GNU GPL licence and documentation is available at phylomagnet.readthedocs.io. Version 0.5 of PhyloMagnet was used for all benchmarking experiments. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities: CellLine Chemical Disease Species

Year: 2020 PMID： 31647547 DOI： 10.1093/bioinformatics/btz799

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

2 in total

1. Single cell genomics reveals plastid-lacking Picozoa are close relatives of red algae.

Authors: Max E Schön; Vasily V Zlatogursky; Rohan P Singh; Camille Poirier; Susanne Wilken; Varsha Mathur; Jürgen F H Strassert; Jarone Pinhassi; Alexandra Z Worden; Patrick J Keeling; Thijs J G Ettema; Jeremy G Wideman; Fabien Burki
Journal: Nat Commun Date: 2021-11-17 Impact factor: 14.919

2. Tiara: Deep learning-based classification system for eukaryotic sequences.

Authors: Michał Karlicki; Stanisław Antonowicz; Anna Karnkowska
Journal: Bioinformatics Date: 2021-09-27 Impact factor: 6.937

2 in total