| Literature DB >> 31159881 |
Luis Pedro Coelho1,2,3, Renato Alves1,4, Paulo Monteiro5, Jaime Huerta-Cepas1,6, Ana Teresa Freitas5, Peer Bork7,8,9,10.
Abstract
BACKGROUND: Shotgun metagenomes contain a sample of all the genomic material in an environment, allowing for the characterization of a microbial community. In order to understand these communities, bioinformatics methods are crucial. A common first step in processing metagenomes is to compute abundance estimates of different taxonomic or functional groups from the raw sequencing data. Given the breadth of the field, computational solutions need to be flexible and extensible, enabling the combination of different tools into a larger pipeline.Entities:
Keywords: Domain-specific language; Metagenomics; Next-generation sequencing
Year: 2019 PMID: 31159881 PMCID: PMC6547473 DOI: 10.1186/s40168-019-0684-8
Source DB: PubMed Journal: Microbiome ISSN: 2049-2618 Impact factor: 14.650
Fig. 1Schematic illustration of NG-meta-profiler
Gene catalogs bundled with NG-meta-profiler
| Database | Size (million genes) | Comment |
|---|---|---|
| igc | 9.9 | Integrated gene catalog for the human gut [ |
| om-rgc | 40 | Ocean microbial gene catalog [ |
| mouse-gut | 2.6 | Gene catalog of the mouse gut [ |
| pig-gut | 7.7 | Gene catalog of the pig gut [ |
| dog-gut | 1.2 | Gene catalog of the dog gut [ |
NGLess built-in modules that add extra functionality
| Module name | Comment |
|---|---|
| parallel | Process multiple samples in parallel |
| mocat | Compatibility with MOCAT/MOCAT2 [ |
| specI | specI profiling (reference based metagenomics taxonomic profiling [ |
| motu | mOTU profiling (taxonomic profiling of metagenomes [ |
| minimap2 | minimap2 mapper [ |
Fig. 2Timing comparison of NGLess and other tools. Three replicates are shown for each tool. The bwa/htseq-count pipeline does not include preprocess and filtering steps
Quality of results based on simulated data
| Environment | Tool | Mean | Std. dev. |
|---|---|---|---|
| Human gut | NGLess | 88.44 | 2.07 |
| MOCAT2 | 87.32 | 2.04 | |
| bwa/htseq-count | 85.96 | 1.84 | |
| Marine | NGLess | 82.26 | 6.29 |
| MOCAT2 | 83.07 | 6.42 | |
| bwa/htseq-count | 82.46 | 6.32 |
Shown are the average and std. dev. (over 8 simulations for each environment) of the Spearman rho between the output of each tool and the ground truth
Fig. 3Abridged version of the human profiler, written in the NGLess language