Literature DB >> 27587676

MetaProb: accurate metagenomic reads binning based on probabilistic sequence signatures.

Samuele Girotto1, Cinzia Pizzi1, Matteo Comin1.   

Abstract

MOTIVATION: Sequencing technologies allow the sequencing of microbial communities directly from the environment without prior culturing. Taxonomic analysis of microbial communities, a process referred to as binning, is one of the most challenging tasks when analyzing metagenomic reads data. The major problems are the lack of taxonomically related genomes in existing reference databases, the uneven abundance ratio of species and the limitations due to short read lengths and sequencing errors.
RESULTS: MetaProb is a novel assembly-assisted tool for unsupervised metagenomic binning. The novelty of MetaProb derives from solving a few important problems: how to divide reads into groups of independent reads, so that k-mer frequencies are not overestimated; how to convert k-mer counts into probabilistic sequence signatures, that will correct for variable distribution of k-mers, and for unbalanced groups of reads, in order to produce better estimates of the underlying genome statistic; how to estimate the number of species in a dataset. We show that MetaProb is more accurate and efficient than other state-of-the-art tools in binning both short reads datasets (F-measure 0.87) and long reads datasets (F-measure 0.97) for various abundance ratios. Also, the estimation of the number of species is more accurate than MetaCluster. On a real human stool dataset MetaProb identifies the most predominant species, in line with previous human gut studies.
AVAILABILITY AND IMPLEMENTATION: https://bitbucket.org/samu661/metaprob CONTACTS: cinzia.pizzi@dei.unipd.it or comin@dei.unipd.it SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2016        PMID: 27587676     DOI: 10.1093/bioinformatics/btw466

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  10 in total

Review 1.  Music of metagenomics-a review of its applications, analysis pipeline, and associated tools.

Authors:  Bilal Wajid; Faria Anwar; Imran Wajid; Haseeb Nisar; Sharoze Meraj; Ali Zafar; Mustafa Kamal Al-Shawaqfeh; Ali Riza Ekti; Asia Khatoon; Jan S Suchodolski
Journal:  Funct Integr Genomics       Date:  2021-10-18       Impact factor: 3.410

2.  Binning unassembled short reads based on k-mer abundance covariance using sparse coding.

Authors:  Olexiy Kyrgyzov; Vincent Prost; Stéphane Gazut; Bruno Farcy; Thomas Brüls
Journal:  Gigascience       Date:  2020-04-01       Impact factor: 6.524

3.  Metagenomics Investigation of Agarlytic Genes and Genomes in Mangrove Sediments in China: A Potential Repertory for Carbohydrate-Active Enzymes.

Authors:  Wu Qu; Dan Lin; Zhouhao Zhang; Wenjie Di; Boliang Gao; Runying Zeng
Journal:  Front Microbiol       Date:  2018-08-14       Impact factor: 5.640

4.  CLAME: a new alignment-based binning algorithm allows the genomic description of a novel Xanthomonadaceae from the Colombian Andes.

Authors:  Andres Benavides; Juan Pablo Isaza; Juan Pablo Niño-García; Juan Fernando Alzate; Felipe Cabarcas
Journal:  BMC Genomics       Date:  2018-12-11       Impact factor: 3.969

5.  Efficient computation of spaced seed hashing with block indexing.

Authors:  Samuele Girotto; Matteo Comin; Cinzia Pizzi
Journal:  BMC Bioinformatics       Date:  2018-11-30       Impact factor: 3.169

6.  FSH: fast spaced seed hashing exploiting adjacent hashes.

Authors:  Samuele Girotto; Matteo Comin; Cinzia Pizzi
Journal:  Algorithms Mol Biol       Date:  2018-03-22       Impact factor: 1.405

7.  Higher recall in metagenomic sequence classification exploiting overlapping reads.

Authors:  Samuele Girotto; Matteo Comin; Cinzia Pizzi
Journal:  BMC Genomics       Date:  2017-12-06       Impact factor: 3.969

8.  MetaBCC-LR: metagenomics binning by coverage and composition for long reads.

Authors:  Anuradha Wickramarachchi; Vijini Mallawaarachchi; Vaibhav Rajan; Yu Lin
Journal:  Bioinformatics       Date:  2020-07-01       Impact factor: 6.937

9.  Metagenomic analysis through the extended Burrows-Wheeler transform.

Authors:  Veronica Guerrini; Felipe A Louza; Giovanna Rosone
Journal:  BMC Bioinformatics       Date:  2020-09-16       Impact factor: 3.169

10.  Deconvolute individual genomes from metagenome sequences through short read clustering.

Authors:  Kexue Li; Yakang Lu; Li Deng; Lili Wang; Lizhen Shi; Zhong Wang
Journal:  PeerJ       Date:  2020-04-08       Impact factor: 2.984

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.