
MSC: a metagenomic sequence classification algorithm.

Subrata Saha, Jethro Johnson, Soumitra Pal, George M Weinstock, Sanguthevar Rajasekaran.

Abstract

MOTIVATION: Metagenomics is the study of genetic material sampled directly from natural habitats. It has the potential to reveal previously hidden diversity of microscopic life, largely thanks to highly parallel, low-cost next-generation sequencing technology. Conventional approaches align metagenomic reads onto known reference genomes to identify the microbes in a sample. Because such a collection of reference genomes is very large, this approach often needs high-end computing machines with large memory, which are often unavailable to researchers. Alternative approaches follow an alignment-free methodology in which the presence of a microbe is predicted from the unique k-mers present in the microbial genomes. However, such approaches suffer from high false-positive rates because the value of k has to be traded off against the available computational resources. In this article, we propose a highly efficient metagenomic sequence classification (MSC) algorithm that is a hybrid of both approaches. Instead of aligning reads to the full genomes, MSC aligns reads onto a set of carefully chosen, shorter and highly discriminating model sequences built from the unique k-mers of each reference sequence.
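The hybrid idea described above can be illustrated with a toy sketch. This is not the authors' implementation: the function names (`unique_kmers`, `model_sequences`, `assign_read`), the rule of merging runs of consecutive unique k-mers into one model sequence, and the use of exact k-mer matching as a stand-in for read alignment are all assumptions made for illustration. Reverse complements and sequencing errors are ignored.

```python
from collections import defaultdict


def unique_kmers(references, k):
    """Map each reference name to the k-mers that occur in no other reference."""
    owners = defaultdict(set)
    for name, seq in references.items():
        for i in range(len(seq) - k + 1):
            owners[seq[i:i + k]].add(name)
    unique = {name: set() for name in references}
    for kmer, names in owners.items():
        if len(names) == 1:
            unique[next(iter(names))].add(kmer)
    return unique


def model_sequences(seq, unique, k):
    """Merge runs of consecutive unique k-mers into longer stretches.

    Assumption: a 'model sequence' is a maximal stretch of the reference
    covered by back-to-back unique k-mers. The paper may build them differently.
    """
    models, start, prev = [], None, None
    for i in range(len(seq) - k + 1):
        if seq[i:i + k] in unique:
            if start is None:
                start = i
            prev = i
        elif start is not None:
            models.append(seq[start:prev + k])
            start = None
    if start is not None:
        models.append(seq[start:prev + k])
    return models


def assign_read(read, models_by_ref, k):
    """Crude stand-in for alignment: a read hits a reference if any of its
    k-mers occurs exactly in one of that reference's model sequences."""
    hits = set()
    for name, models in models_by_ref.items():
        read_kmers = (read[i:i + k] for i in range(len(read) - k + 1))
        if any(kmer in model for kmer in read_kmers for model in models):
            hits.add(name)
    return hits


# Toy usage with two made-up references that share a common prefix.
references = {"A": "ACGTACGTTT", "B": "ACGTACGGGG"}
k = 4
unique = unique_kmers(references, k)
models_by_ref = {
    name: model_sequences(seq, unique[name], k)
    for name, seq in references.items()
}
```

In this toy case the shared prefix contributes no model sequence, so only the distinguishing tail of each reference is kept. That is the point of the approach as the abstract describes it: reads are compared against a much smaller target than the full genomes.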
RESULTS: Microbiome researchers are generally interested in two objectives of a taxonomic classifier: (i) to detect prevalence, i.e. the taxa present in a sample, and (ii) to estimate their relative abundances. MSC is primarily designed to detect prevalence, and experimental results show that it is more effective and efficient than other state-of-the-art algorithms in terms of accuracy, memory and runtime. In addition, MSC outputs an approximate estimate of the abundances.
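The two objectives named above, prevalence and relative abundance, can be separated in a few lines. The sketch below is a hypothetical post-processing step, not MSC's own procedure: the function name `summarize`, the `min_reads` threshold and the read-count normalization are assumptions, and genome-length correction is deliberately left out.

```python
from collections import Counter


def summarize(assignments, min_reads=1):
    """Return (present taxa, relative abundances) from per-read taxon labels.

    `assignments` holds one label per read; None marks an unclassified read.
    A taxon counts as present if at least `min_reads` reads were assigned
    to it (an assumed rule). Abundances are read-count fractions among the
    present taxa only.
    """
    counts = Counter(label for label in assignments if label is not None)
    present = {taxon for taxon, n in counts.items() if n >= min_reads}
    total = sum(counts[taxon] for taxon in present)
    abundance = {taxon: counts[taxon] / total for taxon in present} if total else {}
    return present, abundance


# Toy usage: four reads, one of them unclassified.
present, abundance = summarize(["A", "A", "B", None])
strict_present, strict_abundance = summarize(["A", "A", "B", None], min_reads=2)
```

Raising `min_reads` trades sensitivity for fewer false-positive taxa, which affects the prevalence call directly and the abundance estimate only through which taxa remain in the denominator.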
AVAILABILITY AND IMPLEMENTATION: The implementations are freely available for non-commercial purposes. They can be downloaded from https://drive.google.com/open?id=1XirkAamkQ3ltWvI1W1igYQFusp9DHtVl.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Year: 2019  PMID: 30649204  PMCID: PMC6931357  DOI: 10.1093/bioinformatics/bty1071

Source DB: PubMed  Journal: Bioinformatics  ISSN: 1367-4803  Impact factor: 6.937


