Literature DB >> 29177251

A concurrent subtractive assembly approach for identification of disease associated sub-metagenomes.

Wontack Han1, Mingjie Wang1, Yuzhen Ye1.   

Abstract

Comparative analysis of metagenomes can be used to detect sub-metagenomes (species or gene sets) that are associated with specific phenotypes (e.g., host status). The typical workflow is to assemble and annotate metagenomic datasets individually or as a whole, followed by statistical tests to identify differentially abundant species/genes. We previously developed subtractive assembly (SA), a de novo assembly approach for comparative metagenomics that first detects differential reads that distinguish between two groups of metagenomes and then only assembles these reads. Application of SA to type 2 diabetes (T2D) microbiomes revealed new microbial genes associated with T2D. Here we further developed a Concurrent Subtractive Assembly (CoSA) approach, which uses a Wilcoxon rank-sum (WRS) test to detect k-mers that are differentially abundant between two groups of microbiomes (by contrast, SA only checks ratios of k-mer counts in one pooled sample versus the other). It then uses identified differential k-mers to extract reads that are likely sequenced from the sub-metagenome with consistent abundance differences between the groups of microbiomes. Further, CoSA attempts to reduce the redundancy of reads (from abundant common species) by excluding reads containing abundant k-mers. Using simulated microbiome datasets and T2D datasets, we show that CoSA achieves strikingly better performance in detecting consistent changes than SA does, and it enables the detection and assembly of genomes and genes with minor abundance difference. A SVM classifier built upon the microbial genes detected by CoSA from the T2D datasets can accurately discriminates patients from healthy controls, with an AUC of 0.94 (10-fold cross-validation), and therefore these differential genes (207 genes) may serve as potential microbial marker genes for T2D.

Entities:  

Keywords:  Wilcoxon rank-sum test; comparative metagenomics; concurrent subtractive assembly; metagenome

Year:  2017        PMID: 29177251      PMCID: PMC5697791          DOI: 10.1007/978-3-319-56970-3_2

Source DB:  PubMed          Journal:  Res Comput Mol Biol


  45 in total

1.  MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets.

Authors:  Yu-Wei Wu; Blake A Simmons; Steven W Singer
Journal:  Bioinformatics       Date:  2015-10-29       Impact factor: 6.937

2.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

Review 3.  MEGAHIT v1.0: A fast and scalable metagenome assembler driven by advanced methodologies and community practices.

Authors:  Dinghua Li; Ruibang Luo; Chi-Man Liu; Chi-Ming Leung; Hing-Fung Ting; Kunihiko Sadakane; Hiroshi Yamashita; Tak-Wah Lam
Journal:  Methods       Date:  2016-03-21       Impact factor: 3.608

Review 4.  Global burden of cancers attributable to infections in 2008: a review and synthetic analysis.

Authors:  Catherine de Martel; Jacques Ferlay; Silvia Franceschi; Jérôme Vignat; Freddie Bray; David Forman; Martyn Plummer
Journal:  Lancet Oncol       Date:  2012-05-09       Impact factor: 41.316

5.  Potential of fecal microbiota for early-stage detection of colorectal cancer.

Authors:  Georg Zeller; Julien Tap; Anita Y Voigt; Shinichi Sunagawa; Jens Roat Kultima; Paul I Costea; Aurélien Amiot; Jürgen Böhm; Francesco Brunetti; Nina Habermann; Rajna Hercog; Moritz Koch; Alain Luciani; Daniel R Mende; Martin A Schneider; Petra Schrotz-King; Christophe Tournigand; Jeanne Tran Van Nhieu; Takuji Yamada; Jürgen Zimmermann; Vladimir Benes; Matthias Kloor; Cornelia M Ulrich; Magnus von Knebel Doeberitz; Iradj Sobhani; Peer Bork
Journal:  Mol Syst Biol       Date:  2014-11-28       Impact factor: 11.429

6.  Oral microbiome of deep and shallow dental pockets in chronic periodontitis.

Authors:  Xiuchun Ge; Rafael Rodriguez; My Trinh; John Gunsolley; Ping Xu
Journal:  PLoS One       Date:  2013-06-06       Impact factor: 3.240

7.  MetaSim: a sequencing simulator for genomics and metagenomics.

Authors:  Daniel C Richter; Felix Ott; Alexander F Auch; Ramona Schmid; Daniel H Huson
Journal:  PLoS One       Date:  2008-10-08       Impact factor: 3.240

8.  Reduced incidence of Prevotella and other fermenters in intestinal microflora of autistic children.

Authors:  Dae-Wook Kang; Jin Gyoon Park; Zehra Esra Ilhan; Garrick Wallstrom; Joshua Labaer; James B Adams; Rosa Krajmalnik-Brown
Journal:  PLoS One       Date:  2013-07-03       Impact factor: 3.240

9.  Metatranscriptomics of the human oral microbiome during health and disease.

Authors:  Peter Jorth; Keith H Turner; Pinar Gumus; Nejat Nizam; Nurcan Buduneli; Marvin Whiteley
Journal:  mBio       Date:  2014-04-01       Impact factor: 7.867

10.  Revised Estimates for the Number of Human and Bacteria Cells in the Body.

Authors:  Ron Sender; Shai Fuchs; Ron Milo
Journal:  PLoS Biol       Date:  2016-08-19       Impact factor: 8.029

View more
  6 in total

1.  MetaPheno: A critical evaluation of deep learning and machine learning in metagenome-based disease prediction.

Authors:  Nathan LaPierre; Chelsea J-T Ju; Guangyu Zhou; Wei Wang
Journal:  Methods       Date:  2019-03-16       Impact factor: 3.608

2.  A repository of microbial marker genes related to human health and diseases for host phenotype prediction using microbiome data.

Authors:  Wontack Han; Yuzhen Ye
Journal:  Pac Symp Biocomput       Date:  2019

3.  Information Theoretic Metagenome Assembly Allows the Discovery of Disease Biomarkers in Human Microbiome.

Authors:  O Ufuk Nalbantoglu
Journal:  Entropy (Basel)       Date:  2021-02-02       Impact factor: 2.524

Review 4.  Machine Learning and Deep Learning Applications in Metagenomic Taxonomy and Functional Annotation.

Authors:  Alban Mathieu; Mickael Leclercq; Melissa Sanabria; Olivier Perin; Arnaud Droit
Journal:  Front Microbiol       Date:  2022-03-14       Impact factor: 5.640

5.  Locality-Sensitive Hashing-Based k-Mer Clustering for Identification of Differential Microbial Markers Related to Host Phenotype.

Authors:  Wontack Han; Haixu Tang; Yuzhen Ye
Journal:  J Comput Biol       Date:  2022-05-17       Impact factor: 1.549

6.  Identifying Group-Specific Sequences for Microbial Communities Using Long k-mer Sequence Signatures.

Authors:  Ying Wang; Lei Fu; Jie Ren; Zhaoxia Yu; Ting Chen; Fengzhu Sun
Journal:  Front Microbiol       Date:  2018-05-03       Impact factor: 5.640

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.