Literature DB >> 36127929

DIRICHLET-TREE MULTINOMIAL MIXTURES FOR CLUSTERING MICROBIOME COMPOSITIONS.

Jialiang Mao1, L I Ma1.   

Abstract

Studying the human microbiome has gained substantial interest in recent years, and a common task in the analysis of these data is to cluster microbiome compositions into subtypes. This subdivision of samples into subgroups serves as an intermediary step in achieving personalized diagnosis and treatment. In applying existing clustering methods to modern microbiome studies including the American Gut Project (AGP) data, we found that this seemingly standard task, however, is very challenging in the microbiome composition context due to several key features of such data. Standard distance-based clustering algorithms generally do not produce reliable results as they do not take into account the heterogeneity of the cross-sample variability among the bacterial taxa, while existing model-based approaches do not allow sufficient flexibility for the identification of complex within-cluster variation from cross-cluster variation. Direct applications of such methods generally lead to overly dispersed clusters in the AGP data and such phenomenon is common for other microbiome data. To overcome these challenges, we introduce Dirichlet-tree multinomial mixtures (DTMM) as a Bayesian generative model for clustering amplicon sequencing data in microbiome studies. DTMM models the microbiome population with a mixture of Dirichlet-tree kernels that utilizes the phylogenetic tree to offer a more flexible covariance structure in characterizing within-cluster variation, and it provides a means for identifying a subset of signature taxa that distinguish the clusters. We perform extensive simulation studies to evaluate the performance of DTMM and compare it to state-of-the-art model-based and distance-based clustering methods in the microbiome context, and carry out a validation study on a publicly available longitudinal data set to confirm the biological relevance of the clusters. Finally, we report a case study on the fecal data from the AGP to identify compositional clusters among individuals with inflammatory bowel disease and diabetes. Among our most interesting findings is that enterotypes (i.e., gut microbiome clusters) are not always defined by the most dominant species as previous analyses had assumed, but can involve a number of less abundant OTUs, which cannot be identified with existing distance-based and method-based approaches.

Entities:  

Keywords:  Bayesian hierarchical models; Compositional data; Latent variable models; Probabilistic learning

Year:  2022        PMID: 36127929      PMCID: PMC9484567          DOI: 10.1214/21-aoas1552

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   1.959


  21 in total

1.  The impact of Crohn's disease genes on healthy human gut microbiota: a pilot study.

Authors:  Christopher Quince; Elin E Lundin; Anna N Andreasson; Dario Greco; Joseph Rafter; Nicholas J Talley; Lars Agreus; Anders F Andersson; Lars Engstrand; Mauro D'Amato
Journal:  Gut       Date:  2013-01-07       Impact factor: 23.059

Review 2.  The microbiome in inflammatory bowel disease: current status and the future ahead.

Authors:  Aleksandar D Kostic; Ramnik J Xavier; Dirk Gevers
Journal:  Gastroenterology       Date:  2014-02-19       Impact factor: 22.682

Review 3.  Enterotypes in the landscape of gut microbial community composition.

Authors:  Paul I Costea; Falk Hildebrand; Manimozhiyan Arumugam; Fredrik Bäckhed; Martin J Blaser; Frederic D Bushman; Willem M de Vos; S Dusko Ehrlich; Claire M Fraser; Masahira Hattori; Curtis Huttenhower; Ian B Jeffery; Dan Knights; James D Lewis; Ruth E Ley; Howard Ochman; Paul W O'Toole; Christopher Quince; David A Relman; Fergus Shanahan; Shinichi Sunagawa; Jun Wang; George M Weinstock; Gary D Wu; Georg Zeller; Liping Zhao; Jeroen Raes; Rob Knight; Peer Bork
Journal:  Nat Microbiol       Date:  2017-12-18       Impact factor: 17.745

4.  A Dirichlet-tree multinomial regression model for associating dietary nutrients with gut microorganisms.

Authors:  Tao Wang; Hongyu Zhao
Journal:  Biometrics       Date:  2017-01-23       Impact factor: 2.571

5.  Linking long-term dietary patterns with gut microbial enterotypes.

Authors:  Gary D Wu; Jun Chen; Christian Hoffmann; Kyle Bittinger; Ying-Yu Chen; Sue A Keilbaugh; Meenakshi Bewtra; Dan Knights; William A Walters; Rob Knight; Rohini Sinha; Erin Gilroy; Kernika Gupta; Robert Baldassano; Lisa Nessel; Hongzhe Li; Frederic D Bushman; James D Lewis
Journal:  Science       Date:  2011-09-01       Impact factor: 47.728

6.  Bayesian community-wide culture-independent microbial source tracking.

Authors:  Dan Knights; Justin Kuczynski; Emily S Charlson; Jesse Zaneveld; Michael C Mozer; Ronald G Collman; Frederic D Bushman; Rob Knight; Scott T Kelley
Journal:  Nat Methods       Date:  2011-07-17       Impact factor: 28.547

7.  UniFrac: a new phylogenetic method for comparing microbial communities.

Authors:  Catherine Lozupone; Rob Knight
Journal:  Appl Environ Microbiol       Date:  2005-12       Impact factor: 4.792

8.  Hypothesis testing and power calculations for taxonomic-based human microbiome data.

Authors:  Patricio S La Rosa; J Paul Brooks; Elena Deych; Edward L Boone; David J Edwards; Qin Wang; Erica Sodergren; George Weinstock; William D Shannon
Journal:  PLoS One       Date:  2012-12-20       Impact factor: 3.240

9.  A guide to enterotypes across the human body: meta-analysis of microbial community structures in human microbiome datasets.

Authors:  Omry Koren; Dan Knights; Antonio Gonzalez; Levi Waldron; Nicola Segata; Rob Knight; Curtis Huttenhower; Ruth E Ley
Journal:  PLoS Comput Biol       Date:  2013-01-10       Impact factor: 4.475

Review 10.  Context and the human microbiome.

Authors:  Daniel McDonald; Amanda Birmingham; Rob Knight
Journal:  Microbiome       Date:  2015-11-04       Impact factor: 14.650

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.