
Bayesian biclustering for microbial metagenomic sequencing data via multinomial matrix factorization.

Fangting Zhou, Kejun He, Qiwei Li, Robert S Chapkin, Yang Ni.

Abstract

High-throughput sequencing technology provides unprecedented opportunities to quantitatively explore the human gut microbiome and its relation to diseases. Microbiome data are compositional, sparse, noisy, and heterogeneous, properties that pose serious challenges for statistical modeling. We propose an identifiable Bayesian multinomial matrix factorization model to infer overlapping clusters on both microbes and hosts. The proposed method represents the observed over-dispersed, zero-inflated count matrix as Dirichlet-multinomial mixtures on which latent cluster structures are built hierarchically. Under the Bayesian framework, the number of clusters is determined automatically, and available information from a taxonomic rank tree of microbes is naturally incorporated, which greatly improves the interpretability of our findings. We demonstrate the utility of the proposed approach by comparing it with alternative methods in simulations. An application to a human gut microbiome data set involving patients with inflammatory bowel disease reveals interesting clusters containing the bacterial families Bacteroidaceae, Bifidobacteriaceae, Enterobacteriaceae, Fusobacteriaceae, Lachnospiraceae, Ruminococcaceae, Pasteurellaceae, and Porphyromonadaceae, which are known to be related to inflammatory bowel disease and its subtypes according to the biological literature. Our findings can help generate hypotheses for future investigation of the heterogeneity of the human gut microbiome.
© The Author 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
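
The abstract describes the count matrix as over-dispersed and zero-inflated and models it with Dirichlet-multinomial mixtures. The following is a minimal sketch of why a Dirichlet-multinomial produces more zeros than a plain multinomial; it is not the authors' model (it has no clustering, no taxonomic tree, and no mixture structure), and every function name, the number of taxa, the sequencing depth, and the concentration values are assumptions chosen only for illustration.

```python
import random
from collections import Counter


def sample_dirichlet(alpha, rng):
    """Draw one probability vector from Dirichlet(alpha) via normalized gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def sample_multinomial(n, probs, rng):
    """Draw multinomial counts by tallying n categorical draws."""
    tally = Counter(rng.choices(range(len(probs)), weights=probs, k=n))
    return [tally.get(j, 0) for j in range(len(probs))]


def simulate_dm_counts(n_samples, depth, alpha, seed=0):
    """Simulate a samples-by-taxa count matrix; each row gets its own Dirichlet draw."""
    rng = random.Random(seed)
    return [
        sample_multinomial(depth, sample_dirichlet(alpha, rng), rng)
        for _ in range(n_samples)
    ]


def simulate_multinomial_counts(n_samples, depth, probs, seed=0):
    """Simulate a count matrix where every row shares the same fixed proportions."""
    rng = random.Random(seed)
    return [sample_multinomial(depth, probs, rng) for _ in range(n_samples)]


def zero_fraction(matrix):
    """Fraction of cells in the matrix that are exactly zero."""
    cells = [count for row in matrix for count in row]
    return sum(count == 0 for count in cells) / len(cells)


if __name__ == "__main__":
    n_taxa, depth = 10, 200  # hypothetical sizes, not taken from the paper
    # Small concentration parameters give highly variable, sparse compositions.
    dm = simulate_dm_counts(50, depth, [0.2] * n_taxa, seed=1)
    mn = simulate_multinomial_counts(50, depth, [1.0 / n_taxa] * n_taxa, seed=1)
    print("zero fraction, Dirichlet-multinomial:", zero_fraction(dm))
    print("zero fraction, multinomial:", zero_fraction(mn))
```

Each row keeps the same total (the sequencing depth), which reflects the compositional constraint; only the per-row proportions vary. Smaller concentration values should yield sparser matrices, while large values approach the plain multinomial case.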

Keywords:  Bayesian nonparametric prior; Compositional data analysis; Feature allocation; Mixture model; Phylogenetic Indian buffet process

Year: 2022; PMID: 33634824; PMCID: PMC9291645; DOI: 10.1093/biostatistics/kxab002

Source DB: PubMed; Journal: Biostatistics; ISSN: 1465-4644; Impact factor: 5.279


