Literature DB >> 33552122

A Zero-Inflated Latent Dirichlet Allocation Model for Microbiome Studies.

Rebecca A Deek1, Hongzhe Li1.   

Abstract

The human microbiome consists of a community of microbes in varying abundances and is shown to be associated with many diseases. An important first step in many microbiome studies is to identify possible distinct microbial communities in a given data set and to identify the important bacterial taxa that characterize these communities. The data from typical microbiome studies are high dimensional count data with excessive zeros due to both absence of species (structural zeros) and low sequencing depth or dropout. Although methods have been developed for identifying the microbial communities based on mixture models of counts, these methods do not account for excessive zeros observed in the data and do not differentiate structural from sampling zeros. In this paper, we introduce a zero-inflated Latent Dirichlet Allocation model (zinLDA) for sparse count data observed in microbiome studies. zinLDA builds on the flexible Latent Dirichlet Allocation model and allows for zero inflation in observed counts. We develop an efficient Markov chain Monte Carlo (MCMC) sampling procedure to fit the model. Results from our simulations show zinLDA provides better fits to the data and is able to separate structural zeros from sampling zeros. We apply zinLDA to the data set from the American Gut Project and identify microbial communities characterized by different bacterial genera.
Copyright © 2021 Deek and Li.

Entities:  

Keywords:  gibbs sampling; metagenomics; microbial community; mixture models; zero inflated dirchlet distribution

Year:  2021        PMID: 33552122      PMCID: PMC7862749          DOI: 10.3389/fgene.2020.602594

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


  18 in total

1.  Finding scientific topics.

Authors:  Thomas L Griffiths; Mark Steyvers
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-10       Impact factor: 11.205

2.  The human microbiome project.

Authors:  Peter J Turnbaugh; Ruth E Ley; Micah Hamady; Claire M Fraser-Liggett; Rob Knight; Jeffrey I Gordon
Journal:  Nature       Date:  2007-10-18       Impact factor: 49.962

3.  Latent variable modeling for the microbiome.

Authors:  Kris Sankaran; Susan P Holmes
Journal:  Biostatistics       Date:  2019-10-01       Impact factor: 5.899

4.  Zero-inflated generalized Dirichlet multinomial regression model for microbiome compositional data analysis.

Authors:  Zheng-Zheng Tang; Guanhua Chen
Journal:  Biostatistics       Date:  2019-10-01       Impact factor: 5.899

5.  Structure-constrained sparse canonical correlation analysis with an application to microbiome data analysis.

Authors:  Jun Chen; Frederic D Bushman; James D Lewis; Gary D Wu; Hongzhe Li
Journal:  Biostatistics       Date:  2012-10-15       Impact factor: 5.899

6.  Sparse and compositionally robust inference of microbial ecological networks.

Authors:  Zachary D Kurtz; Christian L Müller; Emily R Miraldi; Dan R Littman; Martin J Blaser; Richard A Bonneau
Journal:  PLoS Comput Biol       Date:  2015-05-07       Impact factor: 4.475

7.  The Earth Microbiome project: successes and aspirations.

Authors:  Jack A Gilbert; Janet K Jansson; Rob Knight
Journal:  BMC Biol       Date:  2014-08-22       Impact factor: 7.431

8.  Revealing the microbial assemblage structure in the human gut microbiome using latent Dirichlet allocation.

Authors:  Shion Hosoda; Suguru Nishijima; Tsukasa Fukunaga; Masahira Hattori; Michiaki Hamada
Journal:  Microbiome       Date:  2020-06-23       Impact factor: 14.650

9.  A unified framework for unconstrained and constrained ordination of microbiome read count data.

Authors:  Stijn Hawinkel; Frederiek-Maarten Kerckhof; Luc Bijnens; Olivier Thas
Journal:  PLoS One       Date:  2019-02-13       Impact factor: 3.240

10.  A sparse covarying unit that describes healthy and impaired human gut microbiota development.

Authors:  Arjun S Raman; Jeanette L Gehrig; Siddarth Venkatesh; Hao-Wei Chang; Matthew C Hibberd; Sathish Subramanian; Gagandeep Kang; Pascal O Bessong; Aldo A M Lima; Margaret N Kosek; William A Petri; Dmitry A Rodionov; Aleksandr A Arzamasov; Semen A Leyn; Andrei L Osterman; Sayeeda Huq; Ishita Mostafa; Munirul Islam; Mustafa Mahfuz; Rashidul Haque; Tahmeed Ahmed; Michael J Barratt; Jeffrey I Gordon
Journal:  Science       Date:  2019-07-12       Impact factor: 47.728

View more
  2 in total

1.  Negative binomial factor regression with application to microbiome data analysis.

Authors:  Aditya K Mishra; Christian L Müller
Journal:  Stat Med       Date:  2022-04-24       Impact factor: 2.497

2.  Investigating differential abundance methods in microbiome data: A benchmark study.

Authors:  Marco Cappellato; Giacomo Baruzzo; Barbara Di Camillo
Journal:  PLoS Comput Biol       Date:  2022-09-08       Impact factor: 4.779

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.