| Literature DB >> 28125788 |
Yuqing Yang1, Ning Chen2, Ting Chen3.
Abstract
The inference of associations between environmental factors and microbes and among microbes is critical to interpreting metagenomic data, but compositional bias, indirect associations resulting from common factors, and variance within metagenomic sequencing data limit the discovery of associations. To account for these problems, we propose metagenomic Lognormal-Dirichlet-Multinomial (mLDM), a hierarchical Bayesian model with sparsity constraints, to estimate absolute microbial abundance and simultaneously infer both conditionally dependent associations among microbes and direct associations between microbes and environmental factors. We empirically show the effectiveness of the mLDM model using synthetic data, data from the TARA Oceans project, and a colorectal cancer dataset. Finally, we apply mLDM to 16S sequencing data from the western English Channel and report several associations. Our model can be used on both natural environmental and human metagenomic datasets, promoting the understanding of associations in the microbial community.Entities:
Keywords: 16S rRNA sequencing; Lognormal-Dirichlet-Multinomial model; OTU-OTU associations; compositional bias; environmental factor-microbe associations; hierarchical Bayesian statistical model; metagenomics; microbe-microbe associations
Mesh:
Substances:
Year: 2017 PMID: 28125788 DOI: 10.1016/j.cels.2016.12.012
Source DB: PubMed Journal: Cell Syst ISSN: 2405-4712 Impact factor: 10.304