Literature DB >> 30590511

Bayesian variable selection for multivariate zero-inflated models: Application to microbiome count data.

Kyu Ha Lee1, Brent A Coull2, Anna-Barbara Moscicki3, Bruce J Paster4, Jacqueline R Starr1.   

Abstract

Microorganisms play critical roles in human health and disease. They live in diverse communities in which they interact synergistically or antagonistically. Thus for estimating microbial associations with clinical covariates, such as treatment effects, joint (multivariate) statistical models are preferred. Multivariate models allow one to estimate and exploit complex interdependencies among multiple taxa, yielding more powerful tests of exposure or treatment effects than application of taxon-specific univariate analyses. Analysis of microbial count data also requires special attention because data commonly exhibit zero inflation, i.e., more zeros than expected from a standard count distribution. To meet these needs, we developed a Bayesian variable selection model for multivariate count data with excess zeros that incorporates information on the covariance structure of the outcomes (counts for multiple taxa), while estimating associations with the mean levels of these outcomes. Though there has been much work on zero-inflated models for longitudinal data, little attention has been given to high-dimensional multivariate zero-inflated data modeled via a general correlation structure. Through simulation, we compared performance of the proposed method to that of existing univariate approaches, for both the binary ("excess zero") and count parts of the model. When outcomes were correlated the proposed variable selection method maintained type I error while boosting the ability to identify true associations in the binary component of the model. For the count part of the model, in some scenarios the univariate method had higher power than the multivariate approach. This higher power was at a cost of a highly inflated false discovery rate not observed with the proposed multivariate method. We applied the approach to oral microbiome data from the Pediatric HIV/AIDS Cohort Oral Health Study and identified five (of 44) species associated with HIV infection.
© The Author 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  Bayesian variable selection; Markov chain Monte Carlo; Microbiome sequencing data; Multivariate analysis; Zero-inflated models

Mesh:

Year:  2020        PMID: 30590511      PMCID: PMC7308073          DOI: 10.1093/biostatistics/kxy067

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  33 in total

1.  Zero-inflated Poisson and binomial regression with random effects: a case study.

Authors:  D B Hall
Journal:  Biometrics       Date:  2000-12       Impact factor: 2.571

2.  The analysis of zero-inflated count data: beyond zero-inflated Poisson regression.

Authors:  Tom Loeys; Beatrijs Moerkerke; Olivia De Smet; Ann Buysse
Journal:  Br J Math Stat Psychol       Date:  2011-09-23       Impact factor: 3.380

3.  Multi-variate probit analysis.

Authors:  J R Ashford; R R Sowden
Journal:  Biometrics       Date:  1970-09       Impact factor: 2.571

4.  A technique of nonparametric multivariate analysis.

Authors:  N Mantel; R S Valand
Journal:  Biometrics       Date:  1970-09       Impact factor: 2.571

5.  Estimating overall exposure effects for zero-inflated regression models with application to dental caries.

Authors:  Jeffrey M Albert; Wei Wang; Suchitra Nelson
Journal:  Stat Methods Med Res       Date:  2011-09-08       Impact factor: 3.021

6.  Prevalence of and risk factors for substance use among perinatally human immunodeficiency virus-infected and perinatally exposed but uninfected youth.

Authors:  Julie Alperen; Sean Brummel; Katherine Tassiopoulos; Claude A Mellins; Deborah Kacanek; Renee Smith; George R Seage; Anna-Barbara Moscicki
Journal:  J Adolesc Health       Date:  2013-11-13       Impact factor: 5.012

7.  Bacteria of dental caries in primary and permanent teeth in children and young adults.

Authors:  Jørn A Aas; Ann L Griffen; Sara R Dardis; Alice M Lee; Ingar Olsen; Floyd E Dewhirst; Eugene J Leys; Bruce J Paster
Journal:  J Clin Microbiol       Date:  2008-01-23       Impact factor: 5.948

8.  Dirichlet multinomial mixtures: generative models for microbial metagenomics.

Authors:  Ian Holmes; Keith Harris; Christopher Quince
Journal:  PLoS One       Date:  2012-02-03       Impact factor: 3.240

9.  An integrative Bayesian Dirichlet-multinomial regression model for the analysis of taxonomic abundances in microbiome data.

Authors:  W Duncan Wadsworth; Raffaele Argiento; Michele Guindani; Jessica Galloway-Pena; Samuel A Shelburne; Marina Vannucci
Journal:  BMC Bioinformatics       Date:  2017-02-08       Impact factor: 3.169

10.  Oral microbiota in youth with perinatally acquired HIV infection.

Authors:  Jacqueline R Starr; Yanmei Huang; Kyu Ha Lee; C M Murphy; Anna-Barbara Moscicki; Caroline H Shiboski; Mark I Ryder; Tzy-Jyun Yao; Lina L Faller; Russell B Van Dyke; Bruce J Paster
Journal:  Microbiome       Date:  2018-05-31       Impact factor: 14.650

View more
  2 in total

1.  Use of Bayes factors to evaluate the effects of host genetics, litter and cage on the rabbit cecal microbiota.

Authors:  María Velasco-Galilea; Miriam Piles; Yuliaxis Ramayo-Caldas; Luis Varona; Juan Pablo Sánchez
Journal:  Genet Sel Evol       Date:  2022-06-27       Impact factor: 5.100

2.  A zero inflated log-normal model for inference of sparse microbial association networks.

Authors:  Vincent Prost; Stéphane Gazut; Thomas Brüls
Journal:  PLoS Comput Biol       Date:  2021-06-18       Impact factor: 4.475

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.