Literature DB >> 24312162

VARIABLE SELECTION FOR SPARSE DIRICHLET-MULTINOMIAL REGRESSION WITH AN APPLICATION TO MICROBIOME DATA ANALYSIS.

Jun Chen1, Hongzhe Li.   

Abstract

With the development of next generation sequencing technology, researchers have now been able to study the microbiome composition using direct sequencing, whose output are bacterial taxa counts for each microbiome sample. One goal of microbiome study is to associate the microbiome composition with environmental covariates. We propose to model the taxa counts using a Dirichlet-multinomial (DM) regression model in order to account for overdispersion of observed counts. The DM regression model can be used for testing the association between taxa composition and covariates using the likelihood ratio test. However, when the number of the covariates is large, multiple testing can lead to loss of power. To deal with the high dimensionality of the problem, we develop a penalized likelihood approach to estimate the regression parameters and to select the variables by imposing a sparse group [Formula: see text] penalty to encourage both group-level and within-group sparsity. Such a variable selection procedure can lead to selection of the relevant covariates and their associated bacterial taxa. An efficient block-coordinate descent algorithm is developed to solve the optimization problem. We present extensive simulations to demonstrate that the sparse DM regression can result in better identification of the microbiome-associated covariates than models that ignore overdispersion or only consider the proportions. We demonstrate the power of our method in an analysis of a data set evaluating the effects of nutrient intake on human gut microbiome composition. Our results have clearly shown that the nutrient intake is strongly associated with the human gut microbiome.

Entities:  

Keywords:  Coordinate descent; Counts data; Overdispersion; Regularized likelihood; Sparse group penalty

Year:  2013        PMID: 24312162      PMCID: PMC3846354          DOI: 10.1214/12-AOAS592

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  10 in total

1.  Metagenomics and personalized medicine.

Authors:  Herbert W Virgin; John A Todd
Journal:  Cell       Date:  2011-09-30       Impact factor: 41.582

2.  Individuality in gut microbiota composition is a complex polygenic trait shaped by multiple environmental and host genetic factors.

Authors:  Andrew K Benson; Scott A Kelly; Ryan Legge; Fangrui Ma; Soo Jen Low; Jaehyoung Kim; Min Zhang; Phaik Lyn Oh; Derrick Nehrenberg; Kunjie Hua; Stephen D Kachman; Etsuko N Moriyama; Jens Walter; Daniel A Peterson; Daniel Pomp
Journal:  Proc Natl Acad Sci U S A       Date:  2010-10-11       Impact factor: 11.205

Review 3.  Host-bacterial mutualism in the human intestine.

Authors:  Fredrik Bäckhed; Ruth E Ley; Justin L Sonnenburg; Daniel A Peterson; Jeffrey I Gordon
Journal:  Science       Date:  2005-03-25       Impact factor: 47.728

4.  Multi-level zero-inflated poisson regression modelling of correlated count data with excess zeros.

Authors:  Andy H Lee; Kui Wang; Jane A Scott; Kelvin K W Yau; Geoffrey J McLachlan
Journal:  Stat Methods Med Res       Date:  2006-02       Impact factor: 3.021

5.  Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities.

Authors:  Patrick D Schloss; Sarah L Westcott; Thomas Ryabin; Justine R Hall; Martin Hartmann; Emily B Hollister; Ryan A Lesniewski; Brian B Oakley; Donovan H Parks; Courtney J Robinson; Jason W Sahl; Blaz Stres; Gerhard G Thallinger; David J Van Horn; Carolyn F Weber
Journal:  Appl Environ Microbiol       Date:  2009-10-02       Impact factor: 4.792

6.  Regularized Multivariate Regression for Identifying Master Predictors with Application to Integrative Genomics Study of Breast Cancer.

Authors:  Jie Peng; Ji Zhu; Anna Bergamaschi; Wonshik Han; Dong-Young Noh; Jonathan R Pollack; Pei Wang
Journal:  Ann Appl Stat       Date:  2010-03       Impact factor: 2.083

7.  Faecalibacterium prausnitzii is an anti-inflammatory commensal bacterium identified by gut microbiota analysis of Crohn disease patients.

Authors:  Harry Sokol; Bénédicte Pigneur; Laurie Watterlot; Omar Lakhdari; Luis G Bermúdez-Humarán; Jean-Jacques Gratadoux; Sébastien Blugeon; Chantal Bridonneau; Jean-Pierre Furet; Gérard Corthier; Corinne Grangette; Nadia Vasquez; Philippe Pochart; Germain Trugnan; Ginette Thomas; Hervé M Blottière; Joël Doré; Philippe Marteau; Philippe Seksik; Philippe Langella
Journal:  Proc Natl Acad Sci U S A       Date:  2008-10-20       Impact factor: 11.205

8.  Linking long-term dietary patterns with gut microbial enterotypes.

Authors:  Gary D Wu; Jun Chen; Christian Hoffmann; Kyle Bittinger; Ying-Yu Chen; Sue A Keilbaugh; Meenakshi Bewtra; Dan Knights; William A Walters; Rob Knight; Rohini Sinha; Erin Gilroy; Kernika Gupta; Robert Baldassano; Lisa Nessel; Hongzhe Li; Frederic D Bushman; James D Lewis
Journal:  Science       Date:  2011-09-01       Impact factor: 47.728

9.  QIIME allows analysis of high-throughput community sequencing data.

Authors:  J Gregory Caporaso; Justin Kuczynski; Jesse Stombaugh; Kyle Bittinger; Frederic D Bushman; Elizabeth K Costello; Noah Fierer; Antonio Gonzalez Peña; Julia K Goodrich; Jeffrey I Gordon; Gavin A Huttley; Scott T Kelley; Dan Knights; Jeremy E Koenig; Ruth E Ley; Catherine A Lozupone; Daniel McDonald; Brian D Muegge; Meg Pirrung; Jens Reeder; Joel R Sevinsky; Peter J Turnbaugh; William A Walters; Jeremy Widmann; Tanya Yatsunenko; Jesse Zaneveld; Rob Knight
Journal:  Nat Methods       Date:  2010-04-11       Impact factor: 28.547

10.  pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree.

Authors:  Frederick A Matsen; Robin B Kodner; E Virginia Armbrust
Journal:  BMC Bioinformatics       Date:  2010-10-30       Impact factor: 3.169

  10 in total
  50 in total

1.  Investigating microbial co-occurrence patterns based on metagenomic compositional data.

Authors:  Yuguang Ban; Lingling An; Hongmei Jiang
Journal:  Bioinformatics       Date:  2015-06-16       Impact factor: 6.937

Review 2.  Mapping the microbial interactome: Statistical and experimental approaches for microbiome network inference.

Authors:  Anders B Dohlman; Xiling Shen
Journal:  Exp Biol Med (Maywood)       Date:  2019-03-16

3.  Power and sample size calculations for high-throughput sequencing-based experiments.

Authors:  Chung-I Li; David C Samuels; Ying-Yong Zhao; Yu Shyr; Yan Guo
Journal:  Brief Bioinform       Date:  2018-11-27       Impact factor: 11.622

4.  Latent variable modeling for the microbiome.

Authors:  Kris Sankaran; Susan P Holmes
Journal:  Biostatistics       Date:  2019-10-01       Impact factor: 5.899

5.  Bayesian variable selection for multivariate zero-inflated models: Application to microbiome count data.

Authors:  Kyu Ha Lee; Brent A Coull; Anna-Barbara Moscicki; Bruce J Paster; Jacqueline R Starr
Journal:  Biostatistics       Date:  2020-07-01       Impact factor: 5.899

6.  Home dust microbiota is disordered in homes of low-income asthmatic children.

Authors:  Christina E Ciaccio; Charles Barnes; Kevin Kennedy; Marcia Chan; Jay Portnoy; Lanny Rosenwasser
Journal:  J Asthma       Date:  2015-10-29       Impact factor: 2.515

7.  Regression Models For Multivariate Count Data.

Authors:  Yiwen Zhang; Hua Zhou; Jin Zhou; Wei Sun
Journal:  J Comput Graph Stat       Date:  2017-02-16       Impact factor: 2.302

8.  A model for paired-multinomial data and its application to analysis of data on a taxonomic tree.

Authors:  Pixu Shi; Hongzhe Li
Journal:  Biometrics       Date:  2017-03-30       Impact factor: 2.571

9.  MODELING MICROBIAL ABUNDANCES AND DYSBIOSIS WITH BETA-BINOMIAL REGRESSION.

Authors:  Bryan D Martin; Daniela Witten; Amy D Willis
Journal:  Ann Appl Stat       Date:  2020-04-16       Impact factor: 2.083

10.  A GLM-based latent variable ordination method for microbiome samples.

Authors:  Michael B Sohn; Hongzhe Li
Journal:  Biometrics       Date:  2017-10-09       Impact factor: 2.571

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.