Literature DB >> 32683674

Compositional knockoff filter for high-dimensional regression analysis of microbiome data.

Arun Srinivasan1, Lingzhou Xue1, Xiang Zhan2.   

Abstract

A critical task in microbiome data analysis is to explore the association between a scalar response of interest and a large number of microbial taxa that are summarized as compositional data at different taxonomic levels. Motivated by fine-mapping of the microbiome, we propose a two-step compositional knockoff filter to provide the effective finite-sample false discovery rate (FDR) control in high-dimensional linear log-contrast regression analysis of microbiome compositional data. In the first step, we propose a new compositional screening procedure to remove insignificant microbial taxa while retaining the essential sum-to-zero constraint. In the second step, we extend the knockoff filter to identify the significant microbial taxa in the sparse regression model for compositional data. Thereby, a subset of the microbes is selected from the high-dimensional microbial taxa as related to the response under a prespecified FDR threshold. We study the theoretical properties of the proposed two-step procedure, including both sure screening and effective false discovery control. We demonstrate these properties in numerical simulation studies to compare our methods to some existing ones and show power gain of the new method while controlling the nominal FDR. The potential usefulness of the proposed method is also illustrated with application to an inflammatory bowel disease data set to identify microbial taxa that influence host gene expressions.
© 2020 The International Biometric Society.

Entities:  

Keywords:  FDR control; compositional constraint; compositional screening; knockoff filter; log-contrast model; microbiome

Mesh:

Year:  2020        PMID: 32683674      PMCID: PMC7831267          DOI: 10.1111/biom.13336

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   1.701


  16 in total

1.  Generalized linear models with linear constraints for microbiome compositional data.

Authors:  Jiarui Lu; Pixu Shi; Hongzhe Li
Journal:  Biometrics       Date:  2018-08-10       Impact factor: 2.571

2.  A broken promise: microbiome differential abundance methods do not control the false discovery rate.

Authors:  Stijn Hawinkel; Federico Mattiello; Luc Bijnens; Olivier Thas
Journal:  Brief Bioinform       Date:  2019-01-18       Impact factor: 11.622

3.  The discarding of variables in multivariate analysis.

Authors:  E M Beale; M G Kendall; D W Mann
Journal:  Biometrika       Date:  1967-12       Impact factor: 2.445

Review 4.  The human microbiome: at the interface of health and disease.

Authors:  Ilseung Cho; Martin J Blaser
Journal:  Nat Rev Genet       Date:  2012-03-13       Impact factor: 53.242

5.  HIGH DIMENSIONAL VARIABLE SELECTION.

Authors:  Larry Wasserman; Kathryn Roeder
Journal:  Ann Stat       Date:  2009-01-01       Impact factor: 4.028

6.  Feature Screening via Distance Correlation Learning.

Authors:  Runze Li; Wei Zhong; Liping Zhu
Journal:  J Am Stat Assoc       Date:  2012-07-01       Impact factor: 5.033

7.  False discovery rate control incorporating phylogenetic tree increases detection power in microbiome-wide multiple testing.

Authors:  Jian Xiao; Hongyuan Cao; Jun Chen
Journal:  Bioinformatics       Date:  2017-09-15       Impact factor: 6.937

8.  Associations between host gene expression, the mucosal microbiome, and clinical outcome in the pelvic pouch of patients with inflammatory bowel disease.

Authors:  Xochitl C Morgan; Boyko Kabakchiev; Levi Waldron; Andrea D Tyler; Timothy L Tickle; Raquel Milgrom; Joanne M Stempak; Dirk Gevers; Ramnik J Xavier; Mark S Silverberg; Curtis Huttenhower
Journal:  Genome Biol       Date:  2015-04-08       Impact factor: 13.583

9.  Normalization and microbial differential abundance strategies depend upon data characteristics.

Authors:  Sophie Weiss; Zhenjiang Zech Xu; Shyamal Peddada; Amnon Amir; Kyle Bittinger; Antonio Gonzalez; Catherine Lozupone; Jesse R Zaneveld; Yoshiki Vázquez-Baeza; Amanda Birmingham; Embriette R Hyde; Rob Knight
Journal:  Microbiome       Date:  2017-03-03       Impact factor: 14.650

10.  A two-stage microbial association mapping framework with advanced FDR control.

Authors:  Jiyuan Hu; Hyunwook Koh; Linchen He; Menghan Liu; Martin J Blaser; Huilin Li
Journal:  Microbiome       Date:  2018-07-25       Impact factor: 14.650

View more
  2 in total

1.  Adaptive and powerful microbiome multivariate association analysis via feature selection.

Authors:  Kalins Banerjee; Jun Chen; Xiang Zhan
Journal:  NAR Genom Bioinform       Date:  2022-01-14

2.  Opportunities and limits of combining microbiome and genome data for complex trait prediction.

Authors:  Miguel Pérez-Enciso; Laura M Zingaretti; Yuliaxis Ramayo-Caldas; Gustavo de Los Campos
Journal:  Genet Sel Evol       Date:  2021-08-06       Impact factor: 4.297

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.