Literature DB >> 31886477

Sufficient dimension reduction for compositional data.

Diego Tomassi1, Liliana Forzani2, Sabrina Duarte3, Ruth M Pfeiffer.   

Abstract

Recent efforts to characterize the human microbiome and its relation to chronic diseases have led to a surge in statistical development for compositional data. We develop likelihood-based sufficient dimension reduction methods (SDR) to find linear combinations that contain all the information in the compositional data on an outcome variable, i.e., are sufficient for modeling and prediction of the outcome. We consider several models for the inverse regression of the compositional vector or transformations of it, as a function of outcome. They include normal, multinomial, and Poisson graphical models that allow for complex dependencies among observed counts. These methods yield efficient estimators of the reduction and can be applied to continuous or categorical outcomes. We incorporate variable selection into the estimation via penalties and address important invariance issues arising from the compositional nature of the data. We illustrate and compare our methods and some established methods for analyzing microbiome data in simulations and using data from the Human Microbiome Project. Displaying the data in the coordinate system of the SDR linear combinations allows visual inspection and facilitates comparisons across studies. Published by Oxford University Press on behalf of biostatistics 2019. This work is written by (a) US Government employee(s) and is in the public domain in the US.

Entities:  

Keywords:  Count data; Penalized likelihood; Prediction; Regression; Sufficient statistic; visualization

Mesh:

Year:  2021        PMID: 31886477      PMCID: PMC8677486          DOI: 10.1093/biostatistics/kxz060

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  10 in total

1.  A logistic normal multinomial regression model for microbiome compositional data analysis.

Authors:  Fan Xia; Jun Chen; Wing Kam Fung; Hongzhe Li
Journal:  Biometrics       Date:  2013-10-15       Impact factor: 2.571

2.  Testing in Microbiome-Profiling Studies with MiRKAT, the Microbiome Regression-Based Kernel Association Test.

Authors:  Ni Zhao; Jun Chen; Ian M Carroll; Tamar Ringel-Kulka; Michael P Epstein; Hua Zhou; Jin J Zhou; Yehuda Ringel; Hongzhe Li; Michael C Wu
Journal:  Am J Hum Genet       Date:  2015-05-07       Impact factor: 11.025

3.  Prediction analysis for microbiome sequencing data.

Authors:  Tao Wang; Can Yang; Hongyu Zhao
Journal:  Biometrics       Date:  2019-04-17       Impact factor: 2.571

4.  Structure-constrained sparse canonical correlation analysis with an application to microbiome data analysis.

Authors:  Jun Chen; Frederic D Bushman; James D Lewis; Gary D Wu; Hongzhe Li
Journal:  Biostatistics       Date:  2012-10-15       Impact factor: 5.899

5.  VARIABLE SELECTION FOR SPARSE DIRICHLET-MULTINOMIAL REGRESSION WITH AN APPLICATION TO MICROBIOME DATA ANALYSIS.

Authors:  Jun Chen; Hongzhe Li
Journal:  Ann Appl Stat       Date:  2013-03-01       Impact factor: 2.083

6.  Small Sample Kernel Association Tests for Human Genetic and Microbiome Association Studies.

Authors:  Jun Chen; Wenan Chen; Ni Zhao; Michael C Wu; Daniel J Schaid
Journal:  Genet Epidemiol       Date:  2015-12-07       Impact factor: 2.135

7.  A GLM-based latent variable ordination method for microbiome samples.

Authors:  Michael B Sohn; Hongzhe Li
Journal:  Biometrics       Date:  2017-10-09       Impact factor: 2.571

8.  A Review of Multivariate Distributions for Count Data Derived from the Poisson Distribution.

Authors:  David Inouye; Eunho Yang; Genevera Allen; Pradeep Ravikumar
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2017-03-28

9.  Chapter 12: Human microbiome analysis.

Authors:  Xochitl C Morgan; Curtis Huttenhower
Journal:  PLoS Comput Biol       Date:  2012-12-27       Impact factor: 4.475

10.  An adaptive association test for microbiome data.

Authors:  Chong Wu; Jun Chen; Junghi Kim; Wei Pan
Journal:  Genome Med       Date:  2016-05-19       Impact factor: 11.117

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.