Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Prediction analysis for microbiome sequencing data.

Literature DB >> 30994187

Prediction analysis for microbiome sequencing data.

Tao Wang^1,2,3, Can Yang⁴, Hongyu Zhao^3,5.

Abstract

One goal of human microbiome studies is to relate host traits with human microbiome compositions. The analysis of microbial community sequencing data presents great statistical challenges, especially when the samples have different library sizes and the data are overdispersed with many zeros. To address these challenges, we introduce a new statistical framework, called predictive analysis in metagenomics via inverse regression (PAMIR), to analyze microbiome sequencing data. Within this framework, an inverse regression model is developed for overdispersed microbiota counts given the trait, and then a prediction rule is constructed by taking advantage of the dimension-reduction structure in the model. An efficient Monte Carlo expectation-maximization algorithm is proposed for maximum likelihood estimation. The method is further generalized to accommodate other types of covariates. We demonstrate the advantages of PAMIR through simulations and two real data examples.

Entities: Species

Keywords: expectation-maximization algorithm; log ratios; metagenomic data; model-based dimension reduction; multinomial-logit regression

Mesh：

Year: 2019 PMID： 30994187 DOI： 10.1111/biom.13061

Source DB: PubMed Journal: Biometrics ISSN： 0006-341X Impact factor: 2.571

Keyword Cloud
Cited

2 in total

1. DCMD: Distance-based classification using mixture distributions on microbiome data.

Authors: Konstantin Shestopaloff; Mei Dong; Fan Gao; Wei Xu
Journal: PLoS Comput Biol Date: 2021-03-12 Impact factor: 4.475

2. Sufficient dimension reduction for compositional data.

Authors: Diego Tomassi; Liliana Forzani; Sabrina Duarte; Ruth M Pfeiffer
Journal: Biostatistics Date: 2021-10-13 Impact factor: 5.899

2 in total