Literature DB >> 33575644

Shrinkage improves estimation of microbial associations under different normalization methods.

Michelle Badri1, Zachary D Kurtz2, Richard Bonneau1, Christian L Müller3.   

Abstract

Estimation of statistical associations in microbial genomic survey count data is fundamental to microbiome research. Experimental limitations, including count compositionality, low sample sizes and technical variability, obstruct standard application of association measures and require data normalization prior to statistical estimation. Here, we investigate the interplay between data normalization, microbial association estimation and available sample size by leveraging the large-scale American Gut Project (AGP) survey data. We analyze the statistical properties of two prominent linear association estimators, correlation and proportionality, under different sample scenarios and data normalization schemes, including RNA-seq analysis workflows and log-ratio transformations. We show that shrinkage estimation, a standard statistical regularization technique, can universally improve the quality of taxon-taxon association estimates for microbiome data. We find that large-scale association patterns in the AGP data can be grouped into five normalization-dependent classes. Using microbial association network construction and clustering as downstream data analysis examples, we show that variance-stabilizing and log-ratio approaches enable the most taxonomically and structurally coherent estimates. Taken together, the findings from our reproducible analysis workflow have important implications for microbiome studies in multiple stages of analysis, particularly when only small sample sizes are available.
© The Author(s) 2020. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.

Entities:  

Year:  2020        PMID: 33575644      PMCID: PMC7745771          DOI: 10.1093/nargab/lqaa100

Source DB:  PubMed          Journal:  NAR Genom Bioinform        ISSN: 2631-9268


  7 in total

1.  NetCoMi: network construction and comparison for microbiome data in R.

Authors:  Stefanie Peschel; Christian L Müller; Erika von Mutius; Anne-Laure Boulesteix; Martin Depner
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

2.  Identifying correlations driven by influential observations in large datasets.

Authors:  Kevin Bu; David S Wallach; Zach Wilson; Nan Shen; Leopoldo N Segal; Emilia Bagiella; Jose C Clemente
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

3.  Adaptive and powerful microbiome multivariate association analysis via feature selection.

Authors:  Kalins Banerjee; Jun Chen; Xiang Zhan
Journal:  NAR Genom Bioinform       Date:  2022-01-14

4.  Bacterial low-abundant taxa are key determinants of a healthy airway metagenome in the early years of human life.

Authors:  Marie-Madlen Pust; Burkhard Tümmler
Journal:  Comput Struct Biotechnol J       Date:  2021-12-15       Impact factor: 7.271

5.  Negative binomial factor regression with application to microbiome data analysis.

Authors:  Aditya K Mishra; Christian L Müller
Journal:  Stat Med       Date:  2022-04-24       Impact factor: 2.497

6.  Metagenomic study of the gut microbiota associated with cow milk consumption in Chinese peri-/postmenopausal women.

Authors:  Bo Tian; Jia-Heng Yao; Xu Lin; Wan-Qiang Lv; Lin-Dong Jiang; Zhuo-Qi Wang; Jie Shen; Hong-Mei Xiao; Hanli Xu; Lu-Lu Xu; Xiyu Cheng; Hui Shen; Chuan Qiu; Zhe Luo; Lan-Juan Zhao; Qiong Yan; Hong-Wen Deng; Li-Shu Zhang
Journal:  Front Microbiol       Date:  2022-08-16       Impact factor: 6.064

7.  Tree-aggregated predictive modeling of microbiome data.

Authors:  Jacob Bien; Xiaohan Yan; Léo Simpson; Christian L Müller
Journal:  Sci Rep       Date:  2021-07-15       Impact factor: 4.379

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.