Literature DB >> 32493208

An empirical Bayes approach to normalization and differential abundance testing for microbiome data.

Tiantian Liu1,2, Hongyu Zhao3,2, Tao Wang4,5,6.   

Abstract

BACKGROUND: Advances in DNA sequencing have offered researchers an unprecedented opportunity to better study the variety of species living in and on the human body. However, the analysis of microbiome data is complicated by several challenges. First, the sequencing depth may vary by orders of magnitude across samples. Second, species are rare and the data often contain many zeros. Third, the specimen is a fraction of the microbial ecosystem, and so the data are compositional carrying only relative information. Other characteristics of microbiome data include pronounced over-dispersion in taxon abundances, and the existence of a phylogenetic tree that relates all bacterial species. To address some of these challenges, microbiome analysis workflows often normalize the read counts prior to downstream analysis. However, there are limitations in the current literature on the normalization of microbiome data.
RESULTS: Under the multinomial distribution for the read counts and a prior for the unknown proportions, we propose an empirical Bayes approach to microbiome data normalization. Using a tree-based extension of the Dirichlet prior, we further extend our method by incorporating the phylogenetic tree into the normalization process. We study the impact of normalization on differential abundance analysis. In the presence of tree structure, we propose a phylogeny-aware detection procedure.
CONCLUSIONS: Extensive simulations and gut microbiome data applications are conducted to demonstrate the superior performance of our empirical Bayes method over other normalization methods, and over commonly-used methods for differential abundance testing. Original R scripts are available at GitHub (https://github.com/liudoubletian/eBay).

Entities:  

Keywords:  Bayesian shrinkage; Differentially abundant OTUs; MetagenomeSeq; Phylogeny-aware analysis; Rarefying

Year:  2020        PMID: 32493208     DOI: 10.1186/s12859-020-03552-z

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  4 in total

1.  phyloMDA: an R package for phylogeny-aware microbiome data analysis.

Authors:  Tiantian Liu; Chao Zhou; Huimin Wang; Hongyu Zhao; Tao Wang
Journal:  BMC Bioinformatics       Date:  2022-06-06       Impact factor: 3.307

2.  Beware to ignore the rare: how imputing zero-values can improve the quality of 16S rRNA gene studies results.

Authors:  Giacomo Baruzzo; Ilaria Patuzzi; Barbara Di Camillo
Journal:  BMC Bioinformatics       Date:  2022-02-07       Impact factor: 3.169

3.  A comprehensive evaluation of microbial differential abundance analysis methods: current status and potential solutions.

Authors:  Lu Yang; Jun Chen
Journal:  Microbiome       Date:  2022-08-19       Impact factor: 16.837

4.  Investigating differential abundance methods in microbiome data: A benchmark study.

Authors:  Marco Cappellato; Giacomo Baruzzo; Barbara Di Camillo
Journal:  PLoS Comput Biol       Date:  2022-09-08       Impact factor: 4.779

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.