Literature DB >> 31719220

Successful strategies for human microbiome data generation, storage and analyses.

Susan Holmes1.   

Abstract

Current interest in the potential for clinical use of new tools for improving human health are now focused on techniques for the study of the human microbiome and its interaction with environmental and clinical covariates. This review outlines the use of statistical strategies that have been developed in past studies and can inform successful design and analyses of controlled perturbation experiments performed in the human microbiome. We carefully outline what the data are, their imperfections and how we need to transform, decontaminate and denoise them. We show how to identify the important unknown parameters and how to can leverage variability we see to produce efficient models for prediction and uncertainty quantification. We encourage a reproducible strategy that builds on best practice principles that can be adapted for effective experimental design and reproducible workflows. Nonparametric, data-driven denoising strategies already provide the best strain identification and decontamination methods. Data driven models can be combined with uncertainty quantification to provide reproducible aids to decision making in the clinical context, as long as careful, separate, registered confirmatory testing are undertaken. Here we provide guidelines for effective longitudinal studies and their analyses. Lessons learned along the way are that visualizations at every step can pinpoint problems and outliers, normalization and filtering improve power in downstream testing. We recommend collecting and binding the metadata and covariates to sample descriptors and recording complete computer scripts into an R markdown supplement that can reduce opportunities for human error and enable collaborators and readers to replicate all the steps of the study. Finally, we note that optimizing the bioinformatic and statistical workflow involves adopting a wait-and-see approach that is particularly effective in cases where the features such as 'mass spectrometry peaks' and metagenomic tables can only be partially annotated.

Entities:  

Mesh:

Year:  2019        PMID: 31719220

Source DB:  PubMed          Journal:  J Biosci        ISSN: 0250-5991            Impact factor:   1.826


  18 in total

1.  Bayesian Nonparametric Ordination for the Analysis of Microbial Communities.

Authors:  Boyu Ren; Sergio Bacallado; Stefano Favaro; Susan Holmes; Lorenzo Trippa
Journal:  J Am Stat Assoc       Date:  2017-02-28       Impact factor: 5.033

2.  DADA2: High-resolution sample inference from Illumina amplicon data.

Authors:  Benjamin J Callahan; Paul J McMurdie; Michael J Rosen; Andrew W Han; Amy Jo A Johnson; Susan P Holmes
Journal:  Nat Methods       Date:  2016-05-23       Impact factor: 28.547

3.  Dirichlet multinomial mixtures: generative models for microbial metagenomics.

Authors:  Ian Holmes; Keith Harris; Christopher Quince
Journal:  PLoS One       Date:  2012-02-03       Impact factor: 3.240

4.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.

Authors:  Michael I Love; Wolfgang Huber; Simon Anders
Journal:  Genome Biol       Date:  2014       Impact factor: 13.583

5.  Exact sequence variants should replace operational taxonomic units in marker-gene data analysis.

Authors:  Benjamin J Callahan; Paul J McMurdie; Susan P Holmes
Journal:  ISME J       Date:  2017-07-21       Impact factor: 10.302

6.  Multidomain analyses of a longitudinal human microbiome intestinal cleanout perturbation experiment.

Authors:  Julia Fukuyama; Laurie Rumker; Kris Sankaran; Pratheepa Jeganathan; Les Dethlefsen; David A Relman; Susan P Holmes
Journal:  PLoS Comput Biol       Date:  2017-08-18       Impact factor: 4.475

7.  Simple statistical identification and removal of contaminant sequences in marker-gene and metagenomics data.

Authors:  Nicole M Davis; Diana M Proctor; Susan P Holmes; David A Relman; Benjamin J Callahan
Journal:  Microbiome       Date:  2018-12-17       Impact factor: 14.650

8.  phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data.

Authors:  Paul J McMurdie; Susan Holmes
Journal:  PLoS One       Date:  2013-04-22       Impact factor: 3.240

9.  Why most published research findings are false.

Authors:  John P A Ioannidis
Journal:  PLoS Med       Date:  2005-08-30       Impact factor: 11.613

10.  Bioconductor workflow for microbiome data analysis: from raw reads to community analyses.

Authors:  Ben J Callahan; Kris Sankaran; Julia A Fukuyama; Paul J McMurdie; Susan P Holmes
Journal:  F1000Res       Date:  2016-06-24
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.