| Literature DB >> 28449106 |
Achal Dhariwal1, Jasmine Chong2, Salam Habib3, Irah L King4,5, Luis B Agellon3, Jianguo Xia1,2,4,5.
Abstract
The widespread application of next-generation sequencing technologies has revolutionized microbiome research by enabling high-throughput profiling of the genetic contents of microbial communities. How to analyze the resulting large complex datasets remains a key challenge in current microbiome studies. Over the past decade, powerful computational pipelines and robust protocols have been established to enable efficient raw data processing and annotation. The focus has shifted toward downstream statistical analysis and functional interpretation. Here, we introduce MicrobiomeAnalyst, a user-friendly tool that integrates recent progress in statistics and visualization techniques, coupled with novel knowledge bases, to enable comprehensive analysis of common data outputs produced from microbiome studies. MicrobiomeAnalyst contains four modules - the Marker Data Profiling module offers various options for community profiling, comparative analysis and functional prediction based on 16S rRNA marker gene data; the Shotgun Data Profiling module supports exploratory data analysis, functional profiling and metabolic network visualization of shotgun metagenomics or metatranscriptomics data; the Taxon Set Enrichment Analysis module helps interpret taxonomic signatures via enrichment analysis against >300 taxon sets manually curated from literature and public databases; finally, the Projection with Public Data module allows users to visually explore their data with a public reference data for pattern discovery and biological insights. MicrobiomeAnalyst is freely available at http://www.microbiomeanalyst.ca.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28449106 PMCID: PMC5570177 DOI: 10.1093/nar/gkx295
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.MicrobiomeAnalyst flow chart. MicrobiomeAnalyst accepts taxa/gene lists, OTU/gene abundance tables, or BIOM files. Three consecutive steps are performed - data processing, data analysis, and result exploration. The associated web interface offers a rich set of options, and produces various tables and graphics to allow users to intuitively navigate the data analysis tasks.
Figure 2.Example outputs from MicrobiomeAnalyst. (A) A box plot summary of the Shannon diversity index across different groups. (B) A stacked bar chart showing Phylum level abundance profiles across samples. Sample names in red and green colors indicate mice fed a low fat diet (LFD) and a western-style diet (WSD), respectively. (C) A PCoA plot with sample colors based on different diets. (D) The same PCoA plot with color gradients based on the abundance levels of family Bacteroidaceae. (E) A graphical summary of the classification performance on different diets using the Random Forests algorithm. (F) A dendrogram showing the clustering of samples with colors based on diet and sex. (G) A clustered heatmap showing the variation of taxonomic abundance with regard to diet and sex. (H) An interactive network summarizing enriched taxon sets from TSEA. (I) A 3D PCoA plot from the PPD module, with the taxonomic composition of the currently selected sample shown in the middle and the session history on the right. (J) A screenshot showing functional enrichment analysis and visualization within the global metabolic network.
Comparison of MicrobiomeAnalyst with other web-based tools. The URL for each tool is given below the table. Tools dedicated solely for sequence annotation are not included
| Tools | Microbiome-Analyst | METAGEN-assist | EBI-Metagenomics | MG-RAST | VAMPS |
|---|---|---|---|---|---|
| Registration | No | No | Yes | Yes | Yes |
| Data Processing | |||||
| Input | Count tables; BIOM; mothur output | Count tables; BIOM; outputs from 4 tools | Sequences | Sequences | Sequences |
| Filtering | Abundance, variance, manual | Abundance, variance | – | Abundance | Abundance |
| Normalization | Scaling, transformation, rarefying | Scaling, transformation | – | Scaling, trans-formation | Scaling |
| Taxonomic Profiling | |||||
| Alpha-diversity | Multiple | – | – | Shannon | Multiple |
| Beta-diversity | PCoA & NMDS (2D & 3D) | PCA, PLS-DA | PCA | PCoA | PCoA & NMDS (2D only) |
| Functional profiling | |||||
| Functional prediction | PICRUSt & Tax4Fun | – | – | – | – |
| Functional annotation | COG & KEGG | – | GO | SEED, KEGG COG, eggNOG | – |
| Pathway visualization | Yes (JavaScript) | – | – | Yes (SVG) | – |
| Comparative analysis | |||||
| Differential analysis | Univariate methods, DESeq2, edgeR, metagenomeSeq | Univariate methods | – | – | – |
| Biomarker discovery & classification | LEfSe, Random Forests | SVM, Random Forests | – | – | – |
| Meta-analysis | |||||
| Taxon set enrichment analysis | 105 strain sets, 174 species sets, 42 others | – | – | – | – |
| Integration with public data | Visual analytics with 3D PCoA | – | – | – | – |
• MicrobiomeAnalyst: http://www.microbiomeanalyst.ca/
• METAGENassist: http://www.metagenassist.ca/
• EBI-Metagenomics: https://www.ebi.ac.uk/metagenomics/
• MG-RAST: http://metagenomics.anl.gov/
• VAMPS: https://vamps2.mbl.edu/