| Literature DB >> 22645318 |
David Arndt1, Jianguo Xia, Yifeng Liu, You Zhou, An Chi Guo, Joseph A Cruz, Igor Sinelnikov, Karen Budwill, Camilla L Nesbø, David S Wishart.
Abstract
With recent improvements in DNA sequencing and sample extraction techniques, the quantity and quality of metagenomic data are now growing exponentially. This abundance of richly annotated metagenomic data and bacterial census information has spawned a new branch of microbiology called comparative metagenomics. Comparative metagenomics involves the comparison of bacterial populations between different environmental samples, different culture conditions or different microbial hosts. However, in order to do comparative metagenomics, one typically requires a sophisticated knowledge of multivariate statistics and/or advanced software programming skills. To make comparative metagenomics more accessible to microbiologists, we have developed a freely accessible, easy-to-use web server for comparative metagenomic analysis called METAGENassist. Users can upload their bacterial census data from a wide variety of common formats, using either amplified 16S rRNA data or shotgun metagenomic data. Metadata concerning environmental, culture, or host conditions can also be uploaded. During the data upload process, METAGENassist also performs an automated taxonomic-to-phenotypic mapping. Phenotypic information covering nearly 20 functional categories such as GC content, genome size, oxygen requirements, energy sources and preferred temperature range is automatically generated from the taxonomic input data. Using this phenotypically enriched data, users can then perform a variety of multivariate and univariate data analyses including fold change analysis, t-tests, PCA, PLS-DA, clustering and classification. To facilitate data processing, users are guided through a step-by-step analysis workflow using a variety of menus, information hyperlinks and check boxes. METAGENassist also generates colorful, publication quality tables and graphs that can be downloaded and used directly in the preparation of scientific papers. METAGENassist is available at http://www.metagenassist.ca.Entities:
Mesh:
Year: 2012 PMID: 22645318 PMCID: PMC3394294 DOI: 10.1093/nar/gks497
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Sample output from METAGENassist for phenotype mapping and univariate analysis. (A) A pie chart showing the breakdown of energy source phenotypes for a single sample after taxonomic mapping; (B) One-way ANOVA plot with red dots indicating significant features scoring above a given threshold (the horizontal dotted line).
Figure 2.Sample outputs from METAGENassist for multivariate analysis. (A) A PCA plot of environmental coal samples for samples versus metabolism phenotypes, with colored groupings according to sample-associated coal type. The dotted ellipses indicate the 95% confidence intervals. (B) A heatmap combined with agglomerative hierarchical clustering using the same coal sample data.
Comparison of comparative metagenomic software
| METAGENassist | Mothur | QIIME | MG-RAST | MEGAN 4 | STAMP 2.0 | |
|---|---|---|---|---|---|---|
| Platform | Web server | Command-line | Command-line | Web server | Desktop and command-line | Desktop |
| Graphical user interface | Yes | No | No | Yes | Yes | Yes |
| Compare 16S rRNA samples | Yes | Yes | Yes | Yes | Yes | Yes |
| Taxonomy-centered analysis of shotgun data | Yes | No | Partial (using imported file) | Yes | Yes | Yes |
| Sequence annotation | No | Yes | Yes | Yes | Using input BLAST file | No |
| Univariate statistics | Fold change analysis, | Metastats( | Paired | No | Directed Homogeneity test ( | |
| Multivariate statistics | PCA, PLS-DA | PCoA, PCA, NMDS | PCoA, NMDS | PCoA | No | PCA |
| Clustering | Dendrograms, heatmaps, K-means, SOM | Dendrograms, heatmaps | Heatmaps | Heatmap- dendrogram | No | No |
| Supervised classification | Random forest, SVM | No | Random forest | No | No | No |
| Interactive normalization | Yes | No | No | No | No | No |
| Metadata overlay | Yes | No | Yes | Yes | No | Yes |
| Taxonomy-to-phenotype mapping | Yes | No | No | No | Yes | No |
aAs of the writing of this paper, use on Mac and Linux required either virtualization or compilation from source code.
bSome graphical interaction is available on script-generated HTML pages.
cDoes not read files from mothur or QIIME.