| Literature DB >> 29790941 |
Bérénice Batut1,2, Kévin Gravouil1,3,4,5, Clémence Defois1,3, Saskia Hiltemann6, Jean-François Brugère1, Eric Peyretaillade1,4, Pierre Peyret1,3.
Abstract
Background: New generations of sequencing platforms coupled to numerous bioinformatics tools have led to rapid technological progress in metagenomics and metatranscriptomics to investigate complex microorganism communities. Nevertheless, a combination of different bioinformatic tools remains necessary to draw conclusions out of microbiota studies. Modular and user-friendly tools would greatly improve such studies. Findings: We therefore developed ASaiM, an Open-Source Galaxy-based framework dedicated to microbiota data analyses. ASaiM provides an extensive collection of tools to assemble, extract, explore, and visualize microbiota information from raw metataxonomic, metagenomic, or metatranscriptomic sequences. To guide the analyses, several customizable workflows are included and are supported by tutorials and Galaxy interactive tours, which guide users through the analyses step by step. ASaiM is implemented as a Galaxy Docker flavour. It is scalable to thousands of datasets but also can be used on a normal PC. The associated source code is available under Apache 2 license at https://github.com/ASaiM/framework and documentation can be found online (http://asaim.readthedocs.io). Conclusions: Based on the Galaxy framework, ASaiM offers a sophisticated environment with a variety of tools, workflows, documentation, and training to scientists working on complex microorganism communities. It makes analysis and exploration analyses of microbiota data easy, quick, transparent, reproducible, and shareable.Entities:
Mesh:
Year: 2018 PMID: 29790941 PMCID: PMC6007547 DOI: 10.1093/gigascience/giy057
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Available tools in ASaiM
| Section | Subsection | Tools |
|---|---|---|
| File and meta tools | Data retrieval | EBISearch [ |
| Text manipulation | Tools from Galaxy ToolShed | |
| Sequence file manipulation | Tools from Galaxy ToolShed | |
| BAM/SAM file manipulation | SAM tools [ | |
| BIOM file manipulation | BIOM-Format tools [ | |
| Genomics tools | Quality control | FastQC [ |
| Clustering | CD-Hit [ | |
| Sorting and prediction | SortMeRNA [ | |
| Mapping | BWA [ | |
| Similarity search | NCBI Blast+ [ | |
| Alignment | HMMER3 [ | |
| Microbiota dedicated tools | Metagenomics data manipulation | VSEARCH [ |
| Assembly | MEGAHIT [ | |
| Metataxonomic sequence analysis | Mothur [ | |
| Taxonomy assignation on WGS sequences | MetaPhlAn2 [ | |
| Metabolism assignation | HUMAnN2 [ | |
| Combination of functional and taxonomic results | Combine MetaPhlAn2 and HUMAnN2 outputs | |
| Visualization | Export2graphlan [ |
This table presents the tools, organized in sections and subsections to help users. A more detailed table of the available tools and some documentation can be found in the online documentation (http://asaim.readthedocs.io/en/latest/tools/).
Figure 1:Main ASaiM workflow to analyze raw sequences. This workflow takes as input a dataset of raw shotgun sequences (in FastQ format) from microbiota, preprocess it (yellow boxes), extracts taxonomic (red boxes) and functional (purple boxes) assignations, and combines them (green boxes). Image available under CC-BY license (https://doi.org/10.6084/m9.figshare.5371396.v3).
Figure 2:Comparisons of the community structure for SRR072233. This figure compares the community structure between the expectations (mapping of the sequences on the expected genomes), data found on EBI Metagenomics database (extracted with the EBI Metagenomics pipeline), and the results of the main ASaiM workflow (Fig. 1).
Figure 3:Example of an investigation of the relation between community structure and functions. The involved species and their relative involvement in fatty acid biosynthesis pathways have been extracted with ASaiM workflow (Fig. 1) for SRR072233.