| Literature DB >> 31289831 |
Ken Youens-Clark1, Matt Bomhoff1, Alise J Ponsero1, Elisha M Wood-Charlson2, Joshua Lynch1, Illyoung Choi3, John H Hartman3, Bonnie L Hurwitz1,4.
Abstract
BACKGROUND: Scientists have amassed a wealth of microbiome datasets, making it possible to study microbes in biotic and abiotic systems on a population or planetary scale; however, this potential has not been fully realized given that the tools, datasets, and computation are available in diverse repositories and locations. To address this challenge, we developed iMicrobe.us, a community-driven microbiome data marketplace and tool exchange for users to integrate their own data and tools with those from the broader community.Entities:
Keywords: bioinformatics; cloud computing; cyberinfrastructure; metagenomics
Mesh:
Year: 2019 PMID: 31289831 PMCID: PMC6615980 DOI: 10.1093/gigascience/giz083
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:iMicrobe's architecture allows for the integration of datasets hosted by iMicrobe that can be placed into the data cart, those private to the user, and others publicly accessible on the Internet. Analyses created by iMicrobe or other developers run on Stampede2, and the results go into the user's home directory in the CyVerse Data Store.
Current list of available apps in iMicrobe
| App Name | Purpose |
|---|---|
| 16s_cluster-0.0.1u2 | Cluster 16S sequences [ |
| c-microbial-map-0.0.1u1 | Visualize geographic distribution of 16S sequences in the ocean [ |
| centrifuge-1.0.4u1 | Short-read taxonomic classification [ |
| centrifuge-bubble-0.0.5u1 | Visualization of Centrifuge analysis [ |
| ClusterGenomes-1.1.3u2 | Clusters genomes based on all-versus-all alignments |
| DIAMOND-0.9.10u1 | Fast read alignment of DNA or proteins [ |
| fizkin-0.0.3u1 | Pairwise sample comparison via |
| FragGeneScan-1.30.0u1 | Short-read ORF prediction [ |
| graftm-0.11.1u3 | Rapid community profiles from metagenomes [ |
| imicrobe-demultiplexer-0.0.1u1 | Demultiplexing pipeline for single and paired-end data [ |
| imicrobe-megahit-0.0.2u1 | Metagenomics read assembler [ |
| imicrobe-prokka-0.0.2u1 | Prokaryotic genome annotation [ |
| imicrobe-soapdenovo2–0.0.3u1 | Short-read assembler [ |
| libra-1.0 | Pairwise sample comparison via |
| MArVD-1.0.0u1 | Metagenomic Archaeal Virus Detector [ |
| mash-all-vs-all-0.0.5u1 | Pairwise sample comparison via Mash [ |
| MetaGeneAnnotator-1.1.0u1 | Prokaryotic and phage gene prediction [ |
| ohana-blast-0.0.9u2 | BLAST search to Ohana gene catalog [ |
| prodigal-2.6.3u3 | Gene prediction [ |
| Prokka-1.12.0u2 | Prokaryotic genome annotation [ |
| puma-0.3.0u1 | Annotation of human papillomavirus genomes [ |
| Read2RefMapper-1.1.0u2 | Filtering coverage of BAM files to a reference dataset [ |
| sra-fastq-dump-0.0.1u1 | Save sequences from SRA in CyVerse Data Store [ |
| trim-galore-0.4.5u1 | QC tool for trimming reads [ |
| Trimmomatic-0.36.0u2 | QC tool for trimming reads [ |
| uproc_dna-1.2.0u3 | Protein sequence classification [ |
| vContact-0.1.60u2 | Viral Contig Automatic Cluster Taxonomy [ |
| vContact_PCs-0.1.60u2 | Viral Contig Automatic Cluster Taxonomy [ |
| WIsH-Build-1.0.0u2 | Identify bacterial hosts from metagenomic data [ |
| WIsH-Predict-1.0.0u2 | Identify bacterial hosts from metagenomic data [ |
Figure 4:The app launch interface allows users to select input files and set parameters for the app. The input files may come from the user's own Data Store, any publicly available data in the Data Store such as files associated with their data cart or iMicrobe sample files, or any other file available over FTP or HTTP. The Agave API will copy the input files to the compute node when the job is run.
Comparison of metagenomic platforms’ cyberinfrastructure capabilities
| Capability | KBase | MGnify | MG-RAST | IMG/M | QIITA | iMicrobe |
|---|---|---|---|---|---|---|
| Create apps | ✓ | ✓ | ||||
| Run apps at will | ✓ | ✓ | ✓ | ✓ | ||
| Upload private data | ✓ | ✓ | ✓ | ✓ | ✓ | |
| Share private data | ✓ | |||||
| Search public data | ✓ | ✓ | ✓ | ✓ | ✓ |
*For users of JGI sequencing services.
Comparison of metagenomics platforms’ app capabilities
| Category | Capability | KBase | MGnify | MG-RAST | IMG/M | QIITA | iMicrobe |
|---|---|---|---|---|---|---|---|
| General | QC | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Genomics | Assembly | ✓ | ✓ | ||||
| Gene calling | ✓ | ✓ | |||||
| Gene annotation | ✓ | ✓ | |||||
| Metabolic modeling | ✓ | ||||||
| Sequence analysis | ✓ | ✓ | |||||
| Comparative genomics | ✓ | ||||||
| Metagenomics (assembly-based analysis) | Assembly | ✓ | ✓ | ✓ | |||
| Gene calling | ✓ | ✓ | ✓ | ||||
| Gene annotation | ✓ | ✓ | ✓ | ||||
| Taxonomic classification of contigs | ✓ | ✓ | ✓ | ||||
| Protein clustering | ✓ | ✓ | |||||
| Read mapping to contigs or other reference | ✓ | ||||||
| Metagenomics (read-based analysis) | Read taxonomic classification | ✓ | ✓ | ✓ | ✓ | ||
| Read ORF prediction | ✓ | ✓ | ✓ | ||||
| Read functional annotation | ✓ | ✓ | ✓ | ||||
| Read clustering | ✓ | ||||||
| Amplicon | Operational taxonomic units and taxonomic lineage | ✓ | ✓ | ✓ | ✓ | ✓ |
Figure 5:The iMicrobe user interface (UI) comprises a back end written in Node/JS that talks to MySQL and MongoDB databases to deliver JSON to a front end written in Elm, which also communicates with the Agave API for the computing resources of CyVerse Data Store and TACC's Stampede2 HPC cluster.