| Literature DB >> 29106667 |
Francislon S Oliveira1,2, John Brestelli3,4, Shon Cade5, Jie Zheng3,4, John Iodice3,4, Steve Fischer3,4, Cristina Aurrecoechea6, Jessica C Kissinger6,7,8, Brian P Brunk3,5, Christian J Stoeckert3,4, Gabriel R Fernandes1, David S Roos5, Daniel P Beiting9.
Abstract
MicrobiomeDB (http://microbiomeDB.org) is a data discovery and analysis platform that empowers researchers to fully leverage experimental variables to interrogate microbiome datasets. MicrobiomeDB was developed in collaboration with the Eukaryotic Pathogens Bioinformatics Resource Center (http://EuPathDB.org) and leverages the infrastructure and user interface of EuPathDB, which allows users to construct in silico experiments using an intuitive graphical 'strategy' approach. The current release of the database integrates microbial census data with sample details for nearly 14 000 samples originating from human, animal and environmental sources, including over 9000 samples from healthy human subjects in the Human Microbiome Project (http://portal.ihmpdcc.org/). Query results can be statistically analyzed and graphically visualized via interactive web applications launched directly in the browser, providing insight into microbial community diversity and allowing users to identify taxa associated with any experimental covariate.Entities:
Mesh:
Year: 2018 PMID: 29106667 PMCID: PMC5753346 DOI: 10.1093/nar/gkx1027
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Schematic showing the automated data loading workflow for MicrobiomeDB. Greengenes identifiers are extracted from .biom files containing microbial community census data and used to retrieve NCBI taxon identifiers, full 16S rRNA gene sequences, and taxon strings. User-provided sample details are mapped to an OBO Foundry ontology to expand a EuPathDB local application ontology. Sample details are formatted as an Investigation, Study, Assay (ISA) file and, along with microbiome census data, are structured in a GUS4 schema for loading into MicrobiomeDB. Manual curation is used to produce a custom microbiome display terminology for searching sample details on the website.
Figure 2.Screenshot of the filter page for searching by sample details. (A) The filter list shows all sample details describing all the samples in the database. This list is searchable via a reactive text box (red arrow) (B) Selecting any term from the filter list shows all the values associated with that term and the number of samples from the database that match each value. (B) Users filter the samples in the database by selecting values of interest. (C) Any filter applied by the user remains accessible through filter history at the top of the page.
Figure 3.Screenshot of the query results page. (A) The strategy panel provides users with an interface to name, share and expand on their initial query, thereby constructing in silico experiments. Query results are shown as ‘Step 1’, and additional queries can be added as additional steps. (B) The sample results panel shows all samples matching the search strategy, which can be downloaded (black arrow). Users can visualize and statistically analyze their query results by accessing a suite of interactive web apps (magenta arrow). (C and D) Details and data for individual samples can be viewed by clicking on the sample identifier.
Figure 4.Screenshot of the relative abundance app. (A) The analysis tab of the results page provides access to a suite of interactive web apps for visualization and analysis of microbial community diversity and composition. (B) Selecting the relative abundance app displays a horizontal stacked bar chart of the top ten most abundant taxa. Users can customize this graphic by selecting taxonomic level and sample details to partition the samples into groups. (C) Navigating to the ‘top taxa comparison’ tab of the app displays this data as a box-and-whisker plot that allows the same customization as the stacked bar chart. (D) Double clicking on any single taxon in panel B, or navigating to the ‘single taxon’ tab of the app and entering a taxon of interest, displays a graph of that taxon with statistical analysis comparing the relative abundance between the user-defined group(s).
Figure 5.Screenshot of the differential abundance app. (A) The analysis tab of the results page provides access to a suite of interactive web apps for visualization and analysis of microbial community diversity and composition. (B) Selecting the differential abundance app presents users with several drop down menus to customize their analysis. After choosing the taxonomic level, the design factor, and the pairwise comparison of interest, DESeq2 is run to identify differentially abundant taxa. Results are displayed as a ‘lollipop’ chart where color indicates phylum, length of the lollipop indicates log2 fold change (X-axis), and size of the lollipop reflects statistical significance. (C) Moving the cursor over any lollipop displays a plot of relative abundance with statistics for that taxon.