| Literature DB >> 28398290 |
Aria S Hahn1,2, Tomer Altman3,4, Kishori M Konwar1,2,5, Niels W Hanson1, Dongjae Kim6, David A Relman7,8,9, David L Dill10, Steven J Hallam1,2,11.
Abstract
Advances in high-throughput sequencing are reshaping how we perceive microbial communities inhabiting the human body, with implications for therapeutic interventions. Several large-scale datasets derived from hundreds of human microbiome samples sourced from multiple studies are now publicly available. However, idiosyncratic data processing methods between studies introduce systematic differences that confound comparative analyses. To overcome these challenges, we developed GutCyc, a compendium of environmental pathway genome databases (ePGDBs) constructed from 418 assembled human microbiome datasets using MetaPathways, enabling reproducible functional metagenomic annotation. We also generated metabolic network reconstructions for each metagenome using the Pathway Tools software, empowering researchers and clinicians interested in visualizing and interpreting metabolic pathways encoded by the human gut microbiome. For the first time, GutCyc provides consistent annotations and metabolic pathway predictions, making possible comparative community analyses between health and disease states in inflammatory bowel disease, Crohn's disease, and type 2 diabetes. GutCyc data products are searchable online, or may be downloaded and explored locally using MetaPathways and Pathway Tools.Entities:
Mesh:
Year: 2017 PMID: 28398290 PMCID: PMC5387927 DOI: 10.1038/sdata.2017.35
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Figure 1The GutCyc pipeline diagram.
The MetaPathways pipeline consists of five modular stages including (1) Quality control (QC) and open reading frame (ORF) prediction (2) Functional and taxonomic annotation, (3) Analysis (4) ePGDB construction, and (5) Pathway export. Inputs and programs are depicted on the left with corresponding output directories and exported files on the right.
Summary statistics for the GutCyc Collection across 418 samples.
| The statistics for the number of bases processed is in units of Megabases. ‘Func. Annots.’ are functional annotations. ‘Trans. Reactions’ are transport reactions. ‘Compounds’ are small molecule metabolites. ‘Base Pathways’ include all pathways except complex pathways known as Super-Pathways. | |||||
|---|---|---|---|---|---|
| Bases | 0.98 | 54.75 | 81.35 | 113.75 | 370.51 |
| Contigs | 2,506 | 27,788 | 47,486.5 | 76,275.75 | 399,331 |
| ORFs | 2,448 | 61,703.5 | 95.531 | 139,690 | 550,312 |
| Func. Annots. | 2,176 | 57,102.25 | 86,054.5 | 123,747.25 | 425,033 |
| Reactions | 1,635 | 2,385.5 | 3,438 | 3,667.75 | 4,881 |
| Trans. Reactions | 12 | 26 | 31 | 34 | 46 |
| Compounds | 1,052 | 1,678 | 2,008.5 | 2,119.5 | 2,676 |
| Base Pathways | 257 | 350 | 616 | 654 | 832 |
Figure 2GutCyc ePGDB use cases.
In the upper left and upper right insets, a GutCyc ePGDB is opened in MetaPathwaysIn the upper left, we display the Pipeline Execution step, and the Process Monitor interfaces. In the upper right, we display the Summary Table (with exportable sample statistics), and the Pathway Table (with exportable pathway abundances) interfaces. In the lower four inset images, a GutCyc ePGDB is opened in Pathway Tools. Clockwise from the upper left, we display the ePGDB summary statistics, interactive metabolic network visualization, the Pathway View, and the biochemical Reaction View.
Figure 3The Cellular Overview for the SRS056259Cyc ePGDB, at three different zoom levels.
Compounds are highlighted in red if identified from a mass spectrometry analysis of the gut microbiome[63], and otherwise appear in grey. Reactions with enzyme data in SRS056259Cyc are drawn in blue. The top left inset shows a fraction of the full metabolic map. The middle inset shows a zoom-in of the ‘Secondary Metabolite Degradation’ pathway class. Bottom right inset shows zoom-in on Pathway P562-PWY, ‘myo-, chiro-, and scillo-inositol degradation pathway’, showing four mass-spectrometry identified compounds in red.