| Literature DB >> 22879885 |
Gregory Vey1, Gabriel Moreno-Hagelsieb.
Abstract
The derivation and comparison of biological interaction networks are vital for understanding the functional capacity and hierarchical organization of integrated microbial communities. In the current work we present metagenomic annotation networks as a novel taxonomy-free approach for understanding the functional architecture of metagenomes. Specifically, metagenomic operon predictions are exploited to derive functional interactions that are translated and categorized according to their associated functional annotations. The result is a collection of discrete networks of weighted annotation linkages. These networks are subsequently examined for the occurrence of annotation modules that portray the functional and organizational characteristics of various microbial communities. A variety of network perspectives and annotation categories are applied to recover a diverse range of modules with different degrees of annotative cohesiveness. Applications to biocatalyst discovery and human health issues are discussed, as well as the limitations of the current implementation.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22879885 PMCID: PMC3413691 DOI: 10.1371/journal.pone.0041283
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Data Source Diversity.
The relative proportions (%) of various data source types that were used are shown categorized according to IMG/M microbiome taxa at the class level. Panel A shows the proportions (%) with respect to the total number of data sets while Panel B shows the proportions (%) with respect to the total number of genes.
Figure 2Network Construction Workflow.
Operonic genes are predicted on the basis of co-direction and intergenic proximity using scaffolds containing more than one gene (Panel A). Operons and their constituent genes can be filtered according to the presence or absence of a target annotation such that at least one member of an operon is required to possess a target descriptor (Panel B). Note that the filter step is optional and can applied to obtain target perspective networks while being omitted in the construction of source perspective networks. Each gene in a given operon is mined for its various types of functional annotations where any particular type has a domain of existing values (Panel C). For each operon, the obtained functional annotations are used to infer bidirectional functional interactions for annotations having the same type but different values (Panel D). Note that interactions are inferred directly for immediately adjacent gene pairs and also transitively for downstream members within the same operon.
Figure 3Target Stringency versus Network Coverage.
Four polyketide target perspective networks were constructed with progressively increasing target stringency and each network was translated into each of the four annotation categories. The proportion of nodes and edges in each polyketide network was compared to its corresponding overall network. Panel A shows that coverage for nodes decreased for all annotation categories with increasing target stringency and Panel B shows that coverage for edges also decreased for all annotation categories with increasing target stringency.
Summary of Network Features.
| Network Type | Annotation | Perspective | Nodes | Edges | Modules |
| Cellulase | MetaCyc | Target | 213 | 779 | 5 |
| Cellulase | COG | Target | 301 | 763 | 33 |
| Human Gut | KEGG | Source | 153 | 192 | 11 |
| Human Gut | TIGRFAM | Source | 543 | 607 | 57 |
| Gut Intersection | TIGRFAM | Comparative | 407 | 278 | 19 |
| Gut Difference | TIGRFAM | Comparative | 356 | 329 | 20 |
The general features of each metagenomic functional network are shown including the type of network, the category of annotations used to construct the network, the network perspective, the number of nodes and edges that compose the network, and the number of predicted functional modules contained within the network.
Figure 4Metagenomic Cellulase Networks.
The target perspective networks for cellulase functional interactions are shown where large node diameter represents high node degree within each respective network. Panel A shows a network constructed using MetaCyc annotations with a highly connected central hub having the annotation PWY-1001: cellulose biosynthesis. The highlighted nodes represent the top ranking module which is enlarged in Panel B. Panel C shows a network constructed using COG annotations and features a highly connected central hub with the annotation COG1363: cellulase M and related proteins. The highlighted nodes represent the top ranking module which is enlarged in Panel D.
Top Ranked Functional Modules.
| Source | Score | Nodes | Edges | Members |
| MetaCyc Cellulase Network | 14.0 | 14 | 91 | Alanine biosynthesis I; Isoleucine biosynthesis I (from threonine); Isoleucine biosynthesis II; Isoleucine biosynthesis III; Isoleucine biosynthesis IV; Isoleucine biosynthesis V; Isoleucine degradation I; Isoleucine degradation II; Leucine biosynthesis; Leucine degradation I; Leucine degradation III; Valine biosynthesis; Valine degradation I; Valine degradation II |
| COG Cellulase Network | 15.0 | 15 | 105 | 3-polyprenyl-4-hydroxybenzoate decarboxylase; 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases; ABC-type cobalt transport system, permease component CbiQ and related transporters; ABC-type uncharacterized transport system, permease component; ABC-type uncharacterized transport systems, ATPase components; ATPase components of various ABC-type transport systems, contain duplicated ATPase; Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases; Adenine deaminase; Amidases related to nicotinamidase; Carbon dioxide concentrating mechanism/carboxysome shell protein; Inosine-uridine nucleoside N-ribohydrolase; Predicted metal-dependent hydrolase; Ribulose kinase; Uncharacterized ABC-type transport system, permease component; Uncharacterized conserved protein |
| KEGG Human GutNetwork | 6.2 | 13 | 37 | Bacitracin transport system; C5 isoprenoid biosynthesis, mevalonate pathway; Ceramide biosynthesis; Eicosanoid biosynthesis, arachidonate = >8(S)-HETE; Gluconeogenesis, oxaloacetate = > fructose-6P; Glycolysis (Embden-Meyerhof pathway), glucose = >pyruvate; Glycolysis, core module involving three-carbon compounds; Indolepyruvate:ferredoxin oxidoreductase; Non-phosphorylative Entner-Doudoroff pathway, gluconate = > glyceraldehyde + pyruvate; Phosphatidylcholine (PC) biosynthesis, PE = >PC; Phosphatidylethanolamine (PE) biosynthesis, ethanolamine = >PE; Pyruvate oxidation, pyruvate = > acetyl-CoA; Semi-phosphorylative Entner-Doudoroff pathway, gluconate = > glyceraldehyde-3P + pyruvate |
| TIGRFAM Human Gut Network | 8.2 | 18 | 70 | 30S ribosomal protein S17; 50S ribosomal protein L3, bacterial; 50S ribosomal protein L4, bacterial/organelle; Preprotein translocase, SecY subunit; Ribosomal protein L14, bacterial/organelle; Ribosomal protein L15, bacterial/organelle; Ribosomal protein L16, bacterial/organelle; Ribosomal protein L18, bacterial type; Ribosomal protein L2, bacterial/organellar; Ribosomal protein L22, bacterial type; Ribosomal protein L24, bacterial/organelle; Ribosomal protein L29; Ribosomal protein L30, bacterial/organelle; Ribosomal protein L6, bacterial type; Ribosomal protein S10, bacterial/organelle; Ribosomal protein S19, bacterial/organelle; Ribosomal protein S3, bacterial type; Ribosomal protein S5, bacterial/organelle type |
| Human Gut IntersectionNetwork | 8.2 | 13 | 49 | 30S ribosomal protein S17; 50S ribosomal protein L3, bacterial; 50S ribosomal protein L4, bacterial/organelle; Ribosomal protein L14, bacterial/organelle; Ribosomal protein L16, bacterial/organelle; Ribosomal protein L18, bacterial type; Ribosomal protein L2, bacterial/organellar; Ribosomal protein L22, bacterial type; Ribosomal protein L24, bacterial/organelle; Ribosomal protein L29; Ribosomal protein L6, bacterial type; Ribosomal protein S19, bacterial/organelle; Ribosomal protein S3, bacterial type |
| Human Gut DifferenceNetwork | 7.7 | 8 | 27 | 50S ribosomal protein L14P; 50S ribosomal protein L30P, archaeal; Archaeal ribosomal protein S17P; Ribosomal protein L24p/L26e, archaeal/eukaryotic; Ribosomal protein L29; Ribosomal protein S3, eukaryotic/archaeal type; Ribosomal protein S5(archaeal type)/S2(eukaryote cytosolic type); Translation initation factor SUI1, putative, prokaryotic |
The general features of the highest scoring functional module from each network are shown including the source network, the score assigned by MINE, the number of nodes and edges that compose the module, and the member annotations derived from the nodes. Member annotations are sorted in ascending lexicographical order and delimited using the semicolon symbol. For all top ranked modules each member is unique.
Figure 5Thematic Set Diagram.
The annotative themes for the top ranked MetaCyc module are depicted where the numeric values indicate the number of annotations belonging to a thematic category. Specifically, amino acid categories are represented vertically and metabolic categories are represented horizontally. Note, the vertical themes are encapsulatory while the horizontal themes are mutually exclusive. A variety of functional perspectives can be simultaneously visualized by way of the interacting and overlapping thematic sets.
Figure 6Human Gut Networks.
The source perspective networks for human gut functional interactions are shown where large node diameter represents high node degree within each respective network. Panel A shows a network constructed using KEGG annotations where the highlighted nodes represent the top ranking module which is enlarged in Panel B. Panel C shows a network constructed using TIGRFAM annotations where the highlighted nodes represent the top ranking module which is enlarged in Panel D.
Figure 7Comparative Gut Networks.
The comparative networks for human gut functional interactions are shown where large node diameter represents high node degree within each respective network. Specifically, two networks were constructed using TIGRFAM annotations and compared for mutual versus exclusive nodes. Panel A shows the intersection of the networks where the highlighted nodes represent the top ranking module which is enlarged in Panel B. Panel C shows the difference of the networks where the highlighted nodes represent the top ranking module which is enlarged in Panel D.