| Literature DB >> 28659815 |
Nirupama Benis1, Soumya K Kar1, Vitor A P Martins Dos Santos2,3, Mari A Smits4,5, Dirkjan Schokker4, Maria Suarez-Diez2.
Abstract
The genotype and external phenotype of organisms are linked by so-called internal phenotypes which are influenced by environmental conditions. In this study, we used five existing -omics datasets representing five different layers of internal phenotypes, which were simultaneously measured in dietarily perturbed mice. We performed 10 pair-wise correlation analyses verified with a null model built from randomized data. Subsequently, the inferred networks were merged and literature mined for co-occurrences of identified linked nodes. Densely connected internal phenotypes emerged. Forty-five nodes have links with all other data-types and we denote them "connectivity hubs." In literature, we found proof of 6% of the 577 connections, suggesting a biological meaning for the observed correlations. The observed connectivities between metabolite and cytokines hubs showed higher numbers of literature hits as compared to the number of literature hits on the connectivities between the microbiota and gene expression internal phenotypes. We conclude that multi-level integrated networks may help to generate hypotheses and to design experiments aiming to further close the gap between genotype and phenotype. We describe and/or hypothesize on the biological relevance of four identified multi-level connectivity hubs.Entities:
Keywords: data integration; gastrointestinal tract; internal phenotype; metabolomics; microbiota; proteomics; systems biology; transcriptomics
Year: 2017 PMID: 28659815 PMCID: PMC5467433 DOI: 10.3389/fphys.2017.00388
Source DB: PubMed Journal: Front Physiol ISSN: 1664-042X Impact factor: 4.566
Figure 1Relationship between the external phenotype (P), the genotype (G), the environment (E), and the G&E interactions. The internal phenotypic layers and the environmental factor with a darker outline are included in the present study.
Pre-processing and specificities of each data-type.
| Sampling | Ileum | Ileum | Serum | Serum | Urine |
| Before pre-processing | 16,410 * 33 | 148 * 33 | 23 * 36 | 41 * 36 | 16 * 28 |
| After pre-processing | 52 * 33 | 22 * 33 | 13 * 36 | 26 * 36 | 16 * 28 |
Details of the site of sampling and data dimensions before and after pre-processing are indicated. The first number indicates the number of variables in the data and the second number denotes the number of samples.
The 10 individual correlation networks.
| Metabolomics Serum & Metabolomics Urine | −0.51 | 0.6 | 14 | 12 | 2 |
| Metabolomics Serum & Microbiota | −0.38 | 0.3 | 21 | 14 | 1 |
| Metabolomics Serum & Transcriptomics | −0.31 | 0.35 | 26 | 33 | 2 |
| Metabolomics Serum & Cytokine | −0.33 | 0.5 | 18 | 13 | 2 |
| Metabolomics Urine & Microbiota | −0.28 | 0.42 | 16 | 12 | 1 |
| Metabolomics Urine & Transcriptomics | −0.55 | 0.54 | 14 | 29 | 1 |
| Metabolomics Urine & Cytokine | −0.32 | 0.55 | 10 | 8 | 2 |
| Microbiota & Transcriptomics | −0.28 | 0.27 | 19 | 48 | 1 |
| Microbiota & Cytokine | −0.38 | 0.35 | 11 | 11 | 1 |
| Transcriptomics & Cytokine | −0.27 | 0.34 | 31 | 13 | 2 |
Each row represents one of the 10 correlation networks. Low Threshold and High Threshold represent the thresholds used for the correlation values. The 3rd and 4th columns have the number of nodes in the network that belong to the first and second data, respectively. The last column displays the number of connected graphs in the network.
Figure 2Multilevel integration. This schematic image shows the number of connections between each internal phenotypic level with the other levels in a merged network. The colors of the parallelograms denote the internal phenotypic level to which the data-types belong. Green is Metabolomics (light green—Metabolite from Serum and dark green—Metabolite from Urine), blue is Cytokines, red is Transcriptomics, and pink is Microbiota. Each line connects two levels and the vertical number above the line indicates the number of edges in the correlation network between those two phenotypic levels. The number of connected nodes in each level is given in circles above and below the connecting lines.
Figure 3Distribution of network correlations and random network cut-offs of the Metabolomics Urine and Transcriptomics networks. The x-axis depicts the range of correlation values and the y- axis shows its frequency. The gray bars denote the distribution of the thresholds of the 1,000 random correlation networks with frequency on the left y-axis. The red bars are distributions of the correlation values of the inferred network with frequency on the right y-axis.
Characteristics of the merged network.
| Total number of nodes | 112 (45) |
| Total number of edges | 577 |
| Number of Metabolomics Urine nodes | 15 (8) |
| Number of Metabolomics Serum nodes | 24 (11) |
| Number of Cytokine nodes | 13 (7) |
| Number of Transcriptomics nodes | 43 (12) |
| Number of Microbiota nodes | 17 (7) |
| Degree range | 2–57 |
| Average number of neighbors | 10.35 |
| Clustering coefficient | 0.20 |
| Characteristic path length | 2.31 |
| Network density | 0.09 |
| Connected components | 1 |
Characteristics of the merged correlation network. The number of nodes from each data-type are given in rows three to seven. Between brackets the number of connectivity hubs is indicated.
Overview of text mining results.
| Cytokines & Metabolomics Serum | 9,554 | 16 |
| Metabolomics Serum & Metabolomics Urine | 906 | 6 |
| Microbiota & Metabolomics Serum | 254 | 7 |
| Microbiota & Cytokines | 250 | 5 |
| Transcriptomics & Microbiota | 83 | 3 |
| Metabolomics Serum & Transcriptomics | 59 | 2 |
The first column shows the types of data that are connected by the edges that were found in the PubMed literature search.
Figure 4Glutathione sub-network. This figure shows the 21 connections of the node Glutathione in the merged network. The different colors of nodes indicate the data-type of internal phenotypic level of that node, pink is Microbiota, red is Transcriptomics, blue is Cytokines, and green is Metabolites (light green—Metabolites from Serum and dark green—Metabolites from Urine). Oval nodes are connectivity hubs. Dotted lines show un-validated edges and continuous, thicker edges show connections also present in the results retrieved from scientific literature. Edge color, yellow and purple, indicates positive and negative correlations, respectively.