| Literature DB >> 29731765 |
Kevin Schwahn1, Zoran Nikoloski1,2,3.
Abstract
The availability of high-throughput data from transcriptomics and metabolomics technologies provides the opportunity to characterize the transcriptional effects on metabolism. Here we propose and evaluate two computational approaches rooted in data reduction techniques to identify and categorize transcriptional effects on metabolism by combining data on gene expression and metabolite levels. The approaches determine the partial correlation between two metabolite data profiles upon control of given principal components extracted from transcriptomics data profiles. Therefore, they allow us to investigate both data types with all features simultaneously without doing preselection of genes. The proposed approaches allow us to categorize the relation between pairs of metabolites as being under transcriptional or post-transcriptional regulation. The resulting classification is compared to existing literature and accumulated evidence about regulatory mechanism of reactions and pathways in the cases of Escherichia coli, Saccharomycies cerevisiae, and Arabidopsis thaliana.Entities:
Keywords: A. thaliana; E. coli; S. cerevisiae; data reduction; metabolomics; partial correlation; principal component analysis; regulation
Year: 2018 PMID: 29731765 PMCID: PMC5920133 DOI: 10.3389/fpls.2018.00538
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Schematic overview of the two approaches introduced in the study. The approaches Transcriptional dependent Partial Correlation (TPC) and Post-transcriptional dependent Partial Correlation (PPC) use the first p PCs of the transcriptomic data as control variables in the partial correlation calculation.
Figure 2Changes from Pearson to partial correlation. Changes from Pearson to partial correlation for all three organism (E. coli, S. cerevisiae and A. thaliana) and for TPC, positve correlation; TPC, negative correlations; PPC, positive correlations; PPC, negative correlations. The blue portion of the bar represents the percentage of significant correlations whose absolute value increased from Pearson to partial correlation. The red portion of the bar represents the percentage of significant correlations whose absolute value decreased from Pearson to partial correlation.
Number of significant correlations above certain thresholds for the TPC and PPC approach for the data of Oliveira et al. (2015).
| > 0.9 | 0 | 4 |
| > 0.8 | 0 | 151 |
| > 0.7 | 0 | 567 |
| > 0.6 | 2 | 1,170 |
| > 0.5 | 43 | 2,152 |
| > 0.4 | 456 | 3,260 |
| < −0.4 | 49 | 495 |
| < −0.5 | 1 | 125 |
| < −0.6 | 0 | 16 |
| < −0.7 | 0 | 0 |
Metabolite pairs found within the downstream motif of the approach by Oliveira et al. (2015) and our Post-transcriptional dependent Partial Correlation (PPC) approach.
| Pyrroline-3H-5C | Adenosine | 0.484 | 0.433 |
| Pyrroline-3H-5C | dGuanosine | 0.483 | 0.433 |
| Pyrroline-3H-5C | IMP | 0.759 | 0.736 |
| Indole-3-acetate | Adenosine | 0.584 | 0.538 |
| Indole-3-acetate | dGuanosine | 0.584 | 0.538 |
| Indole-3-acetate | IMP | 0.616 | 0.571 |
| Adenosine | IMP | 0.744 | 0.708 |
| Adenosine | L-Aspartate | −0.471 | −0.418 |
| dGuanosine | L-Aspartate | −0.471 | −0.418 |
| dGuanosine | IMP | 0.744 | 0.701 |
Metabolite pairs found within the upstream and parallel motif of the approach by Oliveira et al. (2015) and our Transcriptional dependent Partial Correlation (TPC) approach.
| NAD | AICAR | 0.468 | 0.508 |
| Thiamin triphosphate | AICAR | 0.468 | 0.392 |
| Thiamin triphosphate | L-Leucine | 0.367 | 0.397 |
| Thiamin triphosphate | 5-L-Glutamyl-L-alanine | 0.452 | 0.398 |
| Ornithine | Dihydroxyacetone | −0.415 | −0.377 |
| Ornithine | Glyceraldehyde | −0.415 | −0.377 |
| Ornithine | D-Lactate | −0.415 | −0.377 |
| Ornithine | Imidazole glycerol-P | −0.374 | −0.289 |
| L-Leucine | AICAR | 0.434 | 0.462 |
| GABA | Glyceraldehyde | −0.384 | −0.348 |
| GABA | Glutamine | −0.388 | −0.319 |
Figure 3Distribution of the absolute difference of Pearson and partial correlation. The boxplots show the absolute difference of the Pearson and partial correlations for each of the three organism (E. coli, S. cerevisiae, and A. thaliana) and the two approaches, respectively.