| Literature DB >> 27327084 |
Min Kyung Kim1, Anatoliy Lane2, James J Kelley1, Desmond S Lun1,2,3,4.
Abstract
BACKGROUND: Several methods have been developed to predict system-wide and condition-specific intracellular metabolic fluxes by integrating transcriptomic data with genome-scale metabolic models. While powerful in many settings, existing methods have several shortcomings, and it is unclear which method has the best accuracy in general because of limited validation against experimentally measured intracellular fluxes.Entities:
Mesh:
Year: 2016 PMID: 27327084 PMCID: PMC4915706 DOI: 10.1371/journal.pone.0157101
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of the desirable features of a method for predicting intracellular metabolic fluxes using transcriptomic data-integrated genomic models.
| Desirable features | Benefits | |
|---|---|---|
| 1 | Requirement for only a single gene expression data as input | Simpler analysis with less effort and cost |
| 2 | Use of continuous gene expression values without using arbitrary thresholds | Acquisition of more fine-grained information by avoiding arbitrary classification of gene expression levels |
| 3 | Capability to be used when an appropriate objective function is unknown | Applications to microorganisms with variable biomass composition, pathogens in dormancy or in latent phase, or cells of a multi-cellular organism |
| 4 | Capability to produce a unique metabolic flux distribution | More reproducible analysis independent of hardwares and softwares used to solve optimization problems |
| 5 | Capability to be used when the carbon source of the system and its uptake rate is unknown | Applications to microorganisms living in intact tissues or in natural environments |
Five desirable features of a new method are listed in the left side of the table, and the corresponding benefits are described in the right column.
Datasets and metabolic models used for this study.
| Genome-scale metabolic model | For | Yeast 5 [ | |
| For | Yeast 5 [ | ||
| Transcriptomic data & measured flux data | Dataset 1 | ||
| Dataset 2 | |||
| Total | 11 conditions in | 9 conditions in | |
Fig 1Flow chart illustrating how to choose between E-Flux2 and SPOT.
Fig 3Test of our methods onto older models of E. coli and S. cerevisiae.
We tested our methods on older models of E. coli (iJR904 and iAF1260) and those of S. cerevisiae (iND750 and iMM904) to examine the applicability of our methods to the relatively incomplete models. The x-axis represents the four different optimization strategies and the y-axis identifies the average Pearson correlation coefficient between the predicted fluxes and the measured fluxes of E. coli (Fig 3a) and S. cerevisiae (Fig 3b). Error bars represent standard error of the mean (SEM).
Validation of the accuracy of our predictions against measured intracellular fluxes.
| Known C source (glucose, in this case) | Unknown C source | ||||||
| Known objective function (biomass, in this case) | Standard FBA | pFBA | FBA+min | DC+E-Flux | AC+E-Flux | ||
| Known C uptake rate | Unknown C uptake rate | ||||||
| Unknown objective function | DC+Lee | AC+Lee | |||||
The Pearson correlation between the predicted and the measured intracellular fluxes was calculated to validate the predictive accuracy of our method. The correlation values were grouped into four different cases depending on the availability of carbon source or objective function information. The bold number in each category of the table presents the average correlation of 11 samples in E. coli and 9 samples in S. cerevisiae. The number to the right of the ± indicates its standard deviation. Since the fluxes predicted by standard FBA, pFBA and E-Flux are not unique, the output flux obtained using our specific implementation was used to calculate the average correlation. For FBA and E-Flux solutions, the minimum and the maximum correlations between predicted fluxes and the measured fluxes that each method can theoretically achieve are given within square brackets after their average correlations. The way that we calculated the possible range of correlations of each method is described in Supplementary Methods in S1 File. Note that the maximum possible correlation can be calculated only when we already have the measured flux datasets. There is no way to force each method to produce a metabolic flux distribution that achieves the best correlation with the measured fluxes. Our methods, E-Flux2 and SPOT, were developed during the process of testing various strategies for producing unique flux distributions and identifying those that achieve good correlation on average with measured fluxes.
1) metabolic flux distributions produced by these methods—FBA, pFBA, E-Flux- are not unique
Fig 2Comparison of the predicted fluxes with the measured fluxes of E. coli data (WT 0.5h-1 sample).
The x-axis represents metabolic reactions used to calculate correlation between the measured (blue bars in the figure) and the predicted fluxes (red bars in the figure), and the y-axis indicates flux value. The scale and the units on the y-axis are based on those of the measured flux.