| Literature DB >> 22934050 |
Anna S Blazier1, Jason A Papin.
Abstract
With the advent of high-throughput technologies, the field of systems biology has amassed an abundance of "omics" data, quantifying thousands of cellular components across a variety of scales, ranging from mRNA transcript levels to metabolite quantities. Methods are needed to not only integrate this omics data but to also use this data to heighten the predictive capabilities of computational models. Several recent studies have successfully demonstrated how flux balance analysis (FBA), a constraint-based modeling approach, can be used to integrate transcriptomic data into genome-scale metabolic network reconstructions to generate predictive computational models. In this review, we summarize such FBA-based methods for integrating expression data into genome-scale metabolic network reconstructions, highlighting their advantages as well as their limitations.Entities:
Keywords: data integration; expression data; flux balance analysis; metabolic networks; transcriptomics
Year: 2012 PMID: 22934050 PMCID: PMC3429070 DOI: 10.3389/fphys.2012.00299
Source DB: PubMed Journal: Front Physiol ISSN: 1664-042X Impact factor: 4.566
Figure 1(A) Toy network and sample expression data for three different experimental conditions. Panels (B), (C), (D), (E) and (F) demonstrate how the discussed algorithms use the sample expression data to modify the flow of reaction flux, shown in the boldness of the reaction arrow, through the toy network. (B) GIMME compares the expression data to a threshold and subsequently removes reactions whose expression levels fall below the threshold. In the second case where the threshold is 0.50, the removal of R3 causes the network to not achieve its user-specified objective function, resulting in a non-functional model; thus, it is added back into the network to create a functional model. (C) iMAT discretizes the expression data into lowly, moderately, and highly expressed reaction sets and removes reactions that are lowly expressed from the network. Because the network is not required to achieve a specified objective function, both cases of thresholding result in a functional model. However, in the second case where the low cutoff is 0.50 and the high cutoff is 0.75, R3 is considered to be post-transcriptionally down-regulated because, even though expression data shows it to be highly expressed, the resulting toy network indicates that R3 has little, if any, reaction flux. (D) MADE determines a sequence of binary expression states by maximizing the statistically significant changes across the expression states and simultaneously solves the flux balance analysis problem for each of the experimental conditions, resulting in unique toy networks for each condition. (E) E-Flux uses the expression data to modify the maximum possible flux, MaxFluxi, through the relevant reactions. (F) PROM first binarizes the data according to a threshold to determine on/off states and then determines the probability that a gene is on when its transcription factor is also on. In this example, we assume that the transcription factors are on for all of the genes. Using this probability, PROM modifies the MaxFluxi through the relevant reactions.
Summary of the algorithms for the integration of expression data.
| GIMME | Compares expression levels to a threshold to determine sets of active reactions and sets of inactive reactions in a reconstruction and returns a functioning model that meets an assumed objective function | Only requires one gene expression data set | Requires thresholding of mRNA transcript levels relative to a user-specified value | Becker and Palsson, |
| iMAT | Uses expression data to determine highly active, moderately active, and lowly active sets of reactions in the reconstruction and solves a MILP problem to return a functioning model | Does not require | Requires discretization of expression data into lowly, moderately and highly expressed genes | Shlomi et al., |
| MADE | Requires two or more sets of microarray data to create a sequence of binary expression states for a reconstruction's reactions, removing the need for arbitrary thresholding | Does not require a user-supplied threshold to determine which reactions are highly expressed and which are lowly expressed | Requires multiple datasets for differential expression | Jensen and Papin, |
| E-Flux | Compares expression levels to a threshold and subsequently constrains the upper bounds of the reactions that are lowly expressed | Does not reduce gene expression data to binary on-off states | Requires a function to convert expression levels into an upper bound on fluxes | Colijn et al., |
| PROM | Determines the probability that a gene is active relative to the activity of its transcription factor according to expression data and subsequently constrains the maximum flux for relevant reactions by a factor of this probability | Incorporates regulatory interactions without a mechanistic model | Requires a large dataset for calculating transcription factor and target gene interactions | Chandrasekaran and Price, |