| Literature DB >> 19038030 |
Miguel Rocha1, Paulo Maia, Rui Mendes, José P Pinto, Eugénio C Ferreira, Jens Nielsen, Kiran Raosaheb Patil, Isabel Rocha.
Abstract
BACKGROUND: One of the greatest challenges in Metabolic Engineering is to develop quantitative models and algorithms to identify a set of genetic manipulations that will result in a microbial strain with a desirable metabolic phenotype which typically means having a high yield/productivity. This challenge is not only due to the inherent complexity of the metabolic and regulatory networks, but also to the lack of appropriate modelling and optimization tools. To this end, Evolutionary Algorithms (EAs) have been proposed for in silico metabolic engineering, for example, to identify sets of gene deletions towards maximization of a desired physiological objective function. In this approach, each mutant strain is evaluated by resorting to the simulation of its phenotype using the Flux-Balance Analysis (FBA) approach, together with the premise that microorganisms have maximized their growth along natural evolution.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19038030 PMCID: PMC2612012 DOI: 10.1186/1471-2105-9-499
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The process of solution decoding and evaluation. The solutions (individuals) are encoded using the proposed set-based representation where the genes to be deleted are represented. Each individual is decoded by imposing additional constraints to the original metabolic model. FBA is the method used to simulate the cellular behaviour that will be associated to a given fitness, related with the productivity in a given interesting compound.
Figure 2Comparison of the EA and SA algorithms. A scheme illustrating the major steps of the EA and SA algorithms developed in this work. Each individual represents a set of genes/reactions to be deleted from the model. The task is to find one or more individuals that are predicted to have high yield or productivity (in general, any flux phenotype). The prediction of the flux phenotype is based on optimality principles of metabolic networks such as Flux Balance Analysis.
Statistics of the genome-scale models used in the case studies
| Number of fluxes | 1104 | 1075 |
| Number of metabolites | 825 | 761 |
| Number of fluxes after pre-processing | 460 | 549 |
| Number of variables in the optimization | 268 | 301 |
The details on the original models are given, together with the statistics of the models after the pre-processing steps.
Figure 3Boxplots with the results obtained by the SA and SEA. The eight graphs (one for each case study and algorithm SA or SEA) show a set of boxplots (one for each value of the k, the number of knockouts, and one for the variable size variant) with the following statistics: best value (maximum), quartiles, median and minimum value; the mean value is also shown as a blue dot (or a red dot in the case of the variable size). All values are calculated over the 30 runs for each scenario.
Size of the best solutions obtained by the variable size SEA/SA
| 38 | 17 | |
| 8.7 | 9 | |
| 3 | 3.6 | |
| 15 | 16 | |
This table shows the average number of knockouts in the solutions obtained by the variable size variants of SEA and SA. Only the best solutions found over the 30 runs are considered.
Results obtained by the greedy algorithm for the case studies
| 0.03260 | 23 | |
| 0.00000 | - | |
| 0.25527 | 3 | |
| 0.07779 | 3 | |
This table shows the results obtained by the greedy algorithm. The first column shows the details on the case study (organism, desired product, conditions); the second column states the value of the objective function (BPCY) and the last column the number of knockouts, both obtained for the best solution found.
Figure 4Convergence plot of SA and SEA. Convergence plot of the SA and SEA algorithms (variable size variant) in the case study with E. coli production of succinate (x-axis represents the number of function evaluations; y-axis plots BPCY values)
Figure 5Analysis of the best solution and partial solutions for the . One of the best solutions found for the E. coli, succinate case study was analyzed. All sub-solutions, i.e. solutions with a sub-set of the original gene deletions, were evaluated. The best ones for each set size are shown in the figure. Possible ways to reach a solution are shown (in black, grow mutations; in red, crossover operations).
Best overall mutants obtained for each case study
| 0.05398 | PGI1_1, PGI1_2, FBP1, PDC6, ADH4, SDH3_2, AAH1_1, URH1_1, U30_, MET3, ALD4_2, GSH1, U103_, YER053C, CTP1_1* | |
| 0.39850 | ALCD19, DRPA, GLYCDx, F6PA, TPI, LDH_D2, EDA, TKT2, LDH_D- | |
| 0.25527 | FRD3, GART, ADHEr * | |
| 0.35785 | MALS, ORNDC, FUM, GLYCL, GHMT2, ADPT, DCYTD, DUTPDP, URIDK2r, NTD8, PUNP1, THD2, GND, PFL, SUCFUMt * | |
This table shows the list of knockouts in the best solution found in all the runs of both algorithms. In the rows marked with *, other solutions with the same BPCT exist; the one with less knockouts was selected (see Additional file 2 for the complete list)