| Literature DB >> 20863397 |
Moritz von Stosch1, Joana Peres, Sebastião Feyo de Azevedo, Rui Oliveira.
Abstract
BACKGROUND: This paper presents a method for modelling dynamical biochemical networks with intrinsic time delays. Since the fundamental mechanisms leading to such delays are many times unknown, non conventional modelling approaches become necessary. Herein, a hybrid semi-parametric identification methodology is proposed in which discrete time series are incorporated into fundamental material balance models. This integration results in hybrid delay differential equations which can be applied to identify unknown cellular dynamics.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20863397 PMCID: PMC2955604 DOI: 10.1186/1752-0509-4-131
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1Network Structures. Delay TF-A transcription model. (A) true network structure (B) DDEHM network without prior knowledge, (C) DDEHM network with some prior knowledge. In structures (B) and (C), the ANN comprises three layers. The nodes of the input and output layer have linear transition functions, except for the input node of the time which has a hyperbolic tangential transition function as do the nodes of the hidden layer.
Figure 2Impact of delays on the TF-A profile. Demonstration of the impact of the delay on the trajectory of TF-A transcription model over time. The TF-A model trajectory without delay is the blue dashed line while the TF-A trajectory with delay is the green continuous line.
Results for Case Study I
| NN | τ | BIC | MSE | NN | τ | BIC | MSE | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| train | valid | test | train | valid | test | train | valid | test | train | valid | test | ||||
| 5 | 0 | -12217 | -5836 | -5997 | 0.0141 | 0.0152 | 0.0210 | 6 | 0 | -12220 | -5869 | -6039 | 0.0139 | 0.0157 | 0.0222 |
| 2 | 100 | -13118 | -6209 | -6190 | 0.0368 | 0.0350 | 0.0337 | 2 | 110 | -13058 | -6150 | -6157 | 0.0347 | 0.0310 | 0.0315 |
| 3 | 100 | -13087 | -6273 | -6336 | 0.0350 | 0.0384 | 0.0437 | 3 | 110 | -13043 | -6269 | -6275 | 0.0334 | 0.0381 | 0.0385 |
| 4 | 100 | -11826 | -5650 | -5888 | 0.0096 | 0.0105 | 0.0170 | 4 | 110 | -12273 | -5805 | -5832 | 0.0151 | 0.0144 | 0.0152 |
| 5 | 100 | -11386 | -5379 | -5733 | 0.0060 | 0.0059 | 0.0120 | 5 | 110 | -12302 | -5864 | -6008 | 0.0152 | 0.0156 | 0.0210 |
| 6 | 100 | -12873 | -6174 | -6176 | 0.0265 | 0.0282 | 0.0284 | 6 | 110 | -13162 | -6336 | -6329 | 0.0355 | 0.0392 | 0.0386 |
| 7 | 100 | -13144 | -6269 | -6176 | 0.0342 | 0.0330 | 0.0273 | 7 | 110 | -11516 | -5572 | -5731 | 0.0066 | 0.0081 | 0.0111 |
| 2 | 120 | -13047 | -6148 | -6139 | 0.0343 | 0.0309 | 0.0303 | 2 | 130 | -13242 | -6332 | -6371 | 0.0417 | 0.0449 | 0.0486 |
| 3 | 120 | -12105 | -5782 | -5960 | 0.0130 | 0.0142 | 0.0204 | 3 | 130 | -13076 | -6173 | -6203 | 0.0346 | 0.0314 | 0.0333 |
| 4 | 120 | -11974 | -5761 | -5891 | 0.0111 | 0.0132 | 0.0171 | 4 | 130 | -12652 | -6087 | -6090 | 0.0221 | 0.0254 | 0.0256 |
| 5 | 120 | -11436 | -5462 | -5489 | 0.0062 | 0.0068 | 0.0071 | 5 | 130 | -11823 | -5604 | -5676 | 0.0094 | 0.0092 | 0.0107 |
| 6 | 120 | -10820 | -5170 | -5714 | 0.0033 | 0.0036 | 0.0108 | 6 | 130 | -12679 | -6093 | -6108 | 0.0218 | 0.0240 | 0.0247 |
| 7 | 120 | -12533 | -6002 | -5881 | 0.0184 | 0.0193 | 0.0151 | 7 | 130 | -13269 | -6384 | -6393 | 0.0388 | 0.0417 | 0.0424 |
| 2 | 140 | -13069 | -6155 | -6167 | 0.0351 | 0.0313 | 0.0321 | 2 | 160 | -13195 | -6295 | -6257 | 0.0398 | 0.0416 | 0.0385 |
| 3 | 140 | -12303 | -5805 | -5803 | 0.0158 | 0.0149 | 0.0149 | 3 | 160 | -12252 | -5823 | -5771 | 0.0151 | 0.0155 | 0.0139 |
| 4 | 140 | -13288 | -6375 | -6384 | 0.0420 | 0.0455 | 0.0464 | 4 | 160 | -13063 | -6186 | -6241 | 0.0334 | 0.0311 | 0.0347 |
| 5 | 140 | -12537 | -6043 | -6039 | 0.0193 | 0.0225 | 0.0223 | 5 | 160 | -12022 | -5716 | -5909 | 0.0114 | 0.0116 | 0.0171 |
| 6 | 140 | -12564 | -6067 | -6078 | 0.0194 | 0.0228 | 0.0233 | 6 | 160 | -12052 | -5800 | -5995 | 0.0116 | 0.0133 | 0.0197 |
| 7 | 140 | -11439 | -5535 | -5994 | 0.0061 | 0.0075 | 0.0189 | 7 | 160 | -11466 | -5431 | -5441 | 0.0063 | 0.0061 | 0.0062 |
| 2 | 80, 120 | -13016 | -6146 | -6079 | 0.0330 | 0.0305 | 0.0266 | 2 | 120, 160 | -12984 | -6137 | -6027 | 0.0320 | 0.0299 | 0.0240 |
| 3 | 80, 120 | -12334 | -5860 | -5968 | 0.0162 | 0.0164 | 0.0204 | 3 | 120, 160 | -13115 | -6296 | -6163 | 0.0357 | 0.0397 | 0.0303 |
| 4 | 80, 120 | -11221 | -5276 | -5566 | 0.0051 | 0.0048 | 0.0087 | 4 | 120, 160 | -12250 | -5872 | -5934 | 0.0145 | 0.0162 | 0.0183 |
| 5 | 80, 120 | -12780 | -6221 | -6207 | 0.0243 | 0.0314 | 0.0305 | 5 | 120, 160 | -12293 | -5872 | -5984 | 0.0148 | 0.0155 | 0.0194 |
| 6 | 80, 120 | -12233 | -5837 | -5944 | 0.0136 | 0.0139 | 0.0172 | 6 | 120, 160 | -11240 | -5352 | -7991 | 0.0050 | 0.0052 | 1.0762 |
| 7 | 80, 120 | -11688 | -5663 | -5630 | 0.0077 | 0.0094 | 0.0088 | 7 | 120, 160 | -11703 | -5623 | -6004 | 0.0078 | 0.0086 | 0.0187 |
| 2 | 80, 120,160 | -12994 | -6144 | -6034 | 0.0321 | 0.0300 | 0.0241 | 5 | 80, 120, 160 | -12487 | -5953 | -6045 | 0.0178 | 0.0178 | 0.0215 |
| 3 | 80, 120, 160 | -11855 | -5641 | -5937 | 0.0099 | 0.0104 | 0.0189 | 6 | 80, 120, 160 | -12824 | -6193 | -6213 | 0.0244 | 0.0276 | 0.0288 |
| 4 | 80, 120, 160 | -11879 | -5605 | -5734 | 0.0099 | 0.0092 | 0.0120 | 7 | 80, 120, 160 | -12167 | -5758 | -5774 | 0.0122 | 0.0110 | 0.0113 |
Effect of structure parameters (number of nodes in the hidden layer, NN, and number and values of time delays) on the performance of the structure displayed in Fig. 1C. For every structure incorporating delays two random initial weight sets were investigated. For those without delays four different random initial weight changes were investigated. At least 25 iterations were carried out for each set of weights. The number of iterations was expanded if network learning was observed during the last iterations. Integration of the material balances along with the differential equations resulting from the sensitivity method for parameter identification is carried out for this simulation case with the dde23 MATLAB function for the studies with delays, and with the ode23 MATLAB function for the ones without delays. This results in higher simulation times, but as the dimension of the set of equations is rather small, the total simulation time is maintainable.
Figure 3Qualitative results on the time curse of TF-A. TF-A modelling results for the two runs of test data. On the left side the whole simulation region of the data set is shown while on the right side the most interesting section of the respective data set is highlighted: red circles are "measured" TF-A data over time, green line are the identification results by model structure (1C) with 5 hidden nodes and τ =120 minutes; blue dashed line are the identification results by model (1C) with 5 hidden nodes and no time delay.
Figure 4. (A) network with a quadratic distributed time delay kernel of cell growth and protein expression over methanol uptake. The respective equations are listed in Table 2. This network was used to generate simulation data (B) Approximation of network (A) by a hybrid network. Structure (B) was investigated to see if the novel framework is able to identify unknown distributed delay dynamics.
Mathematical model for data generation
| Reactor model equations: | |
|---|---|
Parameters and initial Values:
D ,-, (1/h); F ,-, (g/l); S,10, (g/l); K,0.1184, (1/h); K,4.7376, (g/mol); K,0.48, (-); K,0.0008, (1/h); K,10, (g/l); m,0.0015, (mol/(g.h)); P ,0, (mg/l); r,0.19, (1/h); S ,40, (g/l); S,1260, (g/l); t ,-, h ; V ,15, l ; W , W0 = S0 , (g/l); X ,1, (g/l); Z , Z0 = S0 , (g/l); τ,1, h ; β, h; μ ,-, (1/h);
Mathematical model of MUT+ Pichia pastoris expression with a quadratic distributed delay kernel. This model was used to generate six data sets. Three of which contain the clean, noise-free data and the other three the associated white noise corrupted data. One data set of the noise corrupted sets was used to train the hybrid model, one was used for validation and the third one for testing. Integration was performed with the ode45 MATLAB function which integrates the differential equation with a Runge-Kutta (4,5) integration scheme. The obtained state variables, namely concentrations of biomass, substrate and product, the reactor volume and as well the feed concentration are recorded and assumed as measured data for the evaluation. Variation in the data was obtained by application of varying initial values, i.e. the initial values were 5% Gaussian distributed. Note that model equations (A5 and A6) are derived from equation (A 12) using the linear chain trick [17,18] and that (A 12) is never used for model calculations.
Figure 5Impact of delays on the specific biomass growth rate. Green full line, is the specific growth rate when considering the network shown in Fig. 4A; blue dashed line is the specific growth rate when no delay between substrate uptake and biomass growth is considered.
Results for Case Study II
| NN | Nlag | BIC | MSE | NN | Nlag | BIC | MSE | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| train | valid | test | train | valid | test | train | valid | test | train | valid | test | ||||||
| 3 | 0 | 0 | -18561 | -5147 | -5000 | 0.0155 | 0.0187 | 0.0157 | 6 | 0 | 0 | -18500 | -5199 | -5030 | 0.0148 | 0.0187 | 0.0154 |
| 4 | 0 | 0 | -18514 | -5164 | -4994 | 0.0155 | 0.0187 | 0.0150 | 7 | 0 | 0 | -18576 | -5197 | -4990 | 0.0153 | 0.0180 | 0.0144 |
| 5 | 0 | 0 | -18531 | -5186 | -5011 | 0.0151 | 0.0189 | 0.0153 | 8 | 0 | 0 | -18430 | -5211 | -5053 | 0.0146 | 0.0182 | 0.0158 |
| 2 | 1 | 2 | -17144 | -4697 | -4475 | 0.0086 | 0.0098 | 0.0065 | 2 | 1 | 2.5 | -16343 | -4293 | -4331 | 0.0077 | 0.0076 | 0.0067 |
| 3 | 1 | 2 | -16981 | -4588 | -4470 | 0.0085 | 0.0108 | 0.0067 | 3 | 1 | 2.5 | -17230 | -4725 | -4584 | 0.0186 | 0.0327 | 0.0183 |
| 4 | 1 | 2 | -16877 | -4581 | -4484 | 0.0087 | 0.0117 | 0.0072 | 4 | 1 | 2.5 | -16635 | -4506 | -4690 | 0.0154 | 0.0220 | 0.0170 |
| 5 | 1 | 2 | -16792 | -4544 | -4490 | 0.0078 | 0.0093 | 0.0064 | 5 | 1 | 2.5 | -16927 | -4614 | -4524 | 0.0086 | 0.0111 | 0.0072 |
| 6 | 1 | 2 | -16911 | -4643 | -4544 | 0.0087 | 0.0119 | 0.0073 | 6 | 1 | 2.5 | -16944 | -4523 | -4488 | 0.0082 | 0.0092 | 0.0066 |
| 7 | 1 | 2 | -17197 | -4632 | -4505 | 0.0097 | 0.0151 | 0.0084 | 7 | 1 | 2.5 | -17044 | -4742 | -4608 | 0.0082 | 0.0107 | 0.0075 |
| 8 | 1 | 2 | -17181 | -4530 | -4496 | 0.0123 | 0.0155 | 0.0105 | 8 | 1 | 2.5 | -16976 | -4694 | -4608 | 0.0084 | 0.0110 | 0.0076 |
| 2 | 2 | 2 | -16466 | -4512 | -4525 | 0.0067 | 0.0079 | 0.0067 | 2 | 2 | 2.5 | -17813 | -4833 | -4759 | 0.0130 | 0.0152 | 0.0129 |
| 3 | 2 | 2 | -16856 | -4656 | -4535 | 0.0081 | 0.0116 | 0.0073 | 3 | 2 | 2.5 | -16637 | -4684 | -4595 | 0.0079 | 0.0123 | 0.0079 |
| 4 | 2 | 2 | -16788 | -4616 | -4525 | 0.0079 | 0.0111 | 0.0072 | 4 | 2 | 2.5 | -16703 | -4507 | -4517 | 0.0073 | 0.0083 | 0.0064 |
| 5 | 2 | 2 | -16734 | -4430 | -4446 | 0.0075 | 0.0088 | 0.0061 | 5 | 2 | 2.5 | -16384 | -4327 | -4404 | 0.0068 | 0.0074 | 0.0063 |
| 6 | 2 | 2 | -16573 | -4271 | -4353 | 0.0077 | 0.0081 | 0.0065 | 6 | 2 | 2.5 | -16601 | -4400 | -4432 | 0.0071 | 0.0079 | 0.0062 |
| 7 | 2 | 2 | -16704 | -4569 | -4541 | 0.0071 | 0.0089 | 0.0065 | 7 | 2 | 2.5 | -16569 | -4405 | -4466 | 0.0068 | 0.0072 | 0.0061 |
| 8 | 2 | 2 | -16921 | -4728 | -4632 | 0.0082 | 0.0132 | 0.0079 | 8 | 2 | 2.5 | -16833 | -4790 | -4664 | 0.0080 | 0.0127 | 0.0079 |
| 2 | 3 | 2 | -19006 | -5136 | -5080 | 0.0181 | 0.0177 | 0.0158 | 2 | 3 | 2.5 | -16619 | -4566 | -4514 | 0.0077 | 0.0099 | 0.0071 |
| 3 | 3 | 2 | -16811 | -4692 | -4549 | 0.0078 | 0.0119 | 0.0073 | 3 | 3 | 2.5 | -16037 | -4218 | -4259 | 0.0064 | 0.0072 | 0.0058 |
| 4 | 3 | 2 | -16737 | -4474 | -4466 | 0.0069 | 0.0089 | 0.0063 | 4 | 3 | 2.5 | -16439 | -4224 | -4287 | 0.0068 | 0.0078 | 0.0057 |
| 5 | 3 | 2 | -16519 | -4357 | -4408 | 0.0066 | 0.0076 | 0.0058 | 5 | 3 | 2.5 | -16199 | -4358 | -4295 | 0.0063 | 0.0076 | 0.0056 |
| 6 | 3 | 2 | -16832 | -4506 | -4415 | 0.0090 | 0.0107 | 0.0078 | 6 | 3 | 2.5 | -16604 | -4577 | -4556 | 0.0072 | 0.0094 | 0.0067 |
| 7 | 3 | 2 | -16565 | -4385 | -4439 | 0.0066 | 0.0072 | 0.0058 | 7 | 3 | 2.5 | -16344 | -4475 | -4432 | 0.0064 | 0.0078 | 0.0060 |
| 8 | 3 | 2 | -16758 | -4672 | -4569 | 0.0079 | 0.0117 | 0.0069 | 8 | 3 | 2.5 | -16471 | -4374 | -4505 | 0.0066 | 0.0069 | 0.0061 |
| 2 | 4 | 2 | -16655 | -4365 | -4504 | 0.0071 | 0.0107 | 0.0077 | 2 | 4 | 2.5 | -16562 | -4532 | -4503 | 0.0079 | 0.0097 | 0.0073 |
| 3 | 4 | 2 | -16377 | -4301 | -4431 | 0.0067 | 0.0078 | 0.0064 | 3 | 4 | 2.5 | -16325 | -4471 | -4470 | 0.0072 | 0.0086 | 0.0066 |
| 4 | 4 | 2 | -16215 | -4183 | -4316 | 0.0062 | 0.0067 | 0.0057 | 4 | 4 | 2.5 | -16261 | -4189 | -4281 | 0.0064 | 0.0068 | 0.0058 |
| 5 | 4 | 2 | -16611 | -4481 | -4484 | 0.0070 | 0.0101 | 0.0066 | 5 | 4 | 2.5 | -15954 | -4190 | -4255 | 0.0056 | 0.0060 | 0.0052 |
| 6 | 4 | 2 | -17503 | -4762 | -4597 | 0.0189 | 0.0374 | 0.0197 | 6 | 4 | 2.5 | -16439 | -4334 | -4476 | 0.0064 | 0.0073 | 0.0060 |
| 7 | 4 | 2 | -25835 | -7171 | -7226 | 10.160 | 9.2368 | 11.843 | 7 | 4 | 2.5 | -25949 | -7288 | -7280 | 13.781 | 19.040 | 17.824 |
| 8 | 4 | 2 | -16256 | -4328 | -4369 | 0.0058 | 0.0061 | 0.0052 | 8 | 4 | 2.5 | -16296 | -4326 | -4381 | 0.0061 | 0.0065 | 0.0054 |
Results of the performance measures, BIC and MSE, over structure parameters, namely Numbers of Nodes in the hidden layer of the ANN, NN, Number of time lags, Nlag and the time lag, τ, for Pichia pastoris cells with distributed time delays using the structure of Fig. 4B. Integration of material balances along with the equations obtained from the sensitivity method is carried out using the linear approximation integration schema described in the Methods section. The times series were chosen such that one of the delays matched the maximum of the time delay of the weighting function of the simulation case (see Eqs. A 12).
Figure 6Qualitative results for trajectories of concentrations. Pichia Pastoris distributed delay modelling result for a fed-batch of the test data set: red circles are "measured" data over time; green line are the identification results by model structure (4B) with 5 hidden nodes and a series of 4 time lagged variables of 2.5 hours; blue dashed line are the identification results by model (4B) with 7 hidden nodes and no time delay.