Peter Sinner1, Marlene Stiegler1, Oliver Goldbeck2, Gerd M Seibold3, Christoph Herwig1, Julian Kager1,4. 1. Research Unit of Biochemical Engineering, Institute of Chemical, Environmental and Bioscience Engineering, Technische Universität Wien, Vienna, Austria. 2. Institute of Microbiology and Biotechnology, University of Ulm, Ulm, Germany. 3. Section for Synthetic Biology, Department of Biotechnology and Biomedicine, Technical University of Denmark, Lyngby, Denmark. 4. Competence Center CHASE GmbH, Linz, Austria.
Abstract
Model-based state estimators enable online monitoring of bioprocesses and, thereby, quantitative process understanding during running operations. During prolonged continuous bioprocesses strain physiology is affected by selection pressure. This can cause time-variable metabolic capacities that lead to a considerable model-plant mismatch reducing monitoring performance if model parameters are not adapted accordingly. Variability of metabolic capacities therefore needs to be integrated in the in silico representation of a process using model-based monitoring approaches. To enable online monitoring of multiple concentrations as well as metabolic capacities during continuous bioprocessing of spent sulfite liquor with Corynebacterium glutamicum, this study presents a particle filtering framework that takes account of parametric variability. Physiological parameters are continuously adapted by Bayesian inference, using noninvasive off-gas measurements. Additional information on current parameter importance is derived from time-resolved sensitivity analysis. Experimental results show that the presented framework enables accurate online monitoring of long-term culture dynamics, whereas state estimation without parameter adaption failed to quantify substrate metabolization and growth capacities under conditions of high selection pressure. Online estimated metabolic capacities are further deployed for multiobjective optimization to identify time-variable optimal operating points. Thereby, the presented monitoring system forms a basis for adaptive control during continuous bioprocessing of lignocellulosic by-product streams.
Model-based state estimators enable online monitoring of bioprocesses and, thereby, quantitative process understanding during running operations. During prolonged continuous bioprocesses strain physiology is affected by selection pressure. This can cause time-variable metabolic capacities that lead to a considerable model-plant mismatch reducing monitoring performance if model parameters are not adapted accordingly. Variability of metabolic capacities therefore needs to be integrated in the in silico representation of a process using model-based monitoring approaches. To enable online monitoring of multiple concentrations as well as metabolic capacities during continuous bioprocessing of spent sulfite liquor with Corynebacterium glutamicum, this study presents a particle filtering framework that takes account of parametric variability. Physiological parameters are continuously adapted by Bayesian inference, using noninvasive off-gas measurements. Additional information on current parameter importance is derived from time-resolved sensitivity analysis. Experimental results show that the presented framework enables accurate online monitoring of long-term culture dynamics, whereas state estimation without parameter adaption failed to quantify substrate metabolization and growth capacities under conditions of high selection pressure. Online estimated metabolic capacities are further deployed for multiobjective optimization to identify time-variable optimal operating points. Thereby, the presented monitoring system forms a basis for adaptive control during continuous bioprocessing of lignocellulosic by-product streams.
Continuous bioprocessing can enable high space time yields under consistent production rates and product quality. Thereby, process intensification in comparison with regular batch or fed‐batch operations is feasible. Applications of continuous microbial bioprocesses are versatile, ranging from large scale ethanol production (Sahm & Bringer‐Meyer, 1987) to recent developments for recombinant protein production (Kittler et al., 2020; Schuller et al., 2020). Despite technological advantages such as reduced culture volumes and potential for continuous downstream integration (Gagnon et al., 2018), physiological instabilities still hamper the implementation of robust continuous processes (Kopp et al., 2019).To additionally improve sustainability of biotechnological processes, lignocellulosic by‐product streams such as spent sulfite liquor (SSL) can be utilized as alternative feedstocks. SSL is a complex mixture containing various hemicellulose hydrolysis products, with mannose, xylose, and glucose being the main carbohydrates accessible for bioprocessing. With quantities of around 90 billion liters of SSL being generated yearly during sulfite pulping of wood (Lawford & Rousseau, 1993) and due to its high carbohydrate content, SSL is an attractive substrate stream for industrial biotechnology. Continuous processes utilizing SSL were described for the production of various value‐added‐products, such as single cell protein (Gold et al., 1981), bioethanol (Mandenius, 1988; Vees et al., 2020), or succinic acid (Ladakis et al., 2018).In this study, a recombinant Corynebacterium glutamicum strain is used for continuous SSL bioprocessing. C. glutamicum is an important industrial producer of amino acids and can be genetically modified for a wide range of potential products, including chemical building blocks (Becker et al., 2018) as well as recombinant proteins (Hemmerich et al., 2019). Strain improvements also enabled hemicellulosic sugar conversion (Buschke et al., 2013; Choi et al., 2019; Meiswinkel et al., 2013), thus making C. glutamicum a promising new platform for the utilization of lignocellulosic by‐product streams.Bioprocessing of by‐product streams such as SSL remains challenging due to the presence of potential growth inhibitors, batch‐to‐batch raw material fluctuations and multiple carbon sources that can lead to reciprocal inhibition effects and therefore to substrate accumulation or inefficient utilization. During continuous processes, selection pressure on the culture introduces an additional element of complexity. Besides of process instabilities caused by the metabolic burden of recombinant gene expression (Hoffmann & Rinas, 2004; Neubauer et al., 2003; Schuller et al., 2020), the selection pressure in continuous operation can also lead to favorable biological adaptions (Wortel et al., 2016). Phenotype changes are caused by genetic (Arensdorf et al., 2002; Jansen et al., 2005; Wenger et al., 2011) and nongenetic (Herwig & von Stockar, 2002; Wright et al., 2020) adaptions and their regulatory implications have to be considered (Sato et al., 2016). Based on aforementioned effects, continuous cultivations in chemostat or turbidostat mode can be regarded as highly efficient systems for directed evolution and have therefore also been adopted for strain and protein engineering applications (Arensdorf et al., 2002; Badran & Liu, 2015; d'Oelsnitz & Ellington, 2018; Koppram et al., 2012). Long‐term adaption effects that change the metabolic capacities of the culture during running processes can lead to unplanned deviations from assumed steady‐state behavior. These deviations can challenge reproducibility of experiments (Wick et al., 2002) as well as stability of production processes (Kazemi Seresht et al., 2013) and limit the applicability of process models not considering parametric variability. Different models that directly simulate adaption during continuous cultivations were proposed for single carbon substrate processes (Bayen & Mairet, 2017; Koprinkova‐Hristova & Patarinska, 2006; Patarinska et al., 2000; Sonnleitner & Hahnemann, 1994). However, stochastic evolution can limit the transferability of such models (Wick et al., 2002). To operate at optimal dilution rates, guaranteeing high space time yields and efficient substrate utilization, changing metabolic capacities therefore need to be monitored throughout the process.Monitoring strategies using at‐line flow cytometry (Vees et al., 2020) as well as in situ online probes, such as turbidity sensors (Gold et al., 1981), were previously applied to characterize continuous bioprocesses utilizing SSL. The use of time‐consuming as well as costly at‐line measurements reduces the degree of process automation and hampers targeted control actions during running operations. Online monitoring by in‐line probes enables a high time resolution but in situ reactor modifications (Gold et al., 1981) or recirculation loops (Cabaneros Lopez et al., 2020a) are often necessary. Off‐gas analysis offers a promising noninvasive measurement for online monitoring (Aehle et al., 2011). Online measurements can be fused with previous process knowledge in the form of dynamic models to efficiently exploit their information content. For this purpose, state observers (Ali et al., 2015) are needed and were implemented for bioprocess applications using, for example, extended Kalman filters (Narayanan et al., 2020) or particle filters (Goffaux & Vande Wouwer, 2005; Stelzer et al., 2017). Both base on Bayesian inference, whereas the particle filter approximates the probability density function by single particle simulations and the Kalman filter directly by local linearization and assumed normal distributions (Simon, 2006). Although computationally more demanding the particle filter has shown good performance for biotechnological processes (Goffaux & Vande Wouwer, 2005; Stelzer et al., 2017). In recent publications (Cabaneros Lopez et al., 2020b; Sinner et al., 2021) it was shown how model‐based state estimation can be used for online monitoring during batch and fed‐batch operation of lignocellulosic by‐product streams. Hereby, state estimators were applied to handle different initial batch conditions or raw material uncertainties.This study introduces state estimation for continuous bioprocesses utilizing the multisubstrate stream SSL, which are subject to elevated selection pressure. Previous state estimators for continuous stirred‐tank reactors (CSTR) (diè ne Benyahia et al., 2012; Patarinska & Popova, 2003) focused on single carbon substrate processes only. Also, changes in the physiological capacity due to long‐term process dynamics were not considered in these previous works. To enable state estimation of continuous processes we included potential adaption mechanisms during continuous cultures growing on SSL. To do so, simultaneous state and model parameter estimation is necessary. A straightforward approach builds on state augmentation with random walk parameter dynamics as proposed by Kitagawa (1998). To the best of the authors' knowledge, simultaneous state and parameter estimation was previously not experimentally implemented for continuous bioprocesses subject to long‐term adaption effects. The presented strategy uses off‐gas measurements and is therefore noninvasive. Furthermore, estimator results are exploitable for setpoint optimization considering time‐variable metabolic capacities.The remainder of this paper is structured as follows. In Section 2 the experimental and mathematical methodologies are summarized. Obtained results for modeling and online state estimation are presented in Section 3 and discussed with a particular focus on adaption of metabolic capacities. Conclusions of the study are given in Section 4.
MATERIAL AND METHODS
Experimental set‐up and cultivation conditions
Cultivation experiments were performed with a derivative of C. glutamicum ATCC 13032 transformed with pEKEx3‐xylAB (Meiswinkel et al., 2013) (, pBL1 ) and pVWEx1‐manA (Schwentner et al., 2021) (, pHM1519 ). Xylose utilization is enabled by pEKEx3‐xylAB, containing the xylose isomerase gene xylA and the xylulokinase gene xylB. pVWEx1‐manA overexpresses the mannose‐6‐phosphate isomerase gene manA for improved mannose utilization. All batch and feeding media were supplemented with antibiotics ( kanamycin sulfate and spectinomycin dihydrochloride). The seed train was performed as described in Sinner et al. (2021). Bioreactor processes were inoculated to reach an initial optical density at 600 nm of 8 and were conducted with working volumes of 0.2 L in a DASbox Mini Bioreactor System (Eppendorf AG) equipped with dual Rushton turbines. CGXII medium (Keilhauer et al., 1993) without urea and MOPS buffer, supplemented with 10% (v/v) ultrafiltrated SSL (UF‐SSL) from softwood pulping and 1 mM IPTG, was used as batch medium. Temperature was kept at and culture pH was controlled at 6.5 by addition of 6 M ammonium hydroxide and 1 M phosphoric acid. Dissolved oxygen was controlled above 30% by a cascaded adaption of stirrer speed (400–1200 rpm) and oxygen concentration of the inlet gas (20.95% to 27.5% (v/v)) by supplementing pressurized air with pure oxygen at an overall gassing rate of 1 vvm. The chemostat phase was started when an optical density at 600 nm of 30 was reached by feeding CGXII medium without urea and MOPS buffer, supplemented with 20% (v/v) UF‐SSL and 1 mM IPTG. Dilution rate setpoints in the range of 0.013– were applied during chemostat experiments.
Offline analytics
For biomass dry cell weight determination, 1.8 ml culture broth were centrifuged at 10,000 rpm for 10 min at 4°C, the pellet was washed with phosphate buffered saline and was centrifuged again under identical conditions. The washed pellet was then dried in preweighed sample tubes at 105°C for 72 h before gravimetric analysis. Mannose, xylose and glucose concentrations in filtrated culture supernatants were quantified by HPLC‐RID using a lead column (Nucleogel Sugar Pb, Macherey Nagel). The column was operated at 80°C and deionized water was used as eluent at an isocratic flow rate of .
Online measurements and data analysis
Reactor parameters, gas and liquid inflow rates were retrieved online from the DASbox reactor system using the DASware control software (Eppendorf AG) (see Figure 1). and off‐gas concentrations were measured online using a DASGIP GA4 gas analyzer (Eppendorf AG) based on galvanic cells for and infrared sensor modules for determination. Online data transfer between the DASware control software and the numerical computing environment Matlab R2020a (The MathWorks, Inc.) was enabled by OPC DA. All subsequent data analysis was conducted in Matlab running on a conventional work station (Intel Core i5‐4670U CPU 3.40 GHz, 16 GB RAM). Using online off‐gas concentration measurements and inflow of air and pure oxygen, carbon dioxide evolution (CER) and oxygen uptake (OUR) rates were calculated according to Aehle et al. (2011). Hereby, an equilibrium between liquid and gaseous phase was assumed for constant pH and temperature during controlled bioreactor conditions (Aehle et al., 2011; Spérandio & Paul, 1997). Biomass specific growth and substrate uptake rates were derived from the offline analytics (Section 2.2) by solving the mass balance assuming a constant rate between two sampling time points as described by Kroll et al. (2017).
Figure 1
Experimental setup using a DASbox reactor system (Eppendorf AG). The measurements needed for particle filtering were transferred to MATLAB by OPC (Open Platform Communication). Based on the resulting estimated states and metabolic capacities the current optimal operating conditions can be re‐evaluated
Experimental setup using a DASbox reactor system (Eppendorf AG). The measurements needed for particle filtering were transferred to MATLAB by OPC (Open Platform Communication). Based on the resulting estimated states and metabolic capacities the current optimal operating conditions can be re‐evaluated
Nonlinear process model
The CSTR process model describes mass concentration balances of biomass X, glucose , mannose and xylose states () in the ordinary differential equation (ODE) system shown in Equation (1). The model builds on previous work of Sinner et al. (2021) and is extended with maintenance coefficents , which are especially relevant at low dilution rates.
represents the reactor volume (L) and () the volumetric flow rate giving the current dilution rate D () as . The respective carbon substrate concentrations () in the feed medium are denoted by . Biomass yield coefficients , specific growth rates , and substrate specific maintenance coefficients couple biomass formation with substrate consumption . The kinetics of biomass growth are described in Equations (2)–(4), including competitive substrate inhibition effects by glucose adapted from Yoon et al. (1977) and Sinner et al. (2021). Hereby, denotes the biomass specific maximum growth rate on substrate and the respective substrate affinity coefficient. The biomass specific maximum substrate uptake rates for biomass formation can be derived as .Model parametrization was performed on training data using identifiable parameter subsets (Brun et al., 2001; Ulonska et al., 2018). Model parameters (), including estimated parameters and model constants, are given in Table 1. Uncertainties of model simulations were derived by 100 Monte Carlo simulations with uniform random sampling from , using parameter uncertainties obtained from the inverse Fisher information matrix.
Table 1
Model parameters of the C. glutamicum chemostat process model
Parameter
Value
Standard deviation
Unit
Source
S1in
7.81
–
gL−1
Measured
S2in
23.96
–
gL−1
Measured
S3in
10.38
–
gL−1
Measured
MCmolS1,2,3
30.03
–
gCmol−1
Calculated
MCmolX
27.78
–
gCmol−1
Sinner et al. (2021)
DORO2
−4.00
–
mole−Cmol−1
Calculated
DORS1,2,3
+4.00
–
mole−Cmol−1
Calculated
DORX
+4.18
–
mole−Cmol−1
Sinner et al. (2021)
YXS1
0.576
±0.010
gg−1
Parameter estimation
YXS2
0.596
±0.011
gg−1
Parameter estimation
YXS3
0.545
±0.024
gg−1
Parameter estimation
μmaxS1
0.140
±0.004
h−1
Parameter estimation
μmaxS2
0.074
±0.001
h−1
Parameter estimation
μmaxS3
0.049
±0.001
h−1
Parameter estimation
KS1
2.2×10−3
±0.23 × 10−3
gL−1
Parameter estimation
KS2
0.094
±0.019
gL−1
Parameter estimation
KS3
0.046
±0.019
gL−1
Parameter estimation
mS1
4.3 × 10−3
±0.12 × 10−3
gg−1h−1
Parameter estimation
mS2
0.011
±0.14 × 10−3
gg−1h−1
Parameter estimation
mS3
3.8 × 10−3
±0.13 × 10−3
gg−1h−1
Parameter estimation
Model parameters of the C. glutamicum chemostat process modelCarbon dioxide evolution (CER) and oxygen uptake (OUR) rates () were modeled by carbon and redox balancing, see Equations (5) and (6), as described previously using a measured elemental composition of C. glutamicum growing on SSL (Sinner et al., 2021). Carbon balancing is hereby based on biomass formation and sugar consumption using the weight of a respective sugar compound per mole of carbon contained, , and the weight per mole of carbon of biomass, . Redox balancing is based on the degree of reduction (Minkevich & Eroshin, 1973) of biomass , the respective degree of reduction of the main carbon substrates glucose, mannose and xylose and the degree of reduction of oxygen . Degrees of reduction of organic compounds are obtained as the number of equivalents of electrons per C‐mol that are available upon oxidation. The degree of reduction of an element within a compound hereby corresponds to the valence of this element (). For oxygen as an electron acceptor. Parameters used for modeling are listed in Table 1. As outlined in Section 2.3 an equilibrium between liquid and gaseous phase can be assumed under constant pH and temperature conditions. This allows for a simplified simulation of gas phase dynamics, where an additional consideration of hold‐up in the liquid phase can be omitted.
Particle filter algorithm for simultaneous state and parameter estimation
For online state estimation, the ODE system of Equation (1) is presented as the system model in the general form of Equation (7). Hereby, the time derivative of process states is a function of the current states , the dynamic inputs u () and the model parameters and constants . To integrate the online off‐gas measurements for state estimation, a measurement model, Equation (8), is defined as a function of the current states and the model constants based on carbon and electron balances described in Equations (5) and (6).
The system model, Equation (7), and measurement model, Equation (8), were integrated in a bootstrap particle filter. The particle filter implementation is based on Gordon et al. (1993) and Simon (2006). It constitutes a sequential Monte Carlo algorithm that uses Bayesian interference to reconstruct the most likely current process states based on the measurments .For initialization, a random sample of particles {} is drawn from the assumed probability density function (pdf) at initial time point . To enable joint estimation of process states and parameters , the state space is augmented with a parameter vector (Kitagawa, 1998). Gaussian random walk with parameter noise variance is used to add artificial dynamics to the most sensitive subset (parameter importance ranking shown in Appendices) of kinetic parameters . Hereby 0.1 is a noise tuning factor, which is in the range of typical model parameter uncertainties as reported by Brun et al. (2001). This parameter perturbation, Equation (9), represents phenotypic plasticity during chemostat operation and is performed iteratively for each time step (calculation interval 40 min) by using the normally distributed perturbation vector . Hereby, the parameter values estimated in the previous time step, , are used.
Using the changed parameter values within system model equations , a priori particles (process states) are propagated under addition of process noise at each time step , Equation (10). Process noise levels of were applied as reported previously (Sinner et al., 2021).
The augmented state vector is denoted as in the following. Based on available online measurements , measurement variance matrix and measurement model function the relative likelihood is evaluated for each particle in Equation (11) (Simon, 2006). If one measurement type is currently out of specification limits, for example, during temporary technical malfunction, the estimator only considers the remaining measurement signals.
The relative likelihoods are then normalized (Simon, 2006):
Subsequently, multinomial resampling from gives a posteriori particles . The obtained particles are distributed according to the desired pdf . Particle means are the most likely estimation outputs for the current process states and the evolved parameters . Online biomass specific growth and substrate uptake rates are derived from the a posteriori particles. To assess estimator accuracy, the root mean square error (RMSE) between offline measured and online estimated states was calculated. Normalization by the range of offline measured states in the given data set gives the normalized root mean square error ().
Online parametric sensitivity analysis
Parametric sensitivity analysis was performed locally at each calculation time point of the state estimator (Section 2.5). Sensitivity of the kinetic model parameters regarding different process states can be analyzed. Within this system, the local parametric sensitivities on the measured outputs CER and OUR (Equations 5 and 6) were determined. As only low and steady parameter changes are expected a local analysis can be regarded as satisfactory for this application. First‐order partial derivatives of measurement model with respect to kinetic parameters give the sensitivity matrix (, for process states and kinetic parameters) at time step using the latest state estimates and parameter estimates :
The elements of sensitivity matrix were normalized with the latest parameter estimate and online measurement value .
The absolute values of the rows of the resulting normalized sensitivity matrix were then summed up to conjoin the sensitivities regarding the different process states (here for CER and OUR) at time point . This gives the sensitivity vector .
Multiobjective optimization
To analyze the process behavior of the chemostat for variable metabolic capacities, multiobjective optimization was performed using a controlled, elitist genetic algorithm (gamultiobj, Matlab R2020a, population size 50, Pareto fraction 0.35, stall generation limit 100, generation limit 1000) based on NSGA‐II (Deb et al., 2002). Dilution rate D was varied as process parameter ( lower bound, upper bound) to obtain Pareto‐optimal solutions maximizing volumetric productivity for biomass (Objective 1) and minimizing metabolizable sugar loss in the effluent (Objective 2). Online parameter estimates (Section 2.5) obtained after different process times were hereby used. Objective functions were scaled with the maximum volumetric biomass productivity for the initial parameter set and the substrate concentrations in the feed medium:
RESULTS AND DISCUSSION
Dynamic model for SSL utilization in continuous operation
As a basis for model‐based process monitoring, the SSL utilization kinetics of C. glutamicum were characterized using chemostat training data including an initial batch process phase and transient behavior upon dilution rate changes (Figure 2). Dilution rate in the chemostat phase was kept at with a short phase (10 h, 3.7 residence times) with higher dilution rate of at about 230 h process time. This short increase resulted in an accumulation of unmetabolized substrate in the effluent stream and washout of biomass before the system returned to steady state operation at low dilution rate.
Figure 2
Measured (black, circles) and simulated (blue, lines) concentrations of (a) biomass ( RMSE model error), (b) glucose ( RMSE model error), (c) mannose ( RMSE model error) and (d) xylose ( RMSE model error) for chemostat training data (Data set 1). The applied dilution rates (grey, line) are shown in (a). Error bars and shaded areas indicate 95% confidence intervals. RMSE, root mean square error
Measured (black, circles) and simulated (blue, lines) concentrations of (a) biomass ( RMSE model error), (b) glucose ( RMSE model error), (c) mannose ( RMSE model error) and (d) xylose ( RMSE model error) for chemostat training data (Data set 1). The applied dilution rates (grey, line) are shown in (a). Error bars and shaded areas indicate 95% confidence intervals. RMSE, root mean square errorThe kinetic process model described in Section 2.4 was parameterized on this training data set and the obtained model parameters are shown in Table 1. Low parametric uncertainty with coefficients of variation below 5% for all parameters besides of xylose affinity coefficient (42%) was observed. Comparing experimental and simulated target states, low root mean square errors (RMSE) for biomass (), glucose (), mannose (), xylose () as well as CER () and OUR () states were achieved. This corresponds to normalized root mean square errors (NRMSE) below the acceptance criterion of 15% (Ulonska et al., 2018) for all target states of sugar uptake and biomass formation for C. glutamicum utilizing SSL in a continuous bioprocess.Although a high dilution rate corresponds to an elevated selection pressure with potential adaption effects, the resulting model with constant parameters, such as maximum substrate specific growth rates and substrate affinities over time, did not indicate changing metabolic capacities after the short phase of high dilution rate in the training data set. As the process dynamics including the batch startup phase and the transient behavior between different dilution rates are well described by the model, it can be used as basis for online state estimation.
Online state estimation assuming constant metabolic capacities
The parameterized model (see Section 2.4 Equations 1–4 and Table 1) was used for online state estimation under usage of a particle filter algorithm (see Section 2.5) including noninvasive off‐gas analysis as online measurements. The state estimator was applied on two C. glutamicum chemostat processes (Table 2). One process was operated at a constant low dilution rate of (Data set 2) whereas the second process was run with stepwise increasing dilution rates in the range of to (Data set 3). For the assumption of constant metabolic capacities over time model parameter adaption was turned off (), resulting in a standard particle filter that only incorporates process and measurement noise.
Table 2
Overview of used data sets and corresponding estimation errors for biomass, glucose, mannose, and xylose (RMSE and ) as well as critical dilution rates ()
Usage
Estimation error RMSE gL−1 (NRMSE [%])
Estimated Dcrit
Data set
Dilution rate D [h−1]
Calibration
Validation
Biomass
Glucose
Mannose
Xylose
h−1
1
0.043 (spike with 0.27)
•
1.7a (10.6)
0.31a (9.6)
1.6a (8.4)
0.37a (6.7)
0.23a (const)
2
0.024
•
0.71b (32.3)
0.020b (46.2)
0.27b (47.3)
0.20b (56.5)
0.23b (const)
3
0.077–0.31
•
6.1b (46.7)
1.8b (1090.1)
7.9b (47.9)
1.0b (9.6)
0.23b (const)
3
0.077–0.31
•
1.4c (10.3)
0.037c (23.1)
0.73c (4.4)
1.5c (14.0)
0.23–0.32c
aModel calibration.
bStandard state estimation with constant model parameters.
cState estimation with model parameter adaption.
Overview of used data sets and corresponding estimation errors for biomass, glucose, mannose, and xylose (RMSE and ) as well as critical dilution rates ()aModel calibration.bStandard state estimation with constant model parameters.cState estimation with model parameter adaption.Based on the alignment of the model outputs of the propagated particels to the off‐gas measurements, state estimates of biomass, glucose, mannose, and xylose concentrations of Data set 2 & 3 (blue line, Figures 3 and 4) were derived by Bayesian inference. The alignment between off‐gas measurements (black line) and filtered model outputs (blue line) is displayed in Figure 5. With RMSE errors for biomass concentration below and below for glucose, mannose, and xylose substrate states, accurate online state estimation was achieved for Data set 2 without the need of any additional at‐line analytics. The filter was also applied to the second validation process (Data set 3) with a stepwise increase of dilution rates and thus selection pressure. The results are displayed with a blue line in Figure 4. When applying this conventional particle filter with constant metabolic capacities (no model parameter adaption), the process behavior is only inadequately estimated (blue line, Figure 4). This is also reflected in the insufficient alignment of off‐gas measurements and filtered model outputs (blue line, Figure 5). During late process phases with increased dilution rates, an approach to washout equilibrium is wrongly predicted by the state estimator, leading to highly inaccurate biomass and substrate estimates. The wrongly estimated accumulation of glucose, starting from app. 200 h process time, leads to strong (simulated) reciprocal substrate uptake inhibition effects and therefore to an increasing deviation between online estimated and actual culture behavior. High RMSE errors above for biomass, for glucose and for mannose concentrations are therefore obtained.
Figure 3
Online monitoring performance during validation chemostat run (Data set 2) with dilution rate (a) of (grey, line). Offline measured validation data (black, circles) and online estimated (a) biomass, (b) glucose, (c) mannose and (d) xylose concentrations are shown for particle filtering with constant metabolic capacity (blue, dotted line). Error bars and shaded areas indicate 95% confidence intervals
Figure 4
Online monitoring performance during chemostat run (Data set 3) with increasing dilution rates (a) (grey, line). Offline measured validation data (black, circles) and online estimated (a) biomass, (b) glucose, (c) mannose, and (d) xylose concentrations are shown for particle filtering with constant metabolic capacity (blue, dotted line) and with parameter adaption (orange, dotted line). Error bars and shaded areas indicate 95% confidence intervals
Figure 5
Alignment of online measurements (black, dotted line) and corresponding filtered measurement estimates using a particle filter algorithm with constant (blue, dotted line) or adapted model parameters (orange, dotted line). (a) Carbon dioxide evolution rate (CER) and (b) oxygen uptake rate (OUR) dynamics during a chemostat validation run with constant dilution rate (Data set 2). (c) CER and (d) OUR dynamics during a chemostat validation run with increasing dilution rate setpoints (Data set 3). Shaded areas indicate 95% confidence intervals. During temporary sensor malfunction only the remaining OUR signal is considered during filtering (between 120 and 137 h)
Online monitoring performance during validation chemostat run (Data set 2) with dilution rate (a) of (grey, line). Offline measured validation data (black, circles) and online estimated (a) biomass, (b) glucose, (c) mannose and (d) xylose concentrations are shown for particle filtering with constant metabolic capacity (blue, dotted line). Error bars and shaded areas indicate 95% confidence intervalsOnline monitoring performance during chemostat run (Data set 3) with increasing dilution rates (a) (grey, line). Offline measured validation data (black, circles) and online estimated (a) biomass, (b) glucose, (c) mannose, and (d) xylose concentrations are shown for particle filtering with constant metabolic capacity (blue, dotted line) and with parameter adaption (orange, dotted line). Error bars and shaded areas indicate 95% confidence intervalsAlignment of online measurements (black, dotted line) and corresponding filtered measurement estimates using a particle filter algorithm with constant (blue, dotted line) or adapted model parameters (orange, dotted line). (a) Carbon dioxide evolution rate (CER) and (b) oxygen uptake rate (OUR) dynamics during a chemostat validation run with constant dilution rate (Data set 2). (c) CER and (d) OUR dynamics during a chemostat validation run with increasing dilution rate setpoints (Data set 3). Shaded areas indicate 95% confidence intervals. During temporary sensor malfunction only the remaining OUR signal is considered during filtering (between 120 and 137 h)Although a standard particle filter implementation based on Simon (2006) could be transferred from a recently published fed‐batch application (Sinner et al., 2021) to a continuous process mode, a model and filter using static parameters can show limited applicability. This is especially relevant for prolonged continuous processes subject to high selection pressure due to high or increasing dilution rates over time.Since the model includes reciprocal sugar interaction a slight change in uptake capacities and affinities, reflected in the model parameters and , greatly affects the process behavior at higher dilution rates. Under these conditions, a small overestimation of equilibrium substrate concentrations strongly influences overall growth. A standard state estimator using constant model parameters can therefore underestimate metabolic capacities and thus the current optimal dilution rate. This will lead to CSTR operation in suboptimal regions.
Online state estimation with adaptive metabolic capacities
To include state and parametric uncertainty a particle filter was used as described in Section 2.5. Hereby, selected model parameters were integrated by state augmentation. Model parameters were adapted with with random walk dynamics () in the particle propagation step. To prevent collinear behavior, for example, within the Monod terms, only the kinetic parameters were selected as adaptive, while all other model parameters (see Table 1) were assumed to be constant. The aim of the presented online monitoring system is the accurate estimation of process states and overall metabolic capacities with relevance for process engineering. In case online reidentification of the entire parameter set is targeted, additional measurements must be included to ensure identifiability. An identifiability analysis of the model is shown in the supplementary file.The estimation results of the particle filter with model parameter adaption are shown in Figure 4 (orange line). For the region with lower dilution rate () until about 100 h process time both filtering strategies (blue line for constant metabolic capacity, orange line for adaptive metabolic capacities) show similar performance. During the phase of stepwise increasing dilution rates the different filter behaviors can be clearly seen when comparing the off‐gas measurement alignments (blue vs. orange line, Figure 5c,d). Whereas the particle filter with constant model parameters fails, the adaptive filter enables accurate online monitoring of concentration states throughout the entire CSTR process. Using the particle filter with parameter adaption (see Figure 4), estimation accuracy (RMSE) of concentration states (see Table 2) was improved by 77% for biomass, 98% for glucose and 91% for mannose in comparison to a particle filter with constant model parameters. Estimation accuracy of xylose, the substrate of lowest importance for the given process (see Section 3.4), was reduced by 50%. Low RMSE errors on biomass of and below for the substrates were achieved. This demonstrates that a particle filter with model parameter adaption is capable to monitor long‐term chemostat dynamics under conditions of increasing selection pressure during utilization of SSL with C. glutamicum. With a runtime of 2 min 30 s for every calculation step the algorithm can be run on conventional work stations (see Section 2.3) and necessary gas analytics can be integrated in the exhaust gas stream of a CSTR without the need of reactor modifications for in situ sensor placement. The time‐resolved behavior of adapted model parameters and its physiological relevance is outlined in the following section.Online parameter adaption and parametric sensitivity analysis. Maximum specific growth rates on (a) glucose, (b) mannose, and (c) xylose are shown for a particle filter with constant model parameters (blue, dashed lines) or parallel state and parameter estimation (orange, solid lines). Blue shaded areas indicate 95% confidence intervals of model parameters as obtained from calibration Data set 1. Online local sensitivities (normalized) of maximum specific growth rate parameters for (d) glucose, (e) mannose, or (f) xylose utilization are shown as a measure for current selection pressure regarding the respective carbon substrate
Online sensitivity analysis to gain insights into selection pressure and potential physiological adaption
The maximum growth capacities on the three main sugars ( on glucose, mannose, and xylose) are shown in Figure 6 over time. The dotted line displays constant model parameters as derived from model parametrization on calibration data (Data set 1) whereas the solid line displays the online adapted parameters during the second validation process (Data set 3). These maximum growth rates constitute kinetic model parameters and can be directly linked to the uptake capacities via the biomass to substrate conversion coefficients () of three sugars. In addition, the respective time‐resolved sensitivities of the three parameters on the measured gas conversion rates (CER and OUR) are displayed in Figure 6.
Figure 6
Online parameter adaption and parametric sensitivity analysis. Maximum specific growth rates on (a) glucose, (b) mannose, and (c) xylose are shown for a particle filter with constant model parameters (blue, dashed lines) or parallel state and parameter estimation (orange, solid lines). Blue shaded areas indicate 95% confidence intervals of model parameters as obtained from calibration Data set 1. Online local sensitivities (normalized) of maximum specific growth rate parameters for (d) glucose, (e) mannose, or (f) xylose utilization are shown as a measure for current selection pressure regarding the respective carbon substrate
From the estimated parameter trajectory it can be seen that mainly utilization capacities for mannose (74% increase) and also for glucose (26% increase) improved, whereas xylose uptake remained largely unaffected (8% increase). Changes of selection pressure on utilization capacity of the respective substrates can be quantified by the parametric sensitivities, which show a process phase‐dependent behavior. The capacity for growth on mannose, being the main carbon substrate consumed in the SSL sugar mixture, shows highest overall sensitivities and is therefore the first to be adapted. At very high dilution rates towards the end of the process, glucose inflow approaches the current consumption capacity. Commencing glucose accumulation leads to reciprocal sugar inhibition in the given macroscopic kinetic model and thereby leads to an increased selection pressure regarding glucose consumption resulting in the increase of (Figure 6 from 200 h onward). Increased relevance of transcriptional control was identified previously under glucose‐rich growth conditions (Graf et al., 2020) and may underlie the culture behavior observed at elevated dilution rates with substrate accumulation.As xylose is not a native substrate for C. glutamicum, the pathway for xylose metabolization was artificially added by heterologous expression of xylA and xylB (Meiswinkel et al., 2013) and therefore lacking natural regulatory mechanisms. Xylose is taken up via the myo‐inositol/proton symporter IolT1. The transporter gene is constitutively expressed in glucose‐limited chemostat cultivation and reduced expression was observed under glucose‐rich conditions (Graf et al., 2020). Expression is however not directly linked to xylose availability, indicating that it is not one of its natural substrates (Brüsseler et al., 2018). Also, xylose has been described to show poor performance as carbon source (Buschke et al., 2013) and can therefore be regarded as a minor target for adaption effects. While glucose and mannose have their entry point in glycolysis, xylose directly enters the non‐oxidative part of the pentose phosphate pathway (PPP). Accordingly, cells cultivated on xylose alone show a high flux through the non‐oxidative part of the PPP and need to compensate the lower NADPH output from the oxidative PPP with a higher flux through the TCA cycle (Buschke et al., 2013). Although mannose consumption can be increased by overexpression of the mannose‐6‐phosphate isomerase gene manA, strains still show a clear preference for glucose (Sasaki et al., 2011). Glucose uptake in C. glutamicum is mainly catalyzed by the glucose specific phosphoenolpyruvate:sugar phosphotransferase system (PTS) PtsG and to a lesser extent via the IolT1 and IolT2 symporters (Lindner et al., 2011). Mannose, an epimer of glucose, is also transported by PtsG but can also be taken up with less efficiency via the fructose‐specific PtsF (Sasaki et al., 2011). Hence, both mannose and xylose compete for their major uptake system with glucose, i.e. PtsG and IolT1, which is also reflected in the process model structure, see Eqs. (3) and (4). Given a higher affinity of these transporters to glucose, as previously described for glucose and glucosamin uptake via PtsG (Uhde et al., 2013), higher extracellular glucose availability negatively affects uptake of mannose and xylose.Such reciprocal sugar interaction can lead to high selection pressure and thereby to physiological adaption as observed by the online monitoring system of this study. From a biochemical viewpoint, an increased expression of ptsG and ptsF, encoding for the glucose‐ and fructose‐specific PTS transporters respectively, could lead to a better co‐consumption of both sugars (Sasaki et al., 2011). With an increased abundance of PtsF mannose could bypass competition for PtsG with glucose. Also amino acid substitutions in PTS transporters can improve substrate uptake capacity, as recently reported for evolved fructose utilization by Krahn et al. (2021). Another potential target of genetic adaptations is transcriptional regulation (Engels et al., 2008; Gaigalat et al., 2007; Radek et al., 2017). Altered gene expression levels of, for example, sugar transporter genes have to be confirmed by RT‐qPCR or global gene expression analyses. Changes in plasmid copy number (Prell et al., 2021) can be a further hypothetical explanation for the observed time‐variable metabolic capacities under high selection pressure. Based on the recent work of Prell et al. (2021) plasmid copy numbers can be estimated to be in the approximate range of 100–130 for pEKEx3 and 10–20 for pVWEx1. From both a physiological (Sasaki et al., 2011) and process technological perspective (Sinner et al., 2019), mannose consumption can be regarded as the main bottleneck, causing sugar loss to the effluent and limiting biomass space time yield of the analyzed continuous process utilizing the mannose‐rich by‐product stream SSL. This is also reflected by the high sensitivities regarding mannose growth capacity on the measured process outputs. This high selection pressure towards mannose utilization leads to the observed steady increase of the mannose related parameter .To further analyze and clarify the biochemical basis of observed metabolic capacity changes, plasmid copy number determination, gene expression profiling and whole genome sequencing will be tackled in future work. In addition to that, the model structure can be potentially extended by intracellular resource allocation (Kremling et al., 2018) to describe the phenomenon of a reduced xylose uptake in favor of more cost‐efficient substrate utilization pathways.
Ensuring optimal operation under changing metabolic capacities
Online estimated states and parameters (Section 3.3) are essential for reliable process monitoring but can also be further used to predict optimal as well as critical operating conditions. In continuous cultivations the critical dilution rate represents the maximum dilution rate that can be applied before biomass washout occurs, and can thereby be regarded as the overall maximum metabolic conversion capacity of a chemostat.Based on online adapted parameters the most probable current critical dilution rate can be obtained as shown for the second validation run (Data set 3) in Figure 7 (orange solid line). This capacity threshold is gradually increasing from to when dilution rate setpoints, and thereby selection pressure, are increased from app. 100 h process time onward. In contrast, critical dilution rate remains constant for a particle filter without parameter adaption (bluedashed line). During online monitoring of the biomass formation rate of the chemostat, this results in incorrectly low estimated specific growth rates (blue dotted line) in comparison to offline reference measurements (black solid line) when the process exceeds the static capacity threshold. When a particle filter with parameter adaption is used instead, specific growth rates are monitored with satisfactory accuracy (orange dotted line). This underlines that the monitoring system considering long‐term adaption effects can be successfully applied in operational regions exceeding the initially assumed capacity boundaries.
Figure 7
Online monitoring of specific growth rates and overall metabolic capacity (critical dilution rate) during a validation chemostat run with increasing dilution rate setpoints (Data set 3). Offline measured specific growth rate (black, solid line) and online monitored specific growth rate using particle filtering with constant metabolic capacity (blue, dotted line) or, respectively, with parallel state and parameter estimation (orange, dotted line). Shaded areas indicate 95% confidence intervals. Critical dilution rates are shown for constant (blue, dashed line) and online adapted metabolic capacity (orange, solid line)
Online monitoring of specific growth rates and overall metabolic capacity (critical dilution rate) during a validation chemostat run with increasing dilution rate setpoints (Data set 3). Offline measured specific growth rate (black, solid line) and online monitored specific growth rate using particle filtering with constant metabolic capacity (blue, dotted line) or, respectively, with parallel state and parameter estimation (orange, dotted line). Shaded areas indicate 95% confidence intervals. Critical dilution rates are shown for constant (blue, dashed line) and online adapted metabolic capacity (orange, solid line)In addition, optimal operating points can be determined. In this study, high volumetric biomass productivity in combination with high sugar conversion efficiency (sugar loss in the effluent below 10%) were targeted. Based on multiobjective optimization, Pareto‐optimal dilution rates for different process time points (Data set 3) were computed and displayed as circles within the phase diagrams in Figure 8. The first graph shows the relation between biomass productivity and conversion efficiency given in % sugar loss. The second and third graph display the two optimization criteria in function of the dilution rate setpoint. Due to adapted metabolic capacities, reflected in changing model parameters estimated by the particle filter, the system behavior of the process (solid lines) changes in course of the chemostat process. Selected optimal dilution rate setpoints with maximum biomass productivity satisfying the predefined sugar loss threshold are then selected (blue filled circles).
Figure 8
Maintenance of input‐output relationships of a chemostat operated at increasing dilution rates (Data set 3, at h, at h, at h, at h, at h, see Figure 4) based on online estimated parameters (see Figure 6). Simulated steady state behavior is shown for all operational regions (solid lines) and for Pareto‐optimal solutions (circles) maximizing biomass space time yield and minimizing substrate loss. Defining a maximum substrate loss threshold of 10% (dashed line), optimal setpoints are selected (blue filled circles). (a) Interdependence between biomass space time yield and residual substrate lost to the effluent. (b) Biomass space time yield in dependence of dilutionrate setpoint. (c) Substrate loss in dependence of dilutionrate setpoint
Maintenance of input‐output relationships of a chemostat operated at increasing dilution rates (Data set 3, at h, at h, at h, at h, at h, see Figure 4) based on online estimated parameters (see Figure 6). Simulated steady state behavior is shown for all operational regions (solid lines) and for Pareto‐optimal solutions (circles) maximizing biomass space time yield and minimizing substrate loss. Defining a maximum substrate loss threshold of 10% (dashed line), optimal setpoints are selected (blue filled circles). (a) Interdependence between biomass space time yield and residual substrate lost to the effluent. (b) Biomass space time yield in dependence of dilutionrate setpoint. (c) Substrate loss in dependence of dilutionrate setpointIn function of the long‐term adaption effects of the culture, washout boundary conditions shifted and the optimal dilution rate almost doubled from at process startup to after 235 h process time. Biomass space time yield substantially improved from to . Also the functional relation between the two (competing) objectives changed. For instance, an increase of dilution rate during the initial process phase would only slightly increase biomass productivity at the cost of high substrate loss (see grey circles at in Figure 8). In contrast, the adapted culture at later process phases shows a significantly reduced trade‐off between biomass productivity and substrate loss (see also curvature changes in Figure 8a).The presented time‐dependent exploration of process behavior enables a re‐evaluation of operational limits and optimal operating points and can thereby support human operator decision making. By combining feedback control algorithms with the developed online monitoring system, C. glutamicum chemostat processes utilizing SSL could be optimally controlled under consideration of time‐variable metabolic capacities. In pharmaceutical production, the re‐evaluation of process input–output relationships to provide assurance of product quality is termed as design space maintenance, as shown for simulated roller compaction and penicillin production processes by Bano et al. (2019). Similarly, this study shows how simultaneous state and parameter estimation can be used to re‐evaluate the input–output relationship during running continuous bioprocesses also considering physiological adaptation effects.
CONCLUSIONS
Within this study state estimation under changing metabolic capacities is presented and applied to continuous C. glutamicum cultivation processes utilizing the lignocellulosic by‐product stream SSL, a complex sugar mixture mainly containing mannose, glucose and xylose. During prolonged chemostat processes with increasing dilution rates physiological adaption occurred, leading to high state estimation errors and wrong prediction of biomass washout under usage of static model parameters obtained from calibration data. To enable accurate monitoring under changing metabolic capacities a particle filter for simultaneous state and parameter estimation was established. Hereby, noninvasive off‐gas analysis was used as online measurement and sugar utilization parameters were adapted during particle filtering. The results for a verification data set with increasing dilution rates show that integrating parametric variability enabled accurate estimation of biomass formation (RMSE of ) and the utilization of carbon substrates (RMSE below ), outperforming regular filtering assuming constant metabolic capacities. Analysis of the parameter trajectories and their sensitivities revealed that especially mannose consumption, being the main sugar in SSL, was improved over time. Only at dilution rates close to maximal metabolic capacities the selection pressure shifted towards the preferred glucose. Heterologously introduced and energetically suboptimal xylose consumption remained largely unchanged.For practical applications of CSTR process engineering, quantitative information on the current metabolic capacities is of crucial importance. Having these available online, operational parameters such as dilution rate can be adjusted to ensure optimal efficiency of substrate utilization and volumetric productivity. As only noninvasive off‐gas measurements are needed, the presented PAT system can be regarded as an add‐on monitoring tool to retrofit existing biorefinery processes utilizing lignocellulosic by‐product streams. Besides future investigation on the observed culture adaption by genomic and transcriptomic analysis, the developed state estimator needs to be employed for online feedback control. Thereby, model‐based state and parameter estimation considering long‐term adaption effects has the potential to further promote the usage of alternative feedstocks and to establish more efficient continuous cultivation processes.
AUTHOR CONTRIBUTIONS
Peter Sinner, Christoph Herwig, and Julian Kager conceived and designed the study. Oliver Goldbeck and Gerd M. Seibold performed strain design and construction. Peter Sinner, Marlene Stiegler, and Julian Kager performed the chemostat experiments. Peter Sinner analyzed the data and developed the software. Peter Sinner, Julian Kager, and Oliver Goldbeck wrote the manuscript. All authors reviewed, edited, and approved the final manuscript.
Authors: Mathias Aehle; Artur Kuprijanov; Sebastian Schaepe; Rimvydas Simutis; Andreas Lübbert Journal: Biotechnol Lett Date: 2011-07-09 Impact factor: 2.461
Authors: Joseph J Arensdorf; A Katrina Loomis; Philip M DiGrazia; Daniel J Monticello; Philip T Pienkos Journal: Appl Environ Microbiol Date: 2002-02 Impact factor: 4.792
Authors: Sophia Ulonska; Paul Kroll; Jens Fricke; Christoph Clemens; Raphael Voges; Markus M Müller; Christoph Herwig Journal: Biotechnol J Date: 2017-12-08 Impact factor: 4.677
Authors: Pau Cabaneros Lopez; Isuru A Udugama; Sune T Thomsen; Christian Roslander; Helena Junicke; Miguel M Iglesias; Krist V Gernaey Journal: Biotechnol Bioeng Date: 2020-10-15 Impact factor: 4.530