| Literature DB >> 29168076 |
Paul Kroll1,2, Alexandra Hofer1, Sophia Ulonska1, Julian Kager1, Christoph Herwig3,4.
Abstract
Model-based methods are increasingly used in all areas of biopharmaceutical process technology. They can be applied in the field of experimental design, process characterization, process design, monitoring and control. Benefits of these methods are lower experimental effort, process transparency, clear rationality behind decisions and increased process robustness. The possibility of applying methods adopted from different scientific domains accelerates this trend further. In addition, model-based methods can help to implement regulatory requirements as suggested by recent Quality by Design and validation initiatives. The aim of this review is to give an overview of the state of the art of model-based methods, their applications, further challenges and possible solutions in the biopharmaceutical process life cycle. Today, despite these advantages, the potential of model-based methods is still not fully exhausted in bioprocess technology. This is due to a lack of (i) acceptance of the users, (ii) user-friendly tools provided by existing methods, (iii) implementation in existing process control systems and (iv) clear workflows to set up specific process models. We propose that model-based methods be applied throughout the lifecycle of a biopharmaceutical process, starting with the set-up of a process model, which is used for monitoring and control of process parameters, and ending with continuous and iterative process improvement via data mining techniques.Entities:
Keywords: bioprocess; data mining; modelling; monitoring; optimization
Mesh:
Year: 2017 PMID: 29168076 PMCID: PMC5736780 DOI: 10.1007/s11095-017-2308-y
Source DB: PubMed Journal: Pharm Res ISSN: 0724-8741 Impact factor: 4.200
Fig. 1A simple control loop with the related four challenges (I-IV) of process development and the process lifecycle. Challenge I is the generation and storage of knowledge within models. Challenge II is the process monitoring. Challenge III is the determination of optimal process conditions for different applications and IV the continuous improvement of a process by data mining tools.
Fig. 2Systematic overview of a model-development including interlinks between data, database and datamining, information and necessary experiments and knowledge.
Summary of Applications and Novel Publications with Respect to Model-Based Experimental Design
| Method | Criteria | Application | Real-time | Reference |
|---|---|---|---|---|
| Application paper | ||||
| Signal to noise ratio | SNR = const | estimation of sampling points with respect to deviations on specific rates | at-line/ off-line | ( |
| Sequential experimental design | D-criteria | experimental design within a model discrimination workflow | at-line/ off-line | ( |
| Optimal dynamic experiments | – | MB-DoE in microbioreactor systems under use of FTIR spectroscopy as monitoring tool | at-line/ real-time | ( |
| Simultaneous solution Approach for MB-DoE | A, D & E - criteria | design of feed rates and adaptive optimal sampling strategy | at-line/ off-line | ( |
| CMB-DoE | A, D & E - criteria | adaption of a dynamic experiment under usage of real-time data control on information criteria | real-time | ( |
| Online optimal experimental re-design | A-criteria | adaption of a dynamic experiment under usage of real-time data control on information criteria | real-time | ( |
| Model discriminating experimental design | – | Model descrimination within an sequential workflow | at-line/ real-time | ( |
| Design criteria paper | ||||
D-optimal design DMOO design (multi objective optimization) | reduction of parameter interactions with MB-DoE under usage of a multi objective optimization criteria | at-line/ off-line | ( | |
| Multi objective approach | Multi-objective MB-DoE to descriminate between models and estimate kinetic parameters | at-line/ off-line | ( | |
| Anticorrelation criteria | anticorrelation criteria to estimate model parameters | at-line/ off-line | ( | |
Monitoring Solutions within Biotechnology
| Monitoring goal | Model scenario | Measurement scenario | Process system | Algorithm | Highlights | Ref. |
|---|---|---|---|---|---|---|
| Biomass growth | mass-balance with fixed stoichiometry | carbon in and outflow |
| SQP (sequential quadratic programming) | ( | |
| Biomass growth | mass-balance with variable stoichiometry | carbon and electron in and outflow |
| – | use of system redundancy | ( |
| Oxygen consumption | mass balance | offgas |
| – | simple and robust | ( |
| CO2 production | mass balance | offgas |
| – | carbonate buffered media | ( |
| Biomass concentration | kinetic model | sugar measurements |
| extended kalman filter | field of plant cells | ( |
| Substrates & biomass | kinetic model | CO2, sugars, product |
| extended kalman filter | NIR based online measurement | ( |
| Product & biomass | kinetic flux model | offgas analysis, product |
| particle filter | Raman based online measurements | ( |
| Biomass growth | kinetic model | offline & online offgas |
| extended kalman filter | account for measurement delay | ( |
| Biomass growth | kinetic model with energy balance | calorimetry |
| – | robust growth determination | ( |
Fig. 3Principle of model based monitoring with multiple measurements. Through the reconciliation of measured model outputs with current model simulations actual process states can be estimated by considering measurement and process uncertainty.
Summary
| Optimization goal | Optimization space / Constraints | Optimization variable | Optimized process / System | Algorithm | Remarks | References |
|---|---|---|---|---|---|---|
| Information content | ||||||
| Biomass concentration, conversion of PFAP | – | media components | Synechococcus | ANNSGA (artificial neural network supported genetic algorithm) | ANN | ( |
| Productivity - Offline | ||||||
| Maximal biomass productivity in minimum culture time | Constraints for feed, volume, culture time | Constant/ stair case / exponential feed rate parameter | Hybridoma cell fed-batch | fminsearchcon | Offline optimization | ( |
| Maximize amount of cells | Constraints for feeds and volume | Feed | Bakers yeast | Heuristic, analytical and numerical (adaptation of Jacobsons’s algorithm ( | ( | |
| Productivity - Online | ||||||
| Maximize productivity and yield in case of uncertainties | Volume, feed rate, operation time, amount of added substrate | Optimal feeding profile | Lysine production fed-batch | ACADO toolkit | Investigation of robust multi-objective optimal control | ( |
| Process profitability (costs of product and inducer) | – | Glucose and inducer concentration | E. Coli | Pontryagin’s maximum principle | Optimal control | ( |
| Maximize biohydrogen production | Constraints for feed, terminal region, culture time | Nutrient flow | Cyanobacteria fed-batch | IPOPT (after converting optimal control problem to nonlinear optimization problem with orthogonal collocation) | Simulation MPC with parameter estimation | ( |
| Maximize Productivity | Max volume | Feed | Steptomyces tendae | MPC | ( | |
| Robustness | ||||||
| Control glucose to a setpoint | – | Glucose feed rate profile | CHO fed-batch | SQP (sequential quadratic programming) | MPC | ( |
| Control consumed oxygen to a setpoint | – | Glutamine feed rate | CHO fed-batch | Simplex | MPC | ( |
Fig. 4Optimization space limited by technically and physiologically feasible space as well as by product and system rationales. The potential innovation space is the space where it can be increased e.g. by more knowledge about the system.
Fig. 5Workflow showing the data-driven knowledge discovery approach for the detection and minimization of disturbance variables. After selection of the targeted disturbance class via risk assessment tools, data has to be generated and/or accumulated. Indications about disturbing variables/ descriptors can be generated by correlation analysis or – if possible – via mechanistic modelling. Obtained knowledge/ information has to be implemented in the design space to allow minimization of the identified disturbances.
Various Methods are Available for the Data to Information Approach, which is applied for the Identification and Minimization of Disturbance Variables. The Most Common Ones are Stated here Including Information about Linearity, Advantages and Disadvantages as well as References to Literature
| Approach | Method | Advantages | Disadvantages | Output | Method literature | Application literature |
|---|---|---|---|---|---|---|
| Descriptive | PCA | • Orthogonal • Dimensionality reduction • Easily applicable • Provides overview of input matrix • Classification of data | • Difficult to interpret if more PCs are significant • no correlations with process response possible • linear | • Loadings ➔ describes the correlation between variables in an orthogonal manner • Scores ➔ shows grouping/ clustering/ patterns/ trends ➔ facilitates interpretation due to additional dimensionality reduction | ( | ( |
| Descriptive | Cluster analysis (CA) | • Classification of data • Multiple algorithms are available ➔ adaption to problem statement possible | • No dimensionality reduction ➔ complicates the identification of trends • Linear • No Correlation with process response | • Dendrogramm ➔ clusters can be seen and especially the distance between clusters can be analyzed | ( | |
| Descriptive and predictive | PLS-DA | • Dimensionality reduction • Prediction of group membership • Classification of data • Easily applicable | • Linear • Y-variable (i.e. class) has to be declared before analysis • Knowledge about method necessary (choice of threshold, PLS1 or PLS2) • Overfitting | • Scores ➔ shows grouping/ clustering/ patterns/ trends ➔ facilitates interpretation due to additional dimensionality reduction • Weights/ loadings ➔ relates classifier to underlying variable | ( | |
| Descriptive and predictive | OPLS-DA | • Orthogonal • see PLS-DA | • see PLS-DA | • see PLS-DA | ( | |
| Predictive | MLR | • Easily applicable • Correlation with process response | • not applicable for fingerprinting analysis (due to collinearities) • linear | • ANOVA validation • Coefficients with confidence intervals ➔ representing variables that correlate with process response | ( | |
| Predictive | PLS | • Dimensionality reduction • Correlation with process response • Variable ranking available • Easily applicable | • Not orthogonal • Correlations are assumed to be linear (only “quasi-nonlinear” algorithmic adaptations • available like Poly-PLS or Spline-PLS) • Small validity space • linear | • Observed • Coefficients with confidence intervals ➔ representing variables that correlate with process response | ( | ( |
| Predictive | PCR | • Dimensionality reduction • Easily applicable • Orthogonal • Correlation with process response | • Difficult to interpret if more PCs are significant • Correlations are assumed to be linear | • see PCA and MLR | ( | ( |
| Predictive | ANN | • Correlation with process response • Adaptive learning • Self-organization • Fault tolerance via redundant coding • Real-time operating ability • Easy insertion into existing technologies • non linear | • Mathematically demanding • difficult to implement for process development • iterative workflow • dependence of final result on initial parameters • tendency to overfitting • high training time and computational resources • non-uniqueness of final result | • Observed | ( | ( |
| Predictive | SVM | • see ANN • handling high dimensional input vectors | • see ANN | • see ANN | ( |