| Literature DB >> 33638725 |
Daniel Rodriguez-Granrose1,2, Amanda Jones3, Hannah Loftus3, Terry Tandeski3, Will Heaton3, Kevin T Foley3,4,5, Lara Silverman3,4.
Abstract
Modern bioprocess development employs statistically optimized design of experiments (DOE) and regression modeling to find optimal bioprocess set points. Using modeling software, such as JMP Pro, it is possible to leverage artificial neural networks (ANNs) to improve model accuracy beyond the capabilities of regression models. Herein, we bridge the gap between a DOE skill set and a machine learning skill set by demonstrating a novel use of DOE to systematically create and evaluate ANN architecture using JMP Pro software. Additionally, we run a mammalian cell culture process at historical, one factor at a time, standard least squares regression, and ANN-derived set points. This case study demonstrates the significant differences between one factor at a time bioprocess development, DOE bioprocess development and the relative power of linear regression versus an ANN-DOE hybrid modeling approach.Entities:
Keywords: Artificial neural network; Bioprocess; Design of experiments (DOE); MachineLearning; Process modeling
Mesh:
Year: 2021 PMID: 33638725 PMCID: PMC8144078 DOI: 10.1007/s00449-021-02529-3
Source DB: PubMed Journal: Bioprocess Biosyst Eng ISSN: 1615-7591 Impact factor: 3.210
Neural network DOE input levels
| Input factors | − 1 | 0 | 1 |
|---|---|---|---|
| First layer TanH | 0 | 50 | 100 |
| First layer Linear | 0 | 50 | 100 |
| First layer Gaussian | 0 | 50 | 100 |
| Second layer TanH | 0 | 50 | 100 |
| Second layer Linear | 0 | 50 | 100 |
| Second layer Gaussian | 0 | 50 | 100 |
Neural Network DOE response functions
| Responses | Function |
|---|---|
| Maximize | |
| SSE Training | Minimize |
| R2 Fit = | Minimize |
| SSE fit = SSE Training – SSE Validation | Minimize |
Fig. 1a Creating ANN-DOE Hybrid 32 ANNs are created per DOE specifications. b: ANN-DOE Hybrid Output parameters are modeled using standard least squares regression and the prediction profiler to find the optimal ANN configuration, a single layer ANN with 91-gaussian neurons. c: The Optimal ANN A new ANN is made using 91-gaussian activation functions Fig. 1 was created in JMP Pro 14
Fig. 2ANN Models Quality Distribution A quantile box plot of the ANNs is shown. The optimized ANN is highlighted with diagonal hatches. The goal was to maximize R2 Training and minimize the other outputs. We can see that our optimized ANN performs in the highest category for each quality metric. Figure 2 was created in JMP Pro 14. Data available in online resource 3
Fig. 3ANN versus Standard Least Squares Model Comparison. The models were compared using the model comparison dialog in JMP. This table shows the measure of fit of predicted doublings versus the actual doublings using the 24 runs from the historical dataset. Shown are the 7 neural nets with the highest Measure of Fit for Doublings R2 of the 32 created, the 91-Gaussian neural net, and the standard least squares model. Median of root average squared error (RASE) and average absolute error (AAE) for each model are also shown. Figure 3 was created in JMP Pro 14
Process Setpoints and Theorized Optimum by Model
| Models: | Cell Line | Seeding Density | Media Supplement Percentage | Media Exchange Percentage |
|---|---|---|---|---|
| SLS Regression Optimum | L1 | −1 | −0.462 | −0.402 |
| 91-Gaussiun Optimum | L1 | −1 | 0.358 | −1 |
| OFAT Setpoint | L1 | −0.5 | 0 | 0 |
| Historical Setpoint | L1 | 1 | 0 | 0 |
Fig. 4a: Doublings of all Flasks by Condition Doublings from the 24 DOE runs used to create the bioprocess design space and the 12 flasks grown for model comparison are graphed. The flasks grown under historical and OFAT conditions underperform many of the DOE runs. Two out of three replicates grown under regression derived setpoints outperform all cells grown under DOE conditions. All three replicates grown under ANN-derived conditions outperform all 33 other flasks. b: Cell Morphology by Condition Visually all cells exhibit standard “fibroblast-like” morphology. However, differences in cell density are observable between conditions. c: LSMeans Differences Student’s T Each condition is compared to every other condition with a t test at a = 0.050. We see that all four experimental setpoints resulted in significantly different doublings. Figure 4a, b were created in Microsoft Office Suite. Figure. 4c was created in JMP