| Literature DB >> 30241281 |
Yaqi Wang1,2, Yuanzhen Yang3, Jiaojiao Jiao4, Zhenfeng Wu5, Ming Yang6,7.
Abstract
A support vector regression (SVR) method was introduced to improve the robustness and predictability of the design space in the implementation of quality by design (QbD), taking the extraction process of Pueraria lobata as a case study. In this paper, extraction time, number of extraction cycles, and liquid⁻solid ratio were identified as critical process parameters (CPPs), and the yield of puerarin, total isoflavonoids, and extracta sicca were the critical quality attributes (CQAs). Models between CQAs and CPPs were constructed using both a conventional quadratic polynomial model (QPM) and the SVR algorithm. The results of the two models indicated that the SVR model had better performance, with a higher R² and lower root-mean-square error (RMSE) and mean absolute deviation (MAD) than those of the QPM. Furthermore, the design space was predicted using a grid search technique. The operational range was extraction time, 24⁻51 min; number of extraction cycles, 3; and liquid⁻solid ratio, 14⁻18 mL/g. This study is the first reported work optimizing the design space of the extraction process of P. lobata based on an SVR model. SVR modeling, with its better prediction accuracy and generalization ability, could be a reliable tool for predicting the design space and shows great potential for the quality control of QbD.Entities:
Keywords: Pueraria lobata; QPM; QbD; SVR; design space; extraction process
Mesh:
Substances:
Year: 2018 PMID: 30241281 PMCID: PMC6222814 DOI: 10.3390/molecules23102405
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Results of training and test set.
| No. | Factors | Response Variables | ||||
|---|---|---|---|---|---|---|
| Training set | ||||||
| 1 | 35 | 3 | 5 | 3.93 | 28.64 | 16.00 |
| 2 | 35 | 3 | 15 | 5.22 | 34.11 | 43.86 |
| 3 | 10 | 3 | 10 | 3.63 | 27.53 | 28.95 |
| 4 | 35 | 2 | 10 | 4.03 | 28.06 | 16.40 |
| 5 | 60 | 3 | 10 | 4.74 | 34.44 | 26.82 |
| 6 | 35 | 2 | 10 | 3.90 | 27.97 | 20.60 |
| 7 | 35 | 1 | 5 | 1.22 | 13.52 | 39.20 |
| 8 | 10 | 2 | 15 | 3.24 | 25.18 | 27.57 |
| 9 | 60 | 2 | 5 | 3.24 | 26.67 | 13.37 |
| 10 | 10 | 1 | 10 | 1.76 | 14.82 | 7.73 |
| 11 | 35 | 2 | 10 | 3.93 | 28.25 | 15.41 |
| 12 | 60 | 1 | 10 | 2.40 | 23.47 | 12.66 |
| 13 | 35 | 2 | 10 | 4.03 | 30.40 | 23.03 |
| 14 | 60 | 2 | 15 | 4.28 | 33.46 | 36.94 |
| 15 | 35 | 2 | 10 | 3.77 | 28.90 | 21.69 |
| 16 | 10 | 2 | 5 | 2.32 | 19.67 | 8.46 |
| 17 | 35 | 1 | 15 | 2.56 | 20.74 | 20.70 |
| Test set | ||||||
| 1 | 25 | 2 | 10 | 3.50 | 25.92 | 22.70 |
| 2 | 30 | 2 | 8 | 3.45 | 25.37 | 17.62 |
| 3 | 15 | 2 | 15 | 3.46 | 26.35 | 34.41 |
| 4 | 20 | 2 | 15 | 3.45 | 27.78 | 35.40 |
Statistical parameters of the quadratic polynomial model (QPM) and the support vector regression (SVR) model.
| QPM | SVR | ||||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
| ||
|
| Training set | 0.985 | 0.127 | 0.111 | 0.983 | 0.132 | 0.077 |
| Test set | 0.903 | 0.191 | 0.164 | 0.918 | 0.175 | 0.133 | |
| Cross-validation | 0.802 | 0.457 | 0.366 | 0.846 | 0.403 | 0.329 | |
|
| Training set | 0.988 | 0.641 | 0.514 | 0.982 | 0.789 | 0.596 |
| Test set | 0.944 | 0.946 | 0.797 | 0.975 | 0.636 | 0.559 | |
| Cross-validation | 0.908 | 1.795 | 1.429 | 0.954 | 1.272 | 1.031 | |
|
| Training set | 0.964 | 1.906 | 1.56 | 0.961 | 2.005 | 1.646 |
| Test set | 0.706 | 4.12 | 4.02 | 0.765 | 3.683 | 3.606 | |
| Cross-validation | 0.724 | 5.311 | 4.567 | 0.821 | 4.281 | 3.834 | |
Coefficients of the constructed QPM equation.
| QPM | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| C0 |
|
|
|
|
|
|
|
|
| ||
|
| Regression coefficient | 0.751 | −1.524 | −1.674 | −1.124 | 2.157 | 3.859 | 2.237 | 0.470 | 0.120 | −0.050 |
| 0.014 * | 0.006 * | 0.003 * | 0.023 * | 0.275 | 0.771 | 0.903 | 0.003 * | 0.001 * | 0.003 * | ||
|
| Regression coefficient | 9.371 | −3.317 | −11.287 | −6.567 | 11.257 | 26.075 | 13.050 | −1.740 | 1.280 | −1.750 |
| 0.001 * | 0.132 | 0.001 * | 0.012 * | 0.413 | 0.542 | 0.41 | 0.003 * | 0.001 * | 0.001 * | ||
|
| Regression coefficient | 1.402 | 0.158 | −1.702 | 8.478 | 5.412 | 17.347 | 5.582 | −7.060 | 4.460 | 11.080 |
| 0.698 | 0.979 | 0.777 | 0.186 | 0.273 | 0.477 | 0.104 | 0.491 | 0.053 | 0.478 | ||
* Significant at the 0.05 level.
Figure 1Schematic for SVR algorithm implementation.
Optimized parameters for the SVR model.
| SVR | |||
|---|---|---|---|
|
| C |
| |
|
| 2−4 | 22 | 2−6 |
|
| 2−5 | 24 | 2−3 |
|
| 2−4 | 25 | 2−3 |
Figure 2Comparison of predicted and experimental values for QPM and SVR. (a) Y1; (b) Y2; (c) Y3.
Figure 3Design space and contour plots: (a) extraction cycle (X2) fixed at 3 cycles; (b) extraction time (X1) fixed at 35 min; (c) liquid-to-solid ratio (X3) fixed at 9 mL/g; (d) four-dimensional operation space (D > 0.9).
Responses for verification experiments.
| No. | Factors | Predicted D Value | Experimental D Value | ||
|---|---|---|---|---|---|
| 1 | 35 | 3 | 14 | 0.98 | 0.99 |
| 2 | 40 | 3 | 15 | 1.03 | 1.01 |