| Literature DB >> 36119610 |
Jing Ming1, Mingjia Liu2, Mi Lei1, Bisheng Huang1, Long Chen2.
Abstract
Chaenomelis Fructus is a widely used traditional Chinese medicine with a long history in China. The total content of oleanolic acid (OA) and ursolic acid (UA) is taken as an important quality marker of Chaenomelis Fructus. In this study, quantitative models for the prediction total content of OA and UA in Chaenomelis Fructus were explored based on near-infrared spectroscopy (NIRS). The content of OA and UA in each sample was determined using high-performance liquid chromatography (HPLC), and the data was used as a reference. In the partial least squares (PLS) model, both leave one out cross validation (LOOCV) of the calibration set and external validation of the validation set were used to screen spectrum preprocessing methods, and finally the multiplicative scatter correction (MSC) was chosen as the optimal pretreatment method. The modeling spectrum bands and ranks were optimized using PLS regression, and the characteristic spectrum range was determined as 7,500-4,250 cm-1, with 14 optimal ranks. In the back propagation artificial neural network (BP-ANN) model, the scoring data of 14 ranks obtained from PLS regression analysis were taken as input variables, and the total content of OA and UA reference values were taken as output values. The number of hidden layer nodes of BP-ANN was screened by full-cross validation (Full-CV) of the calibration set and external validation of the validation set. The result shows that both PLS model and PLS-BP-ANN model have strong prediction ability. In order to evaluate and compare the performance and prediction ability of models, the total content of OA and UA in each sample of the test set were detected under the same HPLC conditions, the NIRS data of the test set were input, respectively, to the optimized PLS model and PLS-BP-ANN model. By comparing the root-mean-square error (RMSEP) and determination coefficient (R 2) of the test set and ratio of performance to deviation (RPD), the PLS-BP-ANN model was found to have better performance with RMSEP of 0.59 mg·g-1, R 2 of 95.10%, RPD of 4.53 and bias of 0.0387 mg·g-1. The results indicated that NIRS can be used for the rapid quality control of Chaenomelis Fructus.Entities:
Keywords: Chaenomelis Fructus; back propagation artificial neural network; near-infrared spectroscopy; oleanolic acid; partial least squares regression; quantitative model; ursolic acid
Year: 2022 PMID: 36119610 PMCID: PMC9478200 DOI: 10.3389/fpls.2022.978937
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
Figure 1NIR spectra of samples.
Statistic results of the calibration set and validation set.
| Set | Samples | OA (mg·g−1) | UA (mg·g−1) | Total content of OA and UA (mg·g−1) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Range | Mean | SD | CV | Range | Mean | SD | CV | Range | Mean | SD | CV | ||
| Calibration set | 60 | 1.9–13.4 | 7.7 | 2.6 | 0.34 | 0.2–5 | 1.5 | 1 | 0.67 | 4.2–15.3 | 9.2 | 2.6 | 0.28 |
| Validation set | 30 | 2.1–13.2 | 7.8 | 2.6 | 0.33 | 0.3–3.2 | 1.5 | 0.9 | 0.60 | 4.4–14.5 | 9.3 | 2.7 | 0.29 |
Modeling results based on different compound.
| Model number | Compound | LOOCV | External validation | Rank | |||
|---|---|---|---|---|---|---|---|
| RMSECV (mg·g−1) | RMSEP (mg·g−1) | RPD | |||||
| M1 | OA | 1.76 | 52.00 | 1.3 | 72.89 | 1.97 | 20 |
| M2 | UA | 0.92 | 6.34 | 0.81 | 14.05 | 1.09 | 6 |
| M3 | OA+UA | 1.14 | 80.50 | 0.73 | 92.17 | 3.58 | 18 |
Optimization results of spectra pretreatment in calibration models.
| Model number | Pretreatment | LOOCV | External validation | Rank | |||
|---|---|---|---|---|---|---|---|
| RMSECV (mg·g−1) | RMSEP (mg·g−1) | RPD | |||||
| M4 | VN | 1.02 | 84.46 | 0.71 | 92.74 | 3.72 | 20 |
| M5 | FD+SG (17 points) | 1.00 | 85.13 | 0.84 | 89.66 | 3.11 | 15 |
| M6 | SD+SG (17 points) | 1.26 | 76.23 | 1.03 | 84.50 | 2.54 | 18 |
| M7 | MSC | 1.02 | 84.6 | 0.66 | 93.67 | 3.98 | 19 |
| M8 | FD+VN+SG (17 points) | 0.99 | 85.34 | 0.76 | 91.60 | 3.46 | 14 |
| M9 | FD+MSC+SG (17 points) | 1.07 | 82.85 | 0.71 | 92.56 | 3.67 | 12 |
Figure 2The characteristic interval division (A) and correlation curve between number of principal factors and RMSE in the PLS model (B).
Figure 3The results of model M10 for the total content of OA and UA. (A) Internal cross-validation and (B) external validation.
Figure 4The optimization of neurons number in hidden layer (A) and prediction results of internal across validation and external validation of model M11 (B).
Figure 5Prediction results for the total content of OA and UA of test set samples by model M10 (A) and M11 (B).
Statistic results of the test set.
| Model number | RMSEP (mg·g−1) | RPD | Bias (mg·g−1) | ARD (%) | |
|---|---|---|---|---|---|
| M10 | 0.68 | 93.72 | 3.95 | 0.0498 | 6.3 |
| M11 | 0.59 | 95.10 | 4.53 | 0.0387 | 4.8 |