| Literature DB >> 28737713 |
Zhe Wu1,2,3, Yanli Zhao4,5, Ji Zhang6,7, Yuanzhong Wang8,9.
Abstract
Gentiana rigescens is a precious herbal medicine in China because of its liver-protective and choleretic effects. A method for the qualitative identification and quantitative evaluation of G. rigescens from Yunnan Province, China, has been developed employing Fourier transform infrared (FT-IR) spectroscopy and high performance liquid chromatography (HPLC) with the aid of chemometrics such as partial least squares discriminant analysis (PLS-DA) and support vector machines (SVM) regression. Our results indicated that PLS-DA model could efficiently discriminate G. rigescens from different geographical origins. It was found that the samples which could not be determined accurately were in the margin or outside of the 95% confidence ellipses. Moreover, the result implied that geographical origins variation of root samples were more obvious than that of stems and leaves. The quantitative analysis was based on gentiopicroside content which was the main active constituent in G. rigescens. For the prediction of gentiopicroside, the performances of model based on the parameters selected through grid search algorithm (GS) with seven-fold cross validation were better than those based on genetic algorithm (GA) and particle swarm optimization algorithm (PSO). For the SVM-GS model, the result was satisfactory. FT-IR spectroscopy coupled with PLS-DA and SVM-GS can be an alternative strategy for qualitative identification and quantitative evaluation of G. rigescens.Entities:
Keywords: FT-IR spectroscopy; Gentiana rigescens; partial least squares discriminant analysis; qualitative; quantitative; support vector machines regression
Mesh:
Substances:
Year: 2017 PMID: 28737713 PMCID: PMC6152034 DOI: 10.3390/molecules22071238
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Contents of gentiopicroside in G. rigescens (mg/g) with different parts of plants from different geographical origins by HPLC.
Figure 2The average FT-IR spectra of root (A), stem (B) and leaf (C) in G. rigescens from different geographical origins (Dali, Lijiang, Diqing and Yuxi) in the 4000–400 cm−1 range.
Figure 3The scatter plot of PLS-DA FT-IR spectra display the information of samples of root (A), stem (B), leaf (C) and three parts (root, stem and leaf) (D) in G. rigescens from different geographical origins (Dali, Lijiang, Diqing and Yuxi). The abscissa represents the variation of the first component and the ordinate represents the variation of the second component.
Statistics of the optimal calibration models.
| Types Parameter | Model 1 | Model 2 | Model 3 | Model 4 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dali, Root | Diqing, Root | Lijiang, Root | Yuxi, Root | Dali, Stem | Diqing, Stem | Lijiang, Stem | Yuxi, Stem | Dali, Leaf | Diqing, Leaf | Lijiang, Leaf | Yuxi, Leaf | Root | Stem | Leaf | |
| R2 | 0.9543 | 0.9654 | 0.9426 | 0.9762 | 0.8346 | 0.8464 | 0.9098 | 0.9472 | 0.9031 | 0.8964 | 0.8854 | 0.9231 | 0.8334 | 0.8425 | 0.9331 |
| RMSEE | 0.0852 | 0.0985 | 0.1247 | 0.0615 | 0.1552 | 0.1999 | 0.1531 | 0.0877 | 0.1364 | 0.1695 | 0.1464 | 0.1140 | 0.1835 | 0.1927 | 0.1295 |
| RMSECV | 0.1785 | 0.2313 | 0.1992 | 0.0709 | 0.2740 | 0.2688 | 0.1807 | 0.1689 | 0.1614 | 0.1924 | 0.1680 | 0.1713 | 0.2272 | 0.2304 | 0.1445 |
Model evaluation statistics for partial least squares discriminant analysis (PLS-DA) models. R2: determination coefficient; RMSEE: root-mean-square error of estimation; RMSECV: root-mean-square error of cross-validation.
Results of the PLS-DA model 1 validation set samples.
| Samples | Actual Class | Calculated Class | YPre | Ydev |
|---|---|---|---|---|
| 2 | Dali, Root | Dali, Root | 1.1112 | 0.4306 |
| 4 | Dali, Root | Uncertain | 1.5243 | 0.6372 |
| 6 | Dali, Root | Uncertain | 1.5774 | 0.6637 |
| 7 | Dali, Root | Dali, Root | 1.2030 | 0.4765 |
| 13 | Lijiang, Root | Diqing, Root | 0.4190 | 0.1300 |
| 14 | Lijiang, Root | Lijiang, Root | 0.6108 | 0.2930 |
| 18 | Lijiang, Root | Lijiang, Root | 0.8904 | 0.3202 |
| 20 | Lijiang, Root | Lijiang, Root | 0.8029 | 0.3449 |
| 21 | Lijiang, Root | Lijiang, Root | 1.1986 | 0.4743 |
| 24 | Lijiang, Root | Lijiang, Root | 1.0592 | 0.4046 |
| 25 | Lijiang, Root | Lijiang, Root | 0.8340 | 0.2920 |
| 31 | Diqing, Root | Diqing, Root | 0.7014 | 0.2522 |
| 40 | Diqing, Root | Diqing, Root | 0.8934 | 0.3217 |
| 42 | Diqing, Root | Diqing, Root | 0.7153 | 0.2364 |
| 44 | Diqing, Root | Diqing, Root | 0.7702 | 0.2601 |
| 45 | Diqing, Root | Diqing, Root | 1.0444 | 0.3972 |
| 47 | Diqing, Root | Diqing, Root | 0.8626 | 0.3079 |
| 54 | Yuxi, Root | Yuxi-Root | 0.9195 | 0.3351 |
| 57 | Yuxi, Root | Uncertain | 1.3332 | 0.5416 |
| 59 | Yuxi, Root | Yuxi-Root | 1.1181 | 0.4341 |
Ypre: predicted value; Ydev and deviation values.
Results of the PLS-DA model 2 validation set samples.
| Samples | Actual Class | Calculated Class | YPre | Ydev |
|---|---|---|---|---|
| 61 | Dali, Stem | Dali, Stem | 0.9741 | 0.4110 |
| 62 | Dali, Stem | Dali, Stem | 1.1943 | 0.4722 |
| 66 | Dali, Stem | Dali, Stem | 0.8053 | 0.2776 |
| 70 | Dali, Stem | Dali, Stem | 0.5274 | 0.1387 |
| 72 | Lijiang, Stem | Lijiang, Stem | 1.0518 | 0.4318 |
| 74 | Lijiang, Stem | Lijiang, Stem | 0.9547 | 0.3524 |
| 76 | Lijiang, Stem | Uncertain | 1.3367 | 0.6355 |
| 81 | Lijiang, Stem | Lijiang, Stem | 0.7931 | 0.2796 |
| 87 | Lijiang, Stem | Lijiang, Stem | 1.0656 | 0.4078 |
| 88 | Lijiang, Stem | Lijiang, Stem | 1.0032 | 0.3766 |
| 91 | Diqing, Stem | Diqing, Stem | 1.2258 | 0.4879 |
| 94 | Diqing, Stem | Diqing, Stem | 1.1724 | 0.4612 |
| 95 | Diqing, Stem | Diqing, Stem | 1.0654 | 0.4077 |
| 104 | Diqing, Stem | Lijiang, Stem | 0.4763 | 0.2348 |
| 105 | Diqing, Stem | Diqing, Stem | 0.6590 | 0.2045 |
| 107 | Diqing, Stem | Lijiang, Stem | 0.3148 | 0.2175 |
| 112 | Yuxi, Stem | Yuxi, Stem | 0.9311 | 0.3704 |
| 117 | Yuxi, Stem | Yuxi, Stem | 0.9702 | 0.3601 |
| 120 | Yuxi, Stem | Yuxi, Stem | 0.9334 | 0.3417 |
Ypre: predicted value; Ydev and deviation values.
Results of PLS-DA model 3 validate set samples.
| Samples | Actual Class | Calculated Class | YPre | Ydev |
|---|---|---|---|---|
| 123 | Dali, Leaf | Dali, Leaf | 0.7455 | 0.2478 |
| 128 | Dali, Leaf | Uncertain | 1.4334 | 0.5917 |
| 133 | Lijiang, Leaf | Lijiang, Leaf | 0.8984 | 0.3242 |
| 134 | Lijiang, Leaf | Lijiang, Leaf | 0.6928 | 0.2214 |
| 135 | Lijiang, Leaf | Lijiang, Leaf | 0.5073 | 0.1949 |
| 136 | Lijiang, Leaf | Lijiang, Leaf | 0.8768 | 0.3134 |
| 137 | Lijiang, Leaf | Uncertain | 1.1790 | 0.6497 |
| 139 | Lijiang, Leaf | Lijiang, Leaf | 0.9106 | 0.3494 |
| 140 | Lijiang, Leaf | Lijiang, Leaf | 0.5083 | 0.1241 |
| 141 | Lijiang, Leaf | Lijiang, Leaf | 0.9160 | 0.3330 |
| 144 | Lijiang, Leaf | Lijiang, Leaf | 1.0707 | 0.4103 |
| 145 | Lijiang, Leaf | Lijiang, Leaf | 1.3312 | 0.5406 |
| 146 | Lijiang, Leaf | Dali, Leaf | 0.3026 | 0.1056 |
| 154 | Diqing, Leaf | Diqing, Leaf | 1.1157 | 0.4329 |
| 160 | Diqing, Leaf | Diqing, Leaf | 0.8556 | 0.3242 |
| 165 | Diqing, Leaf | Diqing, Leaf | 0.5760 | 0.1630 |
| 166 | Diqing, Leaf | Diqing, Leaf | 1.1611 | 0.4555 |
| 170 | Yuxi, Leaf | Yuxi, Leaf | 0.9046 | 0.3273 |
| 176 | Yuxi, Leaf | Yuxi, Leaf | 0.8712 | 0.3127 |
| 178 | Yuxi, Leaf | Yuxi, Leaf | 0.9259 | 0.3379 |
Ypre: predicted value; Ydev and deviation values.
Results of PLS-DA model 4 validation set samples.
| Samples | Actual Class | Calculated Class | YPre | Ydev |
|---|---|---|---|---|
| 5 | Root | Root | 0.7803 | 0.3378 |
| 6 | Root | Root | 0.9955 | 0.4414 |
| 7 | Root | Root | 0.7524 | 0.2794 |
| 12 | Root | Root | 0.9642 | 0.4205 |
| 13 | Root | Root | 0.9584 | 0.4167 |
| 14 | Root | Root | 0.8682 | 0.3566 |
| 15 | Root | Root | 0.7034 | 0.2467 |
| 18 | Root | Root | 0.8453 | 0.3500 |
| 19 | Root | Root | 0.8823 | 0.3660 |
| 20 | Root | Root | 0.8876 | 0.3695 |
| 21 | Root | Root | 0.6858 | 0.2350 |
| 22 | Root | Root | 0.8692 | 0.3573 |
| 24 | Root | Root | 0.9851 | 0.4345 |
| 29 | Root | Uncertain | 1.1444 | 0.5407 |
| 30 | Root | Root | 0.9511 | 0.4118 |
| 31 | Root | Root | 0.6869 | 0.2357 |
| 32 | Root | Root | 0.9623 | 0.4193 |
| 34 | Root | Root | 0.8161 | 0.3218 |
| 38 | Root | Root | 0.9635 | 0.4201 |
| 40 | Root | Root | 1.0701 | 0.4912 |
| 41 | Root | Root | 0.6536 | 0.2135 |
| 43 | Root | Root | 0.7605 | 0.2848 |
| 49 | Root | Root | 0.9819 | 0.4324 |
| 50 | Root | Root | 0.7037 | 0.3252 |
| 51 | Root | Uncertain | 1.2548 | 0.6143 |
| 52 | Root | Root | 0.9847 | 0.4343 |
| 53 | Root | Root | 0.8527 | 0.3462 |
| 57 | Root | Root | 0.8420 | 0.3391 |
| 58 | Stem | Stem | 0.5076 | 0.1588 |
| 61 | Stem | Uncertain | 1.1209 | 0.5250 |
| 62 | Stem | Uncertain | 1.2384 | 0.6033 |
| 63 | Stem | Root | 0.2768 | 0.2470 |
| 70 | Stem | Stem | 0.6177 | 0.1896 |
| 71 | Stem | Stem | 0.7688 | 0.2903 |
| 73 | Stem | Stem | 0.6066 | 0.1822 |
| 78 | Stem | Root | 0.1209 | 0.2604 |
| 82 | Stem | Stem | 0.7153 | 0.2596 |
| 83 | Stem | Root | 0.4597 | 0.0842 |
| 84 | Stem | Stem | 0.6782 | 0.2299 |
| 86 | Stem | Stem | 1.0495 | 0.4932 |
| 89 | Stem | Stem | 0.7826 | 0.2995 |
| 104 | Stem | Stem | 0.7306 | 0.2648 |
| 106 | Stem | Uncertain | 1.2252 | 0.5946 |
| 108 | Stem | Stem | 0.8728 | 0.3596 |
| 110 | Stem | Stem | 0.7044 | 0.2639 |
| 111 | Stem | Uncertain | 1.5978 | 0.8430 |
| 112 | Stem | Stem | 1.0158 | 0.4550 |
| 122 | Leaf | Leaf | 1.0092 | 0.4505 |
| 123 | Leaf | Leaf | 0.9702 | 0.4246 |
| 129 | Leaf | Leaf | 0.6900 | 0.2377 |
| 130 | Leaf | Leaf | 0.9760 | 0.4285 |
| 132 | Leaf | Leaf | 0.7933 | 0.3066 |
| 133 | Leaf | Leaf | 0.8667 | 0.3556 |
| 141 | Leaf | Leaf | 0.8623 | 0.3526 |
| 149 | Leaf | Leaf | 0.9041 | 0.3805 |
| 153 | Leaf | Leaf | 0.9990 | 0.4438 |
| 159 | Leaf | Leaf | 0.8485 | 0.3434 |
| 163 | Leaf | Leaf | 0.9792 | 0.4325 |
| 165 | Leaf | Leaf | 1.0081 | 0.4499 |
| 170 | Leaf | Leaf | 0.9311 | 0.3985 |
Ypre: predicted value; Ydev and deviation values.
Figure 4The 3D view of the optimization results for parameters c and g by grid search method with seven-fold cross validation.
Figure 5The optimization results for parameters c and g by genetic algorithm.
Figure 6The optimization results for parameters c and g by particle swarm optimization algorithm.
Statistics of the SVM models.
| Model | c | g | CVmse | Rt2 (%) | RMSEE | Rv2 (%) | RMSEP |
|---|---|---|---|---|---|---|---|
| GS-SVM | 0.5000 | 0.0040 | 0.0149 | 92.7143 | 3.1056 | 83.5721 | 11.1421 |
| GA-SVM | 0.4573 | 0.0100 | 0.0163 | 96.3977 | 3.1760 | 82.3279 | 11.1504 |
| PSO-SVM | 0.4454 | 0.0100 | 0.0162 | 96.3120 | 3.2131 | 82.3529 | 11.1506 |
Rt2: determination coefficient for training set; Rv2: determination coefficient for validated set; RMSEE: root-mean-square error of estimation; RMSEP: root-mean-square error of prediction.
Figure 7Correlation diagram between FT-IR predicted values and the reference values in the training and validation sets for gentiopicroside.
Information of G. rigescens.
| No. | Site | Description | No. | Site | Description | No. | Site | Description |
|---|---|---|---|---|---|---|---|---|
| 1–10 | Dali, Yunnan | Root | 61–70 | Dali, Yunnan | Stem | 121–130 | Dali, Yunnan | Leaf |
| 11–30 | Lijiang, Yunnan | Root | 71–90 | Lijiang, Yunnan | Stem | 131–149 | Lijiang, Yunnan | Leaf |
| 31–50 | Diqing, Yunnan | Root | 91–110 | Diqing, Yunnan | Stem | 150–169 | Diqing, Yunnan | Leaf |
| 51–60 | Yuxi, Yunnan | Root | 111–120 | Yuxi, Yunnan | Stem | 170–179 | Yuxi, Yunnan | Leaf |