| Literature DB >> 29311739 |
Tianjun Yuan1,2, Yanli Zhao1, Ji Zhang1, Yuanzhong Wang3.
Abstract
Dried sclerotium of Wolfiporia cocos (F.A. Wolf) Ryvarden & Gilb. is a traditional Chinese medicine. Its chemical components showed difference among geographical origins, which made it difficult to keep therapeutic potency consistent. The identification of the geographical origin of W. cocos is the fundamental prerequisite for its worldwide recognition and acceptance. Four variable selection methods were employed for near infrared spectroscopy (NIR) variable selection and the characteristic variables were screened for the establishment of Fisher function models in further identification of the origin of W. cocos from Yunnan, China. For the obvious differences between poriae cutis (fu-ling-pi in Chinese, or FLP) and the inner part (bai-fu-ling in Chinese, or BFL) of the sclerotia of W. cocos in the pattern space of principal component analysis (PCA), we established discriminant models for FLP and BFL separately. Through variable selection, the models were significant improved and also the models were simplified by using only a small part of the variables. The characteristic variables were screened (13 for BFL and 10 for FLP) to build Fisher discriminant function models and the validation results showed the models were reliable and effective. Additionally, the characteristic variables were interpreted.Entities:
Mesh:
Year: 2018 PMID: 29311739 PMCID: PMC5758700 DOI: 10.1038/s41598-017-18458-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The original spectra of BFL and FLP. The red lines represent BFL samples, while the other colorized lines stand for FLP samples.
Figure 2Principal component scores of BFL and FLP. The black triangles represent BFL samples, while the red squares correspond to FLP samples.
Prediction results of PLS-DA models of BFL built by different variable selection methods.
| Primary ID | 886 spectral variables | 40 spectral variables by CARS | 95 spectral variables by MC-UVE | 90 spectral variables by SPA | 129 spectral variables by LPG | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AC | CC | Ypre | Ydev | AC | CC | Ypre | Ydev | AC | CC | Ypre | Ydev | AC | CC | Ypre | Ydev | AC | CC | Ypre | Ydev | |
| BFL-01 | 1 | 1 | 0.836 | 0.116 | 1 | 1 | 1.001 | 0.001 | 1 | 1 | 0.654 | 0.245 | 1 | 1 | 1.023 | 0.016 | 1 | 1 | 0.805 | 0.138 |
| BFL-05 | 1 | 1 | 1.629 | 0.445 | 1 | 1 | 0.774 | 0.160 | 1 | 1 | 0.644 | 0.252 | 1 | 1 | 1.23 | 0.163 | 1 | 1 | 0.548 | 0.320 |
| BFL-20 | 1 | 1 | 1.536 | 0.379 | 1 | 1 | 0.834 | 0.117 | 1 | 1 | 0.541 | 0.324 | 1 | 1 | 0.805 | 0.138 | 1 | 1 | 0.749 | 0.178 |
| BFL-34 | 1 | 1 | 1.618 | 0.437 | 1 | 1 | 0.719 | 0.199 | 1 | 1 | 0.621 | 0.268 | 1 | 1 | 1.205 | 0.145 | 1 | 1 | 0.762 | 0.168 |
| BFL-40 | 1 | 1 | 0.835 | 0.117 | 1 | 1 | 1.168 | 0.119 | 1 | 1 | 0.711 | 0.204 | 1 | 1 | 1.577 | 0.408 | 1 | 1 | 0.822 | 0.126 |
| BFL-48 | 1 | SU | 1.782 | 0.553 | 1 | 1 | 1.119 | 0.084 | 1 | 1 | 0.685 | 0.223 | 1 | 1 | 1.400 | 0.283 | 1 | 1 | 0.725 | 0.194 |
| BFL-33 | 2 | 2 | 2.004 | 0.003 | 2 | 2 | 2.084 | 0.059 | 2 | 2 | 1.756 | 0.173 | 2 | 2 | 1.849 | 0.107 | 2 | 2 | 1.758 | 0.171 |
| BFL-42 | 2 | SU | 2.635 | 0.450 | 2 | 2 | 1.554 | 0.315 | 2 | 2 | 1.539 | 0.326 | 2 | 2 | 1.87 | 0.092 | 2 | 2 | 1.682 | 0.225 |
| BFL-49 | 2 | 2 | 1.928 | 0.051 | 2 | 2 | 1.593 | 0.288 | 2 | 2 | 2.192 | 0.136 | 2 | 2 | 1.963 | 0.026 | 2 | 2 | 1.920 | 0.057 |
| BFL-54 | 2 | 2 | 1.845 | 0.110 | 2 | 2 | 1.967 | 0.023 | 2 | 2 | 1.963 | 0.026 | 2 | 2 | 1.72 | 0.198 | 2 | 2 | 1.821 | 0.126 |
| BFL-55 | 2 | 2 | 1.751 | 0.176 | 2 | 2 | 1.942 | 0.041 | 2 | 2 | 1.882 | 0.083 | 2 | 2 | 2.034 | 0.024 | 2 | 2 | 1.716 | 0.201 |
| BFL-12 | 3 | 3 | 2.802 | 0.140 | 3 | 3 | 3.019 | 0.013 | 3 | 3 | 2.562 | 0.310 | 3 | 3 | 2.88 | 0.085 | 3 | 3 | 2.575 | 0.300 |
| BFL-15 | 4 | 4 | 3.844 | 0.110 | 4 | 4 | 4.593 | 0.419 | 4 | 4 | 3.744 | 0.181 | 4 | 4 | 3.876 | 0.088 | 4 | 4 | 3.582 | 0.296 |
| BFL-37 | 4 | 4 | 3.948 | 0.037 | 4 | 4 | 3.712 | 0.204 | 4 | 4 | 3.720 | 0.198 | 4 | 4 | 3.9 | 0.071 | 4 | 4 | 3.807 | 0.137 |
| BFL-47 | 4 | 4 | 3.893 | 0.076 | 4 | 4 | 3.873 | 0.090 | 4 | 4 | 3.861 | 0.098 | 4 | 4 | 3.657 | 0.243 | 4 | 4 | 3.817 | 0.129 |
| BFL-04 | 5 | SU | 5.653 | 0.462 | 5 | 5 | 4.901 | 0.070 | 5 | 5 | 4.782 | 0.154 | 5 | 5 | 4.565 | 0.308 | 5 | 5 | 4.576 | 0.300 |
| BFL-13 | 5 | 5 | 4.813 | 0.132 | 5 | 5 | 5.126 | 0.089 | 5 | 5 | 5.117 | 0.083 | 5 | 5 | 4.685 | 0.223 | 5 | 5 | 4.790 | 0.149 |
| BFL-16 | 5 | 5 | 4.904 | 0.068 | 5 | 5 | 5.047 | 0.033 | 5 | 5 | 4.751 | 0.176 | 5 | 5 | 4.818 | 0.129 | 5 | 5 | 5.013 | 0.009 |
| BFL-25 | 5 | 5 | 4.965 | 0.025 | 5 | 5 | 4.829 | 0.121 | 5 | 5 | 4.995 | 0.004 | 5 | 5 | 4.762 | 0.168 | 5 | 5 | 4.829 | 0.121 |
| Accuracy (%) | 84.21 | 100 | 100 | 100 | 100 | |||||||||||||||
|
| 0.940 | 0.977 | 0.966 | 0.972 | 0.970 | |||||||||||||||
| RMSECV | 0.290 | 0.181 | 0.219 | 0.197 | 0.208 | |||||||||||||||
| RMSEP | 0.382 | 0.239 | 0.289 | 0.260 | 0.274 | |||||||||||||||
Note: AC (Actual class), CC (Calculated class), Ypre (Predicted value), Ydev (Deviation), SU (Suspicious).
Prediction results of PLS-DA models of FLP built by different variable selection methods.
| Primary ID | 886 spectral variables | 20 spectral variables by CARS | 35 spectral variables by MC-UVE | 30 spectral variables by SPA | 129 spectral variables by LPG | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AC | CC | Ypre | Ydev | AC | CC | Ypre | Ydev | AC | CC | Ypre | Ydev | AC | CC | Ypre | Ydev | AC | CC | Ypre | Ydev | |
| FLP-01 | 1 | 1 | 0.940 | 0.042 | 1 | 1 | 0.551 | 0.317 | 1 | 1 | 1.078 | 0.055 | 1 | 1 | 0.816 | 0.130 | 1 | 1 | 0.852 | 0.105 |
| FLP-05 | 1 | 1 | 1.622 | 0.440 | 1 | 1 | 0.488 | 0.362 | 1 | 1 | 0.598 | 0.285 | 1 | 1 | 0.652 | 0.246 | 1 | 1 | 0.434 | 0.400 |
| FLP-32 | 1 | 1 | 1.015 | 0.011 | 1 | 1 | 1.114 | 0.081 | 1 | 1 | 0.593 | 0.288 | 1 | 1 | 0.627 | 0.264 | 1 | 1 | 1.057 | 0.040 |
| FLP-34 | 1 | 1 | 1.608 | 0.430 | 1 | UN | 0.380 | 0.438 | 1 | 1 | 1.290 | 0.205 | 1 | 1 | 1.172 | 0.122 | 1 | 1 | 0.974 | 0.018 |
| FLP-40 | 1 | 1 | 0.799 | 0.142 | 1 | 1 | 1.133 | 0.094 | 1 | 1 | 0.609 | 0.277 | 1 | 1 | 1.013 | 0.009 | 1 | 1 | 0.906 | 0.066 |
| FLP-50 | 1 | 1 | 0.810 | 0.134 | 1 | 1 | 1.138 | 0.098 | 1 | 1 | 0.866 | 0.095 | 1 | 1 | 1.436 | 0.308 | 1 | 1 | 0.870 | 0.092 |
| FLP-59 | 1 | 1 | 0.617 | 0.271 | 1 | 1 | 0.971 | 0.021 | 1 | 1 | 0.579 | 0.298 | 1 | 1 | 1.321 | 0.227 | 1 | 1 | 0.647 | 0.249 |
| FLP-30 | 2 | 2 | 1.895 | 0.074 | 2 | 2 | 1.474 | 0.372 | 2 | 2 | 2.465 | 0.329 | 2 | 2 | 1.802 | 0.140 | 2 | 2 | 1.737 | 0.186 |
| FLP-46 | 2 | 2 | 2.681 | 0.482 | 2 | 2 | 1.502 | 0.352 | 2 | 2 | 1.701 | 0.212 | 2 | 2 | 2.47 | 0.332 | 2 | 2 | 1.604 | 0.280 |
| FLP-49 | 2 | 2 | 2.587 | 0.415 | 2 | 2 | 2.053 | 0.037 | 2 | 2 | 1.509 | 0.347 | 2 | 2 | 1.866 | 0.095 | 2 | 2 | 1.521 | 0.339 |
| FLP-54 | 2 | 2 | 1.855 | 0.102 | 2 | 2 | 1.566 | 0.307 | 2 | 2 | 1.663 | 0.238 | 2 | 2 | 1.657 | 0.243 | 2 | 2 | 1.962 | 0.027 |
| FLP-08 | 3 | 3 | 2.989 | 0.008 | 3 | 3 | 3.368 | 0.260 | 3 | 3 | 3.039 | 0.028 | 3 | 3 | 3.469 | 0.332 | 3 | 3 | 2.899 | 0.072 |
| FLP-26 | 4 | 4 | 3.616 | 0.271 | 4 | 4 | 3.900 | 0.071 | 4 | 4 | 4.463 | 0.327 | 4 | 4 | 3.779 | 0.156 | 4 | 4 | 3.828 | 0.121 |
| FLP-45 | 4 | 4 | 4.412 | 0.291 | 4 | 4 | 4.350 | 0.247 | 4 | 4 | 3.723 | 0.196 | 4 | SU | 3.275 | 0.513 | 4 | 4 | 4.487 | 0.344 |
| FLP-04 | 5 | 5 | 5.407 | 0.288 | 5 | 5 | 5.474 | 0.335 | 5 | 5 | 4.683 | 0.224 | 5 | 5 | 5.321 | 0.227 | 5 | 5 | 5.482 | 0.341 |
| FLP-19 | 5 | 5 | 5.557 | 0.394 | 5 | 5 | 4.774 | 0.160 | 5 | 5 | 5.025 | 0.018 | 5 | 5 | 5.477 | 0.337 | 5 | 5 | 4.666 | 0.236 |
| FLP-23 | 5 | SU | 5.727 | 0.514 | 5 | 5 | 4.785 | 0.152 | 5 | 5 | 5.335 | 0.237 | 5 | 5 | 4.589 | 0.291 | 5 | 5 | 4.760 | 0.170 |
| FLP-25 | 5 | 5 | 4.777 | 0.158 | 5 | 5 | 4.599 | 0.284 | 5 | 5 | 4.789 | 0.149 | 5 | 5 | 4.720 | 0.198 | 5 | 5 | 4.809 | 0.135 |
| Accuracy (%) | 94.44 | 94.44 | 100 | 94.44 | 100 | |||||||||||||||
|
| 0.932 | 0.950 | 0.958 | 0.949 | 0.964 | |||||||||||||||
| RMSECV | 0.311 | 0.268 | 0.245 | 0.269 | 0.225 | |||||||||||||||
| RMSEP | 0.410 | 0.353 | 0.323 | 0.354 | 0.296 | |||||||||||||||
Note: AC (Actual Class), CC (Calculated Class), Ypre (Predicted value), Ydev (Deviation), UN (uncredited), SU (suspicious).
Figure 3Chemometric analysis of common variables of BFL. (a) PLS-DA scores scatter plot. (b) PLS-DA loading scatter plot. (c) PLS-DA loadings biplot. (d) Fisher discriminant analysis scatter plot.
Figure 4Chemometric analysis of common variables of FLP. (a) PLS-DA scores scatter plot. (b) PLS-DA loading scatter plot. (c) PLS-DA loadings biplot. (d) Fisher discriminant analysis scatter plot.
The coefficients of Fisher functions of BFL.
| Y1 | Y2 | Y3 | Y4 | Y5 | |
|---|---|---|---|---|---|
| A0 | 3698.54 | 4029.41 | 3761.75 | 3749.1 | 3788.01 |
| A1 | 8.31E+07 | 8.18E+07 | 8.80E+07 | 7.77E+07 | 7.97E+07 |
| A2 | 7.73E+07 | 7.58E+07 | 8.20E+07 | 7.10E+07 | 7.39E+07 |
| A3 | 5.84E+07 | 6.18E+07 | 5.92E+07 | 6.04E+07 | 6.03E+07 |
| A4 | 6.01E+06 | 3.62E+06 | 4.34E+06 | 5.36E+06 | 5.96E+06 |
| A5 | 4.03E+06 | 5.67E+06 | 5.35E+06 | 5.74E+06 | 4.64E+06 |
| A6 | 1.69E+08 | 1.77E+08 | 1.71E+08 | 1.71E+08 | 1.73E+08 |
| A7 | 1.51E+08 | 1.59E+08 | 1.55E+08 | 1.53E+08 | 1.55E+08 |
| A8 | 7.42 E+08 | 7.79E+08 | 7.84E+08 | 7.48E+08 | 7.56E+08 |
| A9 | 4.35E+08 | 4.65E+08 | 4.81E+08 | 4.52E+08 | 4.54E+08 |
| A10 | 5.83E+08 | 5.79E+08 | 5.64E+08 | 5.47E+08 | 5.73E+08 |
| A11 | 4.48E+08 | 4.40E+08 | 4.33E+08 | 4.20E+08 | 4.44E+08 |
| A12 | 6.64E+06 | 8.38E+06 | 8.07E+06 | 7.87E+06 | 6.05E+06 |
| A13 | 2.73E+07 | 2.27E+07 | 2.36E+07 | 2.37E+07 | 2.77E+07 |
The coefficients of Fisher functions of FLP.
| Y1 | Y2 | Y3 | Y4 | Y5 | |
|---|---|---|---|---|---|
| B0 | 1075.23 | 1133.72 | 1171.49 | 1174.26 | 1126.33 |
| B1 | 7.48E+06 | 7.47E+06 | 7.39E+06 | 7.23E+06 | 7.43E+06 |
| B2 | 1.07E+06 | 6.71E+05 | 4.47E+05 | 3.29E+07 | 7.55E+05 |
| B3 | 9.58E+06 | 9.37E+06 | 9.41E+06 | 9.30E+06 | 9.48E+06 |
| B4 | 7.00E+06 | 8.03E+06 | 8.10E+06 | 8.57E+06 | 7.48E+06 |
| B5 | 5.97E+06 | 7.05E+06 | 7.28E+06 | 7.98E+06 | 6.57E+06 |
| B6 | 5.41E+06 | 5.90E+06 | 5.86E+06 | 5.86E+06 | 5.64E+06 |
| B7 | 9.71E+07 | 9.82E+07 | 1.00E+08 | 9.89E+07 | 9.93E+07 |
| B8 | 4.87E+07 | 4.91E+07 | 4.91E+07 | 4.98E+07 | 4.95E+07 |
| B9 | 1.65E+07 | 1.68E+07 | 1.69E+07 | 1.63E+07 | 1.66E+07 |
| B10 | 1.42E+07 | 1.53E+07 | 1.55E+07 | 1.47E+07 | 1.44E+07 |
The validation results of the Fisher discriminant analysis of BFL.
| Validation | Statistics | Class | 1 | 2 | 3 | 4 | 5 | Total |
|---|---|---|---|---|---|---|---|---|
| Originala | Count | 1 | 13 | 0 | 0 | 0 | 0 | 13 |
| 2 | 0 | 7 | 0 | 0 | 0 | 7 | ||
| 3 | 0 | 0 | 4 | 0 | 0 | 4 | ||
| 4 | 0 | 0 | 0 | 6 | 0 | 6 | ||
| 5 | 1 | 0 | 0 | 0 | 9 | 10 | ||
| Accuracy rate % | 100 | 100 | 100 | 100 | 88.9 | 40 | ||
| Cross validationb | Count | 1 | 6 | 0 | 0 | 0 | 0 | 6 |
| 2 | 0 | 5 | 0 | 0 | 0 | 5 | ||
| 3 | 0 | 0 | 1 | 0 | 0 | 1 | ||
| 4 | 0 | 0 | 1 | 2 | 0 | 3 | ||
| 5 | 0 | 0 | 0 | 0 | 4 | 4 | ||
| Accuracy rate % | 100 | 100 | 100 | 66.7 | 100 | 19 |
Note: a97.50% % of original grouped cases correctly classified; bCross validation is done only for those cases in the analysis. In cross validation, each case is classified by the functions derived from all cases other than that case. 94.74% of the cross validation grouped cases correctly classified.
The validation results of the Fisher discriminant analysis of FLP.
| Statistics | Class | 1 | 2 | 3 | 4 | 5 | Total | |
|---|---|---|---|---|---|---|---|---|
| Originala | Count | 1 | 13 | 0 | 0 | 0 | 0 | 13 |
| 2 | 0 | 8 | 0 | 0 | 0 | 8 | ||
| 3 | 0 | 0 | 3 | 0 | 0 | 3 | ||
| 4 | 0 | 0 | 0 | 7 | 0 | 7 | ||
| 5 | 1 | 0 | 0 | 0 | 7 | 8 | ||
| Accuracy rate % | 100 | 100 | 100 | 100 | 87.5 | 39 | ||
| Cross validationb | Count | 1 | 6 | 0 | 0 | 0 | 0 | 6 |
| 2 | 0 | 4 | 0 | 0 | 0 | 4 | ||
| 3 | 0 | 0 | 2 | 0 | 0 | 2 | ||
| 4 | 0 | 0 | 0 | 2 | 0 | 2 | ||
| 5 | 1 | 0 | 0 | 0 | 3 | 4 | ||
| Accuracy rate % | 100 | 100 | 100 | 100 | 75 | 18 |
Note: a97.43% of original grouped cases correctly classified; bCross validation is done only for those cases in the analysis. In cross validation, each case is classified by the functions derived from all cases other than that case. 94.44% of the cross validation grouped cases correctly classified.