| Literature DB >> 31200472 |
Yuan-Yuan Wang1, Jie-Qing Li2, Hong-Gao Liu3, Yuan-Zhong Wang4.
Abstract
Due to the existence of Lingzhi adulteration, there is a growing demand for species classification of medicinal mushrooms by various techniques. The objective of this study was to explore a rapid and reliable way to distinguish between different Lingzhi species and compare the influence of data pretreatment methods on the recognition results. To this end, 120 fresh fruiting bodies of Lingzhi were collected, and all of them were analyzed by attenuated total reflection-Fourier transform infrared spectroscopy (ATR-FTIR). Random forest (RF), support vector machine (SVM) and partial least squares discriminant analysis (PLS-DA) classification models were established for raw and pretreated second derivative (SD) spectral matrices to authenticate different Lingzhi species. The results of multivariate statistical analysis indicated that the SD preprocessing method displayed a higher classification ability, which may be attributed to the analysis of powder samples that requires removal of overlapping peaks and baseline shifts. Compared with RF, the results of the SVM and PLS-DA methods were more satisfying, and their accuracies for the test set were both 100%. Among SVM and PLS-DA, the training set and test set accuracy of PLS-DA were both 100%. In conclusion, ATR-FTIR spectroscopy data pretreated by SD combined with PLS-DA is a simple, rapid, non-destructive and relatively inexpensive method to discriminate between mushroom species and provide a good reference to quality assessment.Entities:
Keywords: Ganoderma; attenuated total reflection-Fourier transform infrared spectroscopy; authentication; chemometrics; partial least squares discriminant analysis; random forest; support vector machine
Mesh:
Year: 2019 PMID: 31200472 PMCID: PMC6631843 DOI: 10.3390/molecules24122210
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1The average spectra of ATR-FTIR of five species of Lingzhi samples. (a) Raw ATR-FTIR spectra; (b) pretreated SD of the ATR-FTIR spectra.
Figure 2PCA (a) and t-SNE (b) feature visualization of Lingzhi samples.
Figure 3Selection results of tree number and branch node of random forest model using raw and SD spectra.
Figure 4Actual and predicted category results of test samples by SVM. (a) Raw, (b) SD.
Results of SVM models for recognizing different species of Lingzhi basing on different data matrices.
| Data Matrices | Best c | Best g | Accuracy of Training Set (%) | Accuracy of Test Set (%) |
|---|---|---|---|---|
| Raw | 5.24288 × 105 | 9.5367 × 10−7 | 82.72 | 89.74 |
| SD | 8 | 6.9053 × 10−4 | 93.83 | 100 |
Figure 5Latent variables (LVS), R2Y and Q2 values of PLS-DA of Raw (a) and SD (b) spectra for Lingzhi species.
Results of PLS-DA models of different species basing on different data matrices.
| Data Matrices | Rmsee | Rmsecv | Rmsep | Q2 | R2Y | Accuracy of Training Set (%) | Accuracy of Test Set (%) |
|---|---|---|---|---|---|---|---|
| Raw | 0.217228 | 0.25516 | 0.210544 | 0.474 | 0.651 | 92.59 | 89.74 |
| SD | 0.12055 | 0.22003 | 0.120649 | 0.651 | 0.896 | 100 | 100 |
Summary of the classification of Lingzhi by different models and spectra pretreatment using ATR-FTIR spectra.
| Methods | Predicted | Raw | SD | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| A | B | C | D | E | A | B | C | D | E | ||
| RF | A | 11 | 0 | 0 | 0 | 4 | 15 | 0 | 0 | 0 | 0 |
| B | 1 | 2 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | |
| C | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | |
| D | 0 | 0 | 0 | 0 | 3 | 2 | 0 | 0 | 1 | 0 | |
| E | 1 | 0 | 0 | 0 | 14 | 0 | 0 | 0 | 0 | 15 | |
| SVM | A | 14 | 0 | 0 | 1 | 0 | 15 | 0 | 0 | 0 | 0 |
| B | 2 | 1 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | |
| C | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | |
| D | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 3 | 0 | |
| E | 1 | 0 | 0 | 0 | 14 | 0 | 0 | 0 | 0 | 15 | |
| PLS-DA | A | 15 | 0 | 0 | 0 | 0 | 15 | 0 | 0 | 0 | 0 |
| B | 1 | 2 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | |
| C | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | |
| D | 0 | 0 | 0 | 1 | 2 | 0 | 0 | 0 | 3 | 0 | |
| E | 1 | 0 | 0 | 0 | 14 | 0 | 0 | 0 | 0 | 15 | |
Parameters of merit for the classification of Lingzhi using ATR-FTIR spectrum after applying different chemometric methods and spectra pretreatment.
| Methods | Parameter | Raw | SD | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| A | B | C | D | E | A | B | C | D | E | ||
| RF | Sensitivity | 0.733 | 0.667 | 0.333 | 0.000 | 0.933 | 1.000 | 1.000 | 1.000 | 0.333 | 1.000 |
| Specificity | 0.833 | 1.000 | 1.000 | 1.000 | 0.708 | 0.917 | 1.000 | 1.000 | 1.000 | 1.000 | |
| Precision | 0.733 | 1.000 | 1.000 | 0.000 | 0.667 | 0.882 | 1.000 | 1.000 | 1.000 | 1.000 | |
| SVM | Sensitivity | 0.933 | 0.333 | 1.000 | 1.000 | 0.933 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| Specificity | 0.875 | 1.000 | 1.000 | 0.972 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | |
| Precision | 0.824 | 1.000 | 1.000 | 0.75 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | |
| PLS-DA | Sensitivity | 1.000 | 0.667 | 1.000 | 0.333 | 0.933 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| Specificity | 0.917 | 1.000 | 1.000 | 1.000 | 0.917 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | |
| Precision | 0.882 | 1.000 | 1.000 | 1.000 | 0.875 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | |
Information of the mushroom samples.
| Code | Quantity | NO. | Latin Name |
|---|---|---|---|
| A | 45 | 1–45 |
|
| B | 9 | 46–54 |
|
| C | 10 | 55–64 |
|
| D | 10 | 65–74 |
|
| E | 46 | 75–120 |
|