| Literature DB >> 32148785 |
Xiu-Ping Li1,2, Jieqing Li1, Tao Li3, Honggao Liu1, Yuanzhong Wang1,2.
Abstract
The plateau specialty agricultural products, wild porcini mushrooms, have great value both as a superb cuisine and as a potential medication. Due to quality different between species added with the fraud behavior in sales process, make poor quality or poisonous sample inflow into the market, which pose a health risk for consumers, but also disrupted the mushroom market. Traditional analysis way is time-consuming and laborious. Therefore, the aim of this study is to develop a way using fourier transform mid-infrared (FT-MIR) spectrometry and data fusion strategies for the fast and accurate species discrimination and predict amount of total polyphenol in four porcini mushrooms. The t-distributed stochastic neighbor embedding based on mid-level data fusion showed two species of Boletus edulis and B. umbriniporus have been identified. The order of correct rate of PLS-DA models was mid-level data fusionq (100%) > mid-level data fusione (97.06%) = mid-level data fusionv (97.06%) = stipes (97.06%) > low-level data fusion (94.12%) > caps (91.18%). The order of correct rate of grid-search support vector machine models was low-level data fusion (100%) > caps (94.12%) > stipes (91.18%), and the order of particle swarm optimization support vector machine was low-level data fusion (100%) > caps (97.06%) > stipes (88.24%). The mid-level data fusionq and low-level data fusion had best discrimination accuracy (100%) allowing each mushroom classed into its real species, which could be used for accurate discrimination of samples. B. edulis mushrooms had highest total polyphenol, with 14.76 mg/g dw and 17.33 in caps and stipes mg/g dw, respectively. The phenols were easier to accumulate in the caps in Leccinum rugosiceps (1.03) and B. tomentipes (1.19), and the opposite phenomenon is observed in B. edulis (0.85) and B. umbriniporus (0.95). The correlation coefficient and residual predictive deviation of best prediction model were 86.76% and 2.40%, respectively, indicating that that there is good relevance between FT-MIR and total polyphenol content, which could be used to predict roughly polyphenols content in mushrooms.Entities:
Keywords: FT‐MIR spectroscopy; data fusion; porcini mushroom; species discrimination; total polyphenol prediction
Year: 2020 PMID: 32148785 PMCID: PMC7020324 DOI: 10.1002/fsn3.1313
Source DB: PubMed Journal: Food Sci Nutr ISSN: 2048-7177 Impact factor: 2.863
The information of mushrooms samples
| Code | Species | Location | Quantity |
|---|---|---|---|
| 1 |
| Pubei, Yimen, Yuxi | 11 |
| 2 |
| Pubei, Yimen, Yuxi | 7 |
| 3 |
| Pubei, Yimen, Yuxi | 10 |
| 4 |
| Pubei, Yimen, Yuxi | 10 |
| 5 |
| Pubei, Yimen, Yuxi | 9 |
| 6 |
| Pubei, Yimen, Yuxi | 7 |
| 7 |
| Fuliangpeng, Eshan, Yuxi | 6 |
| 8 |
| Xiaojie, Eshan, Yuxi | 9 |
| 9 |
| Tongchang, Yimen, Yuxi | 10 |
| 10 |
| Chah, Eshan, Yuxi | 7 |
| 11 |
| Huangcaoba, Yuxi | 8 |
| 12 |
| Tongchang, Yimen, Yuxi | 6 |
Figure 1Workflow of species discrimination and total polyphenol prediction
Figure 2Average FT‐MIR spectra of the caps and stipes. 1, B. edulis; 2, L. rugosiceps; 3, B. omentipes; 4, B. umbriniporus
Figure 3Mushroom discrimination by t‐SNE. , B. edulis; , L. rugosiceps; , B. tomentipes; , B. umbriniporus; e, eigenvalue selection by eigenvalue >1; v, variable selection by VIP >1; q, variable selection by maximum Q2
Mushroom discrimination by PLS‐DA
| Caps | ||||||
|---|---|---|---|---|---|---|
| R2Y(cum) | Q2 (cum) | RMSEE (avg) | RMSECV (avg) | LVS | ||
| 0.84 | 0.63 | 0.18 | 0.26 | 12 | ||
| 1 | 2 | 3 | 4 | Accuracy (%) | ||
| 1 | 14 | 0 | 0 | 0 | 100 | |
| 2 | 0 | 3 | 2 | 1 | 50 | |
| 3 | 0 | 0 | 10 | 0 | 100 | |
| 4 | 0 | 0 | 0 | 4 | 100 | |
| Total | 34 | 14 | 3 | 12 | 5 | 91.18 |
1, B. edulis; 2, L. rugosiceps; 3, B. tomentipes; 4, B. umbriniporus; FD, first‐order derivative, LVs: the number of potential variables.
Predicted category.
Genuine category.
Mushroom discrimination by GS‐SVM and PSO‐SVM
| Origin of data | Strategy | Best c | Best g | Accuracy of training set (%) | Accuracy of test set (%) |
|---|---|---|---|---|---|
| Caps | GS‐SVM | 1.05e + 06 | 1.91e−06 | 87.88 | 94.12 (32/34) |
| PSO‐SVM | 30.44 | 20.93 | 89.39 | 97.06 (33/34) | |
| Stipes | GS‐SVM | 2,896.31 | 6.91e−04 | 93.94 | 91.18 (31/34) |
| PSO‐SVM | 2.38 | 54.73 | 92.42 | 88.24 (30/34) | |
| Low‐level data fusion | GS‐SVM | 5.66 | 45.25 | 92.42 | 100 (34/34) |
| PSO‐SVM | 7.9 | 27.85 | 92.42 | 100 (34/34) |
The arithmetic means, standard deviation, median value, amount range, extraction percentage, and bioconcentration factor of total phenolic content in two morphological parts of mushrooms
| Morphological part | Species | |||
|---|---|---|---|---|
|
|
|
|
| |
| Caps | 15.10 ± 1.37 | 14.33 ± 2.02 | 12.27 ± 1.85 | 10.22 ± 1.97 |
| 14.76 | 14.06 | 12.03 | 10.13 | |
| 12.28–17.55 | 9.49–17.04 | 8.98–16.34 | 7.59–16.91 | |
| 1.51 | 1.4 | 1.23 | 1.02 | |
| Stipes | 15.81 ± 3.48 | 14.00 ± 2.10 | 11.30 ± 3.83 | 10.74 ± 1.40 |
| 17.33 | 13.68 | 10.09 | 10.70 | |
| 8.16–19.71 | 10.63–19.13 | 6.32–18.50 | 7.59–13.42 | |
| 1.58 | 1.4 | 1.13 | 1.07 | |
| BCFc‐s | 0.85 | 1.03 | 1.19 | 0.95 |
Abbreviations: dw, dry weight; C, the number of caps; s, the number of stipes; BCFc‐s, quotient calculated by the median value of cap divided by the median value of stipe.
Prediction results of total polyphenol by GS‐SVM regression model
| Substrate | Treatent | Best c | Best g | Cross validation | External prediction | |||
|---|---|---|---|---|---|---|---|---|
| R2 | RMSECV | R2 | RMSEP | RPD | ||||
| Caps | No | 724.08 | 3.45e−04 | 0.92 | 0.73 | 0.77 | 2.24 | 1.20 |
| SNV | 5.66 | 5.52e−03 | 1 | 0.12 | 0.53 | 1.10 | 1.13 | |
| MSC | 370,728 | 9.54e−07 | 0.96 | 0.55 | 0.67 | 2.34 | 0.92 | |
| FD | 16 | 1.95e−03 | 0.97 | 0.41 | 0.68 | 1.31 | 1.52 | |
|
| 4 | 2.76e−03 | 0.97 | 0.43 | 0.87 | 0.82 | 2.40 | |
| SG (7) | 512 | 4.88e−04 | 0.92 | 0.69 | 0.77 | 2.15 | 1.24 | |
| SG (15) | 724.08 | 3.45e−04 | 0.91 | 0.75 | 0.77 | 2.29 | 1.19 | |
| SNV + | 0.35 | 5.52e−03 | 0.49 | 1.91 | 0.11 | 1.84 | 0.78 | |
| FD + SG (7) | 16 | 1.95e−03 | 0.97 | 0.41 | 0.67 | 1.32 | 1.51 | |
| FD + SG (9) | 16 | 1.95e−03 | 0.97 | 0.41 | 0.67 | 1.33 | 1.50 | |
| SNV + FD+SG (9) | 64 | 2.44e−04 | 0.95 | 0.80 | 0.58 | 1.11 | 1.86 | |
| Stipes | No | 92,681.9 | 1.35e−06 | 0.88 | 1.27 | 0.80 | 3.28 | 1.04 |
| SNV | 524,288 | 1.35e−06 | 0.97 | 0.66 | 0.80 | 3.80 | 0.95 | |
| MSC | 8,192 | 2.70e−06 | 0.89 | 1.16 | 0.85 | 2.02 | 1.28 | |
| FD | 32,768 | 1.08e−05 | 1 | 0.13 | 0.85 | 4.80 | 0.93 | |
|
| 32,768 | 9.54 | 0.98 | 0.57 | 0.79 | 2.09 | 1.33 | |
| SG (7) | 2,896.31 | 4.32e−05 | 0.9 | 1.17 | 0.81 | 3.70 | 0.95 | |
| SNV + FD | 1,024 | 1.53e−05 | 0.94 | 0.82 | 0.85 | 2.65 | 1.20 | |
| MSC + FD | 11,585.2 | 1.90e−06 | 0.95 | 0.75 | 0.83 | 2.27 | 1.38 | |
| FD + SG (9) | 16,384 | 2.15e−05 | 1 | 0.13 | 0.86 | 4.61 | 0.96 | |
| FD + SG (11) | 1,448.15 | 2.16e−05 | 0.92 | 0.96 | 0.88 | 4.09 | 0.91 | |
| FD + SG (13) | 4,096 | 1.53e−05 | 0.96 | 0.72 | 0.89 | 4.25 | 0.94 | |
| MSC + FD + SG (9) | 5,792.62 | 3.81e−06 | 0.94 | 0.78 | 0.83 | 2.32 | 1.78 | |
Abbreviations: FD, first‐order derivative; SD, second‐order derivative; MSC, multiplicative scattering correction; SNV, standard normal variate; SG, Savitzky––Golay smoothing; SG (7), Savitzky–Golay smoothing with seven points; SG (9), Savitzky–Golay smoothing with nine points; SG (11), SavitzkyGolay smoothing with eleven points; SG (13), Savitzky–Golay smoothing with thirteen points; SG (15), SavitzkyGolay smoothing with thirteen points.
Figure 4Best prediction results of total polyphenol