| Literature DB >> 34884054 |
Chunwang Dong1, Chongshan Yang1,2, Zhongyuan Liu1,2, Rentian Zhang1,2, Peng Yan1, Ting An1,2, Yan Zhao2, Yang Li1.
Abstract
Catechin is a major reactive substance involved in black tea fermentation. It has a determinant effect on the final quality and taste of made teas. In this study, we applied hyperspectral technology with the chemometrics method and used different pretreatment and variable filtering algorithms to reduce noise interference. After reduction of the spectral data dimensions by principal component analysis (PCA), an optimal prediction model for catechin content was constructed, followed by visual analysis of catechin content when fermenting leaves for different periods of time. The results showed that zero mean normalization (Z-score), multiplicative scatter correction (MSC), and standard normal variate (SNV) can effectively improve model accuracy; while the shuffled frog leaping algorithm (SFLA), the variable combination population analysis genetic algorithm (VCPA-GA), and variable combination population analysis iteratively retaining informative variables (VCPA-IRIV) can significantly reduce spectral data and enhance the calculation speed of the model. We found that nonlinear models performed better than linear ones. The prediction accuracy for the total amount of catechins and for epicatechin gallate (ECG) of the extreme learning machine (ELM), based on optimal variables, reached 0.989 and 0.994, respectively, and the prediction accuracy for EGC, C, EC, and EGCG of the content support vector regression (SVR) models reached 0.972, 0.993, 0.990, and 0.994, respectively. The optimal model offers accurate prediction, and visual analysis can determine the distribution of the catechin content when fermenting leaves for different fermentation periods. The findings provide significant reference material for intelligent digital assessment of black tea during processing.Entities:
Keywords: catechin component content; congou; fermentation; hyperspectral; quantitative forecast; visual analysis
Mesh:
Substances:
Year: 2021 PMID: 34884054 PMCID: PMC8659440 DOI: 10.3390/s21238051
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Hyperspectral data acquisition system flowchart.
The Characteristic wavelengths screened by different variable screening methods.
| Algorithm | Endoplasmic Composition | Characteristic Wavelengths |
|---|---|---|
|
| The total amount of catechins | 406, 457, 505, 536, 547, 566, 580, 625, 650, 731, 776, 891, 942, and 947 nm |
|
| EGC | 436, 457, 492, 513, 554, 579, 625, 674, 683, 694, 705, 727, 729, 730, 743, 757, 766, 767, 832, 835, 838, 846, 847, 886, 893, 897, 901, 914, 917, 938, 952, and 953 nm |
|
| C | 417, 418, 435, 441, 442, 486, 507, 526, 548, 556, 614, 615, 623, 679, 696, 707, 786, 789, 797, 841, 887, 904, 905, 927, 928, 941, and 942 nm |
|
| EC | 436, 441, 447, 497, 499, 557, 558, 607, 627, 638, 656, 667, 683, 689, 696, 786, 883, 886, 887, 901, 904, 955, 956, and 957 nm |
|
| EGCG | 447, 453, 455, 496, 503, 505, 526, 527, 587, 589, 606, 627, 628, 635, 637, 646, 732, 735, 796, 797, 807, 828, 836, 838, 839, 912, 915, 927, and 948 nm |
|
| ECG | 447, 453, 455, 496, 503, 505, 526, 527, 587, 589, 606, 627, 628, 635, 637, 646, 732, 735, 796, 797, 807, 828, 836, 838, 839, 912, 915,927, and 948 nm |
Figure 2Change trends of catechin component content during Congou black tea fermentation.
Figure 3Changes in catechin content in the fermented leaves of stacked black tea at different fermentation levels and positions. ((a–d) are the changes in the content of catechin components at different positions during the fermentation of black tea for 2, 3, 4, and 5 hours, respectively).
Figure 4A set of average spectra at different fermentation moments and spectral curves before and after pretreatment. ((a) is the average spectrum of black tea at different fermentation times; (b) is the original spectrum of black tea at different fermentation moments; (c–i) are the pre-processed spectra of the original spectra using 2 De, Center, Min-Max, MSC, Smooth, SNV, and Zscore algorithms, respectively).
Optimal results of pretreatment methods that affect catechin content, in the PLS model.
| Physical and Chemical Composition | Pretreatment Method PCs | Calibration Set | Prediction Set | |||
|---|---|---|---|---|---|---|
| Rc | RMSECV | Rp | RMSEP | |||
| Total catechins | Z-Score | 5 | 0.918 | 0.502 | 0.911 | 0.592 |
| EGC | MSC | 10 | 0.810 | 0.0095 | 0.769 | 0.0102 |
| C | Z-Score | 9 | 0.889 | 0.0301 | 0.883 | 0.0398 |
| EC | SNV | 7 | 0.903 | 0.0218 | 0.891 | 0.0311 |
| EGCG | Z-Score | 9 | 0.928 | 0.116 | 0.920 | 0.151 |
| ECG | SNV | 9 | 0.929 | 0.117 | 0.923 | 0.140 |
Catechin content prediction results from different models.
| Catechin Component | Methods | Variable | PCs | Calibration Set | Prediction Set | |||
|---|---|---|---|---|---|---|---|---|
| Rc | RMSECV | Rp | RMSEP | RPD | ||||
| Total catechins | SPA-PLS | 14 | 6 | 0.977 | 0.268 | 0.979 | 0.239 | 4.46 |
| SPA-SVR | 14 | 8 | 0.994 | 0.142 | 0.987 | 0.193 | 5.92 | |
| SPA-ELM | 14 | 7 | 0.994 | 0.136 | 0.989 | 0.175 | 6.50 | |
| EGC | VCPA-GA-PLS | 32 | 8 | 0.946 | 0.0053 | 0.956 | 0.0050 | 2.03 |
| VCPA-GA-SVR | 32 | 8 | 0.983 | 0.0030 | 0.972 | 0.0041 | 3.78 | |
| VCPA-GA-ELM | 32 | 9 | 0.954 | 0.0048 | 0.926 | 0.0059 | 2.66 | |
| C | VCPA-IRIV-PLS | 27 | 10 | 0.993 | 0.0076 | 0.991 | 0.0087 | 6.11 |
| VCPA-IRIV-SVR | 27 | 7 | 0.996 | 0.0060 | 0.993 | 0.0082 | 6.72 | |
| VCPA-IRIV-ELM | 27 | 9 | 0.996 | 0.0056 | 0.992 | 0.0086 | 6.37 | |
| EC | VCPA-IRIV-PLS | 24 | 9 | 0.984 | 0.0113 | 0.987 | 0.0086 | 4.68 |
| VCPA-IRIV-SVR | 24 | 7 | 0.996 | 0.0059 | 0.990 | 0.0075 | 5.69 | |
| VCPA-IRIV-ELM | 24 | 8 | 0.995 | 0.0064 | 0.988 | 0.0081 | 5.25 | |
| EGCG | VCPA-IRIV-PLS | 29 | 10 | 0.991 | 0.0953 | 0.991 | 0.0868 | 6.24 |
| VCPA-IRIV-SVR | 29 | 5 | 0.996 | 0.0684 | 0.994 | 0.0793 | 7.33 | |
| VCPA-IRIV-ELM | 29 | 7 | 0.995 | 0.0701 | 0.993 | 0.0825 | 7.00 | |
| ECG | VCPA-GA-PLS | 49 | 9 | 0.992 | 0.0496 | 0.992 | 0.0498 | 6.53 |
| VCPA-GA-SVR | 49 | 8 | 0.995 | 0.0426 | 0.994 | 0.0502 | 6.68 | |
| VCPA-GA-ELM | 49 | 7 | 0.996 | 0.0335 | 0.994 | 0.0468 | 7.29 | |
Figure 5Optimal model optimization results and prediction effects of the content of catechin components. ((a,i) are the prediction effects of the total catechin model and the EGCG model, respectively; (c,e,g,k) are the internal parameter optimization diagrams of EGC, C, EC, and ECG models, respectively; (b,d,f,h,j,l) are the predicted scatter plots of the total catechins, EGC, C, EC and ECG models, respectively).
Figure 6The visual analysis chart of catechin content for different fermentation levels.