| Literature DB >> 31289665 |
Huan Su1, Weiquan Wu2, Xiaochun Wan1, Jingming Ning1.
Abstract
Seventy-three Lu'an guapian tea (LAGP) samples were collected from 11 growing locations in the city of Lu'an, Anhui Province, China. Through high-performance liquid chromatography, 18 amino acids, along with gallic acid, caffeine, and five catechins, were quantitatively detected. Hierarchical cluster, correlation and principal component analysis, and a support vector machine were used for geographical discrimination. The findings suggested that the differences in tea quality between the inner and outer mountain regions are related to isoleucine, leucine, phenylalanine, and valine contents, with a correlation coefficient of more than 0.85. Principal component analysis combining with support vector machine was a feasible method. The identification rates for the inner and outer mountains were 97.96% in the training set and 95.83% in the prediction set. Furthermore, the identification rates for the three counties were 91.84% and 95.83% in the training and prediction sets, respectively.Entities:
Keywords: Lu’an guapian tea; factor selection; geographical origin discrimination; high‐performance liquid chromatography; pattern recognition
Year: 2019 PMID: 31289665 PMCID: PMC6593377 DOI: 10.1002/fsn3.1062
Source DB: PubMed Journal: Food Sci Nutr ISSN: 2048-7177 Impact factor: 2.863
Figure 1Sample collection areas
Figure 2High‐performance liquid chromatography (HPLC) chromatogram of catechins, caffeine, and amino acids. HPLC chromatograms of the mixed references of amino acids (a) and of catechins and caffeine (b)
The content of each substance at different regions (WT%, means ± SD)
| Group 1 | Group 2 | ||||
|---|---|---|---|---|---|
| HSGP | JZGP | YAGP | IMGP | OMSP | |
| GA | 0.049 ± 0.002c | 0.036 ± 0.002b | 0.027 ± 0.002a | 0.036 ± 0.002 | 0.033 ± 0.002 |
| EGC | 3.230 ± 0.173b | 2.773 ± 0.124a | 3.328 ± 0.131b | 3.095 ± 0.105 | 2.945 ± 0.144 |
| C | 0.062 ± 0.008a | 0.122 ± 0.008b | 0.073 ± 0.010a | 0.090 ± 0.008 | 0.105 ± 0.010 |
| CAF | 3.445 ± 0.117ab | 3.785 ± 0.133b | 3.196 ± 0.103a | 3.492 ± 0.101 | 3.572 ± 0.131 |
| EC | 0.492 ± 0.024a | 0.512 ± 0.022a | 0.534 ± 0.025a | 0.500 ± 0.014 | 0.556 ± 0.033 |
| EGCG | 11.069 ± 0.510b | 8.718 ± 0.373a | 8.023 ± 0.260a | 9.275 ± 0.315 | 7.933 ± 0.280 |
| ECG | 1.805 ± 0.070b | 1.939 ± 0.085b | 1.413 ± 0.040a | 1.750 ± 0.068 | 1.672 ± 0.061 |
| Asp | 0.109 ± 0.004a | 0.147 ± 0.009b | 0.156 ± 0.008b | 0.137 ± 0.006 | 0.159 ± 0.010 |
| Ser | 0.036 ± 0.001a | 0.059 ± 0.005b | 0.071 ± 0.006b | 0.051 ± 0.003 | 0.079 ± 0.007 |
| Glu | 0.149 ± 0.004a | 0.170 ± 0.008a | 0.165 ± 0.009a | 0.163 ± 0.006 | 0.167 ± 0.010 |
| Gly | 0.003 ± 0.000a | 0.003 ± 0.000a | 0.003 ± 0.000a | 0.003 ± 0.000 | 0.004 ± 0.000 |
| His | 0.031 ± 0.001a | 0.039 ± 0.002a | 0.039 ± 0.003a | 0.036 ± 0.002 | 0.042 ± 0.003 |
| Arg | 0.094 ± 0.006ab | 0.123 ± 0.012b | 0.088 ± 0.006a | 0.098 ± 0.007 | 0.123 ± 0.013 |
| Thr | 0.015 ± 0.000a | 0.019 ± 0.001a | 0.025 ± 0.002b | 0.017 ± 0.001 | 0.028 ± 0.002 |
| Ala | 0.017 ± 0.001a | 0.021 ± 0.001b | 0.022 ± 0.001b | 0.019 ± 0.001 | 0.024 ± 0.001 |
| Pro | 0.051 ± 0.002a | 0.054 ± 0.002a | 0.053 ± 0.002a | 0.051 ± 0.002 | 0.057 ± 0.002 |
| Cys | 0.009 ± 0.002a | 0.009 ± 0.002a | 0.011 ± 0.002b | 0.009 ± 0.002 | 0.011 ± 0.002 |
| Tyr | 0.017 ± 0.002a | 0.020 ± 0.002a | 0.026 ± 0.002b | 0.019 ± 0.002 | 0.027 ± 0.002 |
| Val | 0.008 ± 0.002a | 0.015 ± 0.002b | 0.021 ± 0.002c | 0.013 ± 0.002 | 0.023 ± 0.002 |
| Met | 0.097 ± 0.002c | 0.053 ± 0.002b | 0.037 ± 0.002a | 0.061 ± 0.002 | 0.039 ± 0.002 |
| Lys | 0.016 ± 0.002a | 0.020 ± 0.002ab | 0.024 ± 0.002b | 0.018 ± 0.002 | 0.027 ± 0.002 |
| Ile | 0.007 ± 0.002a | 0.012 ± 0.002b | 0.018 ± 0.002c | 0.011 ± 0.002 | 0.020 ± 0.002 |
| Leu | 0.012 ± 0.002a | 0.018 ± 0.00b | 0.024 ± 0.002c | 0.016 ± 0.002 | 0.026 ± 0.002 |
| Phe | 0.016 ± 0.002a | 0.021 ± 0.002a | 0.032 ± 0.002b | 0.019 ± 0.002 | 0.035 ± 0.002 |
| Thea | 1.076 ± 0.002a | 1.003 ± 0.002a | 0.993 ± 0.002a | 0.999 ± 0.002 | 1.045 ± 0.002 |
Values in the same row of Group 1 that are labeled with different superscript letters (a–c) differ significantly (p < 0.05). Statistical analysis was one‐way ANOVA with pairwise post hoc comparisons by the method of Bonferroni.
Figure 3Heat map of 25 variables and the results of hierarchical clustering for origins
Correlation analysis for chemical composition and region
| GA | EGC | C | CAF | EC | EGCG | ECG | Asp | ||
| Group 1 | −0.319 | −0.250 | −0.139 | −0.317 | −0.039 | −0.527 | −0.451 | 0.538 | |
| Group 2 | −0.575 | 0.130 | −0.041 | −0.208 | 0.126 | −0.467 | −0.405 | 0.313 | |
| Ser | Glu | Gly | Phe | His | Arg | Thr | Ala | ||
| Group 1 | 0.811 | 0.125 | 0.221 | 0.877 | 0.346 | 0.191 | 0.830 | 0.536 | |
| Group 2 | 0.405 | 0.098 | 0.145 | 0.430 | 0.163 | −0.110 | 0.464 | 0.363 | |
| Pro | Cys | Tyr | Val | Met | Lys | Ile | Leu | Thea | |
| Group 1 | 0.367 | 0.519 | 0.758 | 0.887 | −0.471 | 0.820 | 0.897 | 0.893 | −0.131 |
| Group 2 | 0.042 | 0.270 | 0.456 | 0.481 | −0.779 | 0.370 | 0.484 | 0.450 | −0.136 |
Pearson correlation coefficient was used for analysis. Group 1 means IMGP and OMGP; Group 2 means JZGP, YAGP, and HSGP.
Correlation is significant at a 0.05 level (two‐tailed);
Correlation is significant at a 0.01 level (two‐tailed).
Figure 4Principal component analysis (PCA) scoring plots for IMGP, OMGP (a) and JZGP, YAGP, HSGP (b)
Results of factor selection and optimization
| Methods | Screening criteria | Factors | |
|---|---|---|---|
| Inner and outer mountains | Counties | ||
| Original factors | All components being tested | Polyphenols, amino acids, CAF | Polyphenols, amino acids, CAF |
| HCA | Euclidean distance | C, EC, ECG, Asp, Glu, Arg, Thea | EGC, C, Caf, EC, ECG, Thea, Arg, Asp, Glu |
| Correlation analysis | Pearson correlation coefficient | EGCG, Asp, Phe, Thr, Ala, Cys, Ser, Tyr, Val, Lys, Ile, Leu | GA, EGCG, ECG, Ser, Phe, Thr, Tyr, Val, Met, Ile, Leu |
| PCA | The principal components | GA, C, EGC, EGCG, ECG | GA, C, EGC, EGCG, ECG |
Results of SVM with different factors
| Samples | Factors selected (amount) | Correct discrimination rate | |
|---|---|---|---|
| Training set | Prediction set | ||
| Inner and outer mountains | Origin (25) | 100% (49/49) | 91.67% (22/24) |
| Origin—HCA (7) | 100% (49/49) | 95.83% (23/24) | |
| Origin—correlation analysis (12) | 97.96% (48/49) | 83.33% (20/24) | |
| Origin—PCA (5) | 97.96% (48/49) | 95.83% (23/24) | |
| Counties | Origin (25) | 93.88% (46/49) | 95.83% (23/24) |
| Origin—HCA (9) | 87.76% (43/49) | 87.50% (21/24) | |
| Origin—correlation analysis (11) | 91.84% (45/49) | 79.17% (19/24) | |
| Origin—PCA (5) | 91.84% (45/49) | 95.83% (23/24) | |