Berta Baca-Bocanegra, Julio Nogales-Bueno, Francisco José Heredia, José Miguel Hernández-Hierro.
Abstract
Near infrared hyperspectral data were collected for 200 Syrah and Tempranillo grape seed samples. Next, a subset of samples was selected and its phenolic content was determined. Then, quantitative (modified partial least squares regression) and qualitative (K-means and linear discriminant analysis) chemometric tools were applied to obtain the best models for predicting the reference parameters. Quantitative models for the prediction of total phenolic and flavanolic contents were successfully developed, with standard errors of prediction (SEP) in external validation similar to those previously reported. For these parameters, the SEPs were, respectively, 11.23 mg g−1 of grape seed, expressed as gallic acid equivalents, and 4.85 mg g−1 of grape seed, expressed as catechin equivalents. Applying these models to the whole sample set (selected and non-selected samples) revealed the distributions of total phenolic and flavanolic contents in this set. Moreover, a discriminant function was calculated and applied to determine the phenolic extractability level of the samples. On average, this discriminant function correctly classified 76.92% of samples according to their extractability level. In this way, the basis for monitoring the phenolic state of grape seeds from their near infrared spectra has been established.
Keywords: chemometrics; extractability; flavanols; grape seeds; near infrared; phenolic compounds; total phenols; vibrational spectroscopy
Year: 2018 PMID: 30049946 PMCID: PMC6111751 DOI: 10.3390/s18082426
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1. Description of the procedure carried out for each sample, from raw hyperspectral image acquisition to obtaining the average spectrum.
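The final step of this procedure, reducing a hyperspectral image to one average spectrum per sample, can be sketched as follows. This is a minimal illustration only: the nested-list image layout, the boolean seed mask, and the function name `average_spectrum` are assumptions introduced here, and the record does not describe how the authors segmented seed pixels from the background.

```python
def average_spectrum(cube, mask):
    """Average the spectra of all pixels selected by the mask.

    cube[row][col] is a list of reflectance values, one per band (assumed layout).
    mask[row][col] is True for pixels that belong to the seed (assumed input).
    """
    n_bands = len(cube[0][0])
    totals = [0.0] * n_bands
    count = 0
    for row_pixels, row_mask in zip(cube, mask):
        for spectrum, keep in zip(row_pixels, row_mask):
            if keep:
                for band, value in enumerate(spectrum):
                    totals[band] += value
                count += 1
    if count == 0:
        raise ValueError("mask selects no pixels")
    return [total / count for total in totals]


# Toy 2x2 image with 3 bands; the top-right pixel is treated as background.
cube = [
    [[0.10, 0.20, 0.30], [0.90, 0.90, 0.90]],
    [[0.30, 0.40, 0.50], [0.20, 0.30, 0.40]],
]
mask = [[True, False], [True, True]]
mean_spectrum = average_spectrum(cube, mask)
```

Averaging over the masked pixels trades spatial detail for a single, less noisy spectrum per sample, which is the form of input a regression model on spectra would typically expect.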
Figure 2. Description of near infrared spectra. (a) Near infrared average raw spectra and standard deviations (10 times amplified) for Syrah and Tempranillo samples. (b) Scores of the grape samples in the space defined by the first and second PCs. (c) Scores of the calibration and validation samples in the space defined by PC1 and PC2.
Main statistical descriptors for reference parameters in calibration and validation sets.
| Set | Reference Parameter | Maximum | Mean | Minimum | SD 1 |
|---|---|---|---|---|---|
| Calibration | EPC 2 | 79.92 | 17.14 | 0.67 | 15.39 |
| | EFC 3 | 56.42 | 11.07 | 0.34 | 11.22 |
| | TPC 4 | 99.97 | 59.40 | 32.90 | 13.51 |
| | TFC 5 | 66.92 | 21.66 | 6.63 | 10.49 |
| | ETP 6 | 80.59 | 28.09 | 0.99 | 20.15 |
| | EF 7 | 89.16 | 42.60 | 2.20 | 25.76 |
| Validation | EPC 2 | 41.89 | 14.91 | 2.24 | 9.49 |
| | EFC 3 | 33.12 | 10.17 | 0.99 | 8.54 |
| | TPC 4 | 88.14 | 56.62 | 27.50 | 13.37 |
| | TFC 5 | 39.42 | 20.65 | 11.03 | 8.21 |
| | ETP 6 | 82.91 | 29.06 | 4.04 | 22.01 |
| | EF 7 | 93.04 | 44.23 | 7.80 | 25.33 |
1 SD: Standard deviation; 2 EPC: extractable total phenolic content (mg g−1 of grape seed, expressed as gallic acid equivalents); 3 EFC: extractable flavanolic content (mg g−1 of grape seed, expressed as catechin equivalents); 4 TPC: total phenolic content (mg g−1 of grape seed, expressed as gallic acid equivalents); 5 TFC: total flavanolic content (mg g−1 of grape seed, expressed as catechin equivalents); 6 ETP: extractability of total phenols (expressed as percentages); 7 EF: extractability of flavanols (expressed as percentages).
Main statistical descriptors for the MPLS models developed in the NIR zone close to 950–1650 nm.
| Spectral Pretreatment | Reference Parameters | T Outliers | PLS Factors | N 1 | Est. Min | SD 2 | Est. Max | SEC 3 | RSQ 4 | SECV 5 | SEP 6 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| None 2,10,10,1 | EPC 7 | 5 | 5 | 61 | 0.00 | 9.17 | 41.80 | 5.13 | 0.69 | 6.45 | 6.79 |
| None 2,10,10,1 | EFC 8 | 6 | 5 | 60 | 0.00 | 7.59 | 31.80 | 3.50 | 0.79 | 4.21 | 6.12 |
| SNV + Detrend 2,15,15,1 | TPC 9 | 2 | 8 | 64 | 23.22 | 11.64 | 93.08 | 7.38 | 0.60 | 8.41 | 11.23 |
| SNV + Detrend 1,5,5,1 | TFC 10 | 3 | 5 | 63 | 0.00 | 7.19 | 41.62 | 2.17 | 0.91 | 3.62 | 4.85 |
| None 2,15,15,1 | ETP 11 | 2 | 6 | 64 | 0.00 | 19.62 | 86.36 | 9.74 | 0.75 | 11.83 | 19.26 |
| None 2,15,15,1 | EF 12 | 0 | 6 | 66 | 0.00 | 25.76 | 119.87 | 13.50 | 0.73 | 16.67 | 23.47 |
1 N: number of samples (calibration set); 2 SD: standard deviation; 3 SEC: standard error of calibration; 4 RSQ: coefficient of determination (calibration set); 5 SECV: standard error of cross-validation (7 cross-validation groups); 6 SEP: standard error of prediction (external validation); 7 EPC: extractable total phenolic content (mg g−1 of grape seed, expressed as gallic acid equivalents); 8 EFC: extractable flavanolic content (mg g−1 of grape seed, expressed as catechin equivalents); 9 TPC: total phenolic content (mg g−1 of grape seed, expressed as gallic acid equivalents); 10 TFC: total flavanolic content (mg g−1 of grape seed, expressed as catechin equivalents); 11 ETP: extractability of total phenols (expressed as percentages); 12 EF: extractability of flavanols (expressed as percentages).
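To make the error statistics in this table concrete, the sketch below computes a root mean square error and a bias-corrected standard error of prediction from reference and predicted values. It is a hedged illustration: the record does not state which exact formula the chemometric software used for SEC, SECV, or SEP, so the bias-corrected form with an n − 1 denominator is an assumption (one common convention), and the data are toy values, not the study's measurements.

```python
import math


def rmse(reference, predicted):
    """Root mean square error between reference and predicted values."""
    residuals = [p - r for r, p in zip(reference, predicted)]
    return math.sqrt(sum(e * e for e in residuals) / len(residuals))


def bias_and_sep(reference, predicted):
    """Return (bias, SEP), with SEP computed around the mean residual.

    Assumed convention: SEP = sqrt(sum((e_i - bias)^2) / (n - 1)).
    """
    residuals = [p - r for r, p in zip(reference, predicted)]
    n = len(residuals)
    bias = sum(residuals) / n
    sep = math.sqrt(sum((e - bias) ** 2 for e in residuals) / (n - 1))
    return bias, sep


# Toy external-validation data (hypothetical values in mg per g of seed).
reference = [10.0, 20.0, 30.0, 40.0]
predicted = [12.0, 19.0, 33.0, 38.0]

toy_rmse = rmse(reference, predicted)
toy_bias, toy_sep = bias_and_sep(reference, predicted)
```

Under this convention a systematic offset is reported separately as bias instead of inflating the SEP, which is why the two error figures can differ for the same predictions.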
Figure 3. (a) Loading plots of the MPLS model for total phenolic content (TPC). (b) Loading plots of the MPLS model for total flavanolic content (TFC). (c) Standard errors of prediction, expressed as percentages, obtained in the external validation procedure for all MPLS models.
Extractability levels of total phenols and flavanols for grape seed samples allocated in calibration and validation sets. Means and standard deviations are shown.
| Set | Samples | N 1 | ETP 2 Mean | ETP 2 SD | EF 3 Mean | EF 3 SD |
|---|---|---|---|---|---|---|
| Calibration | All | 66 | 28.09 | 20.15 | 42.60 | 25.76 |
| | Low | 36 | 12.93 | 8.08 | 22.39 | 13.83 |
| | High | 30 | 46.30 | 14.24 | 66.84 | 11.87 |
| Validation | All | 26 | 29.06 | 22.01 | 44.23 | 25.33 |
| | Low | 14 | 13.76 | 6.20 | 24.73 | 10.71 |
| | High | 12 | 46.91 | 20.24 | 66.97 | 16.58 |
1 N: number of samples; 2 ETP: extractability of total phenols (expressed as percentages); 3 EF: extractability of flavanols (expressed as percentages).
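As a quick consistency check on this table, the "All" means should equal the sample-size-weighted means of the "Low" and "High" groups. The short sketch below recomputes them from the tabulated group sizes and means; the helper name `pooled_mean` is introduced here for illustration, and small differences in the second decimal are expected because the tabulated group means are already rounded.

```python
def pooled_mean(groups):
    """Weighted mean of group means; groups is a list of (n, mean) pairs."""
    total_n = sum(n for n, _ in groups)
    return sum(n * mean for n, mean in groups) / total_n


# (n, mean) pairs for the Low and High groups, taken from the table above.
calibration_etp = pooled_mean([(36, 12.93), (30, 46.30)])
calibration_ef = pooled_mean([(36, 22.39), (30, 66.84)])
validation_etp = pooled_mean([(14, 13.76), (12, 46.91)])
validation_ef = pooled_mean([(14, 24.73), (12, 66.97)])

for label, value in [
    ("Calibration ETP", calibration_etp),
    ("Calibration EF", calibration_ef),
    ("Validation ETP", validation_etp),
    ("Validation EF", validation_ef),
]:
    print(f"{label}: {value:.2f}")
```

The recomputed values agree with the tabulated "All" means (28.09, 42.60, 29.06, 44.23) to within rounding.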
Samples correctly classified by the LDA in the leave-one-out cross-validation and in the external validation. The linear discriminant function obtained is also shown.
| Samples | Leave-One-Out Cross-Validation: Samples Correctly Classified | Leave-One-Out Cross-Validation: % of Samples Correctly Classified | External Validation: Samples Correctly Classified | External Validation: % of Samples Correctly Classified |
|---|---|---|---|---|
| Low | 30/36 | 83.33 | 12/14 | 85.7 |
| High | 25/30 | 83.33 | 8/12 | 66.67 |
| All | 55/66 | 83.33 | 20/26 | 76.92 |
| Discriminant function | | | | |
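The percentages in this table follow directly from the classification counts, and the "All" row is the sum of the "Low" and "High" rows. The sketch below reproduces that arithmetic from the tabulated counts only; the dictionary layout and the helper name `percent_correct` are illustrative choices, not part of the source.

```python
def percent_correct(correct, total):
    """Percentage of correctly classified samples, rounded to two decimals."""
    return round(100.0 * correct / total, 2)


# (correct, total) counts per extractability class, taken from the table above.
leave_one_out = {"Low": (30, 36), "High": (25, 30)}
external = {"Low": (12, 14), "High": (8, 12)}


def overall(counts):
    """Sum the per-class counts to obtain the 'All' row."""
    correct = sum(c for c, _ in counts.values())
    total = sum(t for _, t in counts.values())
    return correct, total


loo_all = overall(leave_one_out)    # (55, 66)
ext_all = overall(external)         # (20, 26)

loo_rate = percent_correct(*loo_all)
ext_rate = percent_correct(*ext_all)
ext_low_rate = percent_correct(*external["Low"])
ext_high_rate = percent_correct(*external["High"])
```

The overall external validation rate of 76.92% is the figure quoted in the abstract. It combines a higher rate for low-extractability samples with a lower rate for high-extractability samples.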
Figure 4. Distributions of Syrah and Tempranillo grape seeds by total phenolic content (a,c) and total flavanolic content (b,d).
Figure 5. Representation of grape seed samples according to their predicted total phenolic content (TPC) and total flavanolic content (TFC). Samples are coded as (a) Syrah or Tempranillo samples or (b) samples with low or high phenolic extractability.