| Literature DB >> 36010522 |
Haixia Sun1, Shujuan Zhang1, Rui Ren1, Jianxin Xue1, Huamin Zhao1.
Abstract
To solve the failure problem of the visible/near infrared (VIS/NIR) spectroscopy model, soluble solids content (SSC) detection for fresh jujubes cultivated in different modes was carried out based on the method of variable optimization and model update. Iteratively retained informative variables (IRIV) and successive projections algorithm (SPA) algorithms were used to extract characteristic wavelengths, and least square support vector machine (LS-SVM) was used to establish detection models. Compared with IRIV, IRIV-SPA achieved better performance. Combined with the offset properties of the wavelength, repeated wavelengths were removed, and wavelength recombination was carried out to create a new combination of variables. Using these fused wavelengths, the model was recalibrated based on the Euclidean distance between samples. The LS-SVM detection model of SSC was established using the update method of wavelength fusion-Euclidean distance. Good prediction results were achieved using the proposed model. The determination coefficient (R2), root mean square error (RMSE), and residual predictive deviation (RPD) of the test set on SSC of fresh jujubes cultivated in the open field were 0.82, 1.49%, and 2.18, respectively. The R2, RMSE, and RPD of the test set on SSC of fresh jujubes cultivated in the rain shelter were 0.81, 1.44%, and 2.17, respectively. This study realized the SSC detection of fresh jujubes with different cultivation and provided a method for the establishment of a robust VIS/NIR detection model for fruit quality, effectively addressing the industry need for identifying jujubes grown in the open field.Entities:
Keywords: cultivation; fresh jujube; model update; variable fusion; visible/near infrared spectrum
Year: 2022 PMID: 36010522 PMCID: PMC9407388 DOI: 10.3390/foods11162522
Source DB: PubMed Journal: Foods ISSN: 2304-8158
Figure 1Mean spectrum of fresh jujubes. (a) Spectral curves of open-field cultivation, (b) spectral curves of rain-shelter cultivation, and (c) average spectrum.
Statistics of soluble solids content (%).
| Cultivation | Data Set | Maximum | Minimum | Mean | Standard Deviations |
|---|---|---|---|---|---|
| Open field | Total samples | 35.5 | 21.2 | 26.59 | 2.78 |
| Calibration set | 35.5 | 21.2 | 26.55 | 2.86 | |
| Prediction set | 32.5 | 22 | 26.69 | 2.57 | |
| Rain shelter | Total samples | 37.4 | 21.8 | 28.81 | 3.10 |
| Calibration set | 37.4 | 21.8 | 28.75 | 3.18 | |
| Prediction set | 34.2 | 24.3 | 28.97 | 2.84 |
Variable classification rules of iteratively retains informative variables (IRIV) [37].
| Variable Class | Classification Rules |
|---|---|
| Interfering variable | DMEANi > 0, Pi < 0.05 |
| Uninformative variable | DMEANi > 0, Pi > 0.05 |
| Strongly informative variable | DMEANi < 0, Pi < 0.05 |
| Weakly informative variable | DMEANi < 0, Pi > 0.05 |
Prediction results of SSC using different pre-processing methods.
| Pretreatment | Prediction Set | Rc2 | RMSEC (%) | Rp2 | RMSEP (%) | RPD |
|---|---|---|---|---|---|---|
| No-pretreatment | Open field | 0.84 | 1.15 | 0.80 | 1.14 | 2.25 |
| Rain shelter | 0.59 | 2.54 | 1.12 | |||
| Baseline | Open field | 0.83 | 1.24 | 0.67 | 1.48 | 1.74 |
| Rain shelter | 0.47 | 2.77 | 1.03 | |||
| SG | Open field | 0.84 | 1.15 | 0.80 | 1.14 | 2.25 |
| Rain shelter | 0.58 | 2.54 | 1.12 |
Figure 2Number of retained variables.
Figure 3DMEAN and P–values of the nonparametric Mann–Whitney U test on variable.
Figure 4Selection of characteristic wavelength using IRIV based on open-field cultivation.
Prediction results of SSC using IRIV and IRIV-SPA.
| Variable Selection Methods | Number of Wavelengths | Prediction Set | Rc2 | RMSEC (%) | Rp2 | RMSEP (%) | RPD |
|---|---|---|---|---|---|---|---|
| IRIV | 71 | Open field | 0.93 | 0.76 | 0.85 | 1.02 | 2.52 |
| Rain shelter | 0.71 | 2.50 | 1.14 | ||||
| IRIV-SPA | 10 | Open field | 0.82 | 1.23 | 0.79 | 1.20 | 2.14 |
| Rain shelter | 0.65 | 3.33 | 0.85 |
Figure 5Selection of characteristic wavelength using IRIV based on rain-shelter cultivation.
Figure 6Fusion of characteristic wavelength.
Figure 7RMSECV distribution in different numbers of samples.
Prediction results of SSC after model update.
| Model Update | Prediction Set | Rc2 | RMSEC (%) | Rp2 | RMSEP (%) | RPD |
|---|---|---|---|---|---|---|
| No update | Open field | 0.82 | 1.23 | 0.79 | 1.20 | 2.14 |
| Rain shelter | 0.65 | 3.33 | 0.85 | |||
| Wavelength fusion | Open field | 0.85 | 1.10 | 0.80 | 1.17 | 2.20 |
| Rain shelter | 0.69 | 2.96 | 0.96 | |||
| Wavelength fusion-Euclidean distance | Open field | 0.88 | 1.02 | 0.79 | 1.17 | 2.20 |
| Rain shelter | 0.81 | 1.35 | 2.10 |
Figure 8Detection results of SSC after model update using wavelength fusion-Euclidean distance. (a) Prediction set results; (b) test set results.