| Literature DB >> 35928849 |
Tao Pan1, Jiaqi Li1, Chunli Fu2, Nailiang Chang1, Jiemei Chen2.
Abstract
The identification of high-quality wine brands can avoid adulteration and fraud and protect the rights and interests of producers and consumers. Since the main components of wine are roughly the same, the characteristic components that can distinguish wine brands are usually trace amounts and not unique. The conventional quantitative detection method for brand identification is complicated and difficult. The naive Bayes (NB) classifier is an algorithm based on probability distribution, which is simple and particularly suitable for multiclass discriminant analysis. However, the absorbance probability between spectral wavelengths is not necessarily strongly independent, which limits the application of Bayes method in spectral pattern recognition. This research proposed a Bayes classifier algorithm based on wavelength optimization. First, a large-scale wavelength screening for equidistant combination (EC) was performed, and then wavelength step-by-step phase-out (WSP) was carried out to reduce the correlation between wavelengths and improve the accuracy of Bayes discrimination. The proposed EC-WSP-Bayes method was applied to the 5-category discriminant analysis of wine brand identification based on visible and near-infrared (Vis-NIR) spectroscopy. Among them, four types of wine brands were collected from regular sales channels as identification brands. The fifth type of samples was composed of 21 other commercial brand wines and home-brewed wines from various sources, as the interference brand. The optimal EC-WSP-Bayes model was selected, the corresponding wavelength combination was 404, 600, 992, 2,070, 2,266, and 2,462 nm located in the visible light, shortwave NIR, and combination frequency regions. In modeling and independent validation, the total recognition accuracy rate (RAR Total ) reached 98.1 and 97.6%, respectively. The technology is quick and easy, which is of great significance to regulate the alcohol market. The proposed model of less-wavelength and high-efficiency (N = 6) can provide a valuable reference for small special instruments. The proposed integrated chemometric method can reduce the correlation between wavelengths, improve the recognition accuracy, and improve the applicability of the Bayesian method.Entities:
Keywords: Bayes classifier; equidistant combination wavelength screening; multibrand identification; visible and near-infrared spectroscopy; wavelength step-by-step phase-out; wine
Year: 2022 PMID: 35928849 PMCID: PMC9344138 DOI: 10.3389/fnut.2022.796463
Source DB: PubMed Journal: Front Nutr ISSN: 2296-861X
Calibration-prediction-validation division for the spectra of five types of samples.
| I | II | III | IV | V | Total | |
| Calibration | 120 | 120 | 120 | 120 | 117 | 597 |
| Prediction | 90 | 90 | 90 | 90 | 108 | 468 |
| Validation | 90 | 90 | 90 | 90 | 108 | 468 |
| Total | 300 | 300 | 300 | 300 | 333 | 1533 |
FIGURE 1Schematic diagram of modeling framework.
FIGURE 2Average spectra of five types of wine and the spectrum of distilled water in the visible and near-infrared (Vis-NIR) region.
Recognition accuracy rates (%) of direct Bayes model in modeling.
| Method |
| RAR1 | RAR2 | RAR3 | RAR4 | RAR5 | RAR | RAR |
| Bayes | 1050 | 92.2% | 100.0% | 94.4% | 94.4% | 94.4% | 95.1% | 2.9% |
Recognition accuracy rates (%) of the optimal equidistant combination (EC)-Bayes model in modeling.
| Method |
|
|
|
| RAR1 | RAR2 | RAR3 | RAR4 | RAR5 | RAR | RAR |
| EC-Bayes | 404 | 2462 | 22 | 49 | 93.3% | 100.0% | 95.6% | 95.6% | 100.0% | 97.0% | 2.7% |
FIGURE 3Comparison of the top 10 equidistant combination (EC)-Bayes models and corresponding EC-wavelength step-by-step phase-out (WSP)-Bayes models: (A) RAR; (B) number of wavelengths.
FIGURE 4Position of the wavelength combination of the optimal equidistant combination (EC)-wavelength step-by-step phase-out (WSP)-Bayes model labeled in the average spectrum.
Recognition accuracy rates (%) of the optimal equidistant combination (EC)-wavelength step-by-step phase-out (WSP)-Bayes model in modeling.
| Method |
|
|
| RAR1 | RAR2 | RAR3 | RAR4 | RAR5 | RAR | RAR |
| EC-WSP-Bayes | 404 | 2462 | 6 | 94.4% | 100.0% | 100.0% | 95.6% | 100.0% | 98.1% | 2.8% |
FIGURE 5Total recognition accuracy rate (RAR) in the process of wavelength step-by-step phase-out (WSP) for the equidistant combination (EC)-Bayes model.
Recognition accuracy rates (%) of optimal equidistant combination (EC)-wavelength step-by-step phase-out (WSP)-Bayes model in validation.
| Method |
|
|
| RAR1 | RAR2 | RAR3 | RAR4 | RAR5 | RAR | RAR |
| EC-WSP-Bayes | 404 | 2462 | 6 | 93.3% | 100.0% | 97.8% | 100.0% | 97.2% | 97.6% | 2.7% |
FIGURE 6Identification for the spectra of the validation samples based on the optimal equidistant combination (EC)-wavelength step-by-step phase-out (WSP)-Bayes model.