| Literature DB >> 35454520 |
Zhongpeng Ji1,2, Zhiping He1,2, Yuhua Gui1,2, Jinning Li1,2, Yongjian Tan1,2, Bing Wu1,2, Rui Xu1,2, Jianyu Wang1,2.
Abstract
Near-infrared spectroscopy has been widely applied in various fields such as food analysis and agricultural testing. However, the conventional method of scanning the full spectrum of the sample and then invoking the model to analyze and predict results has a large amount of collected data, redundant information, slow acquisition speed, and high model complexity. This paper proposes a feature wavelength selection approach based on acousto-optical tunable filter (AOTF) spectroscopy and automatic machine learning (AutoML). Based on the programmable selection of sub nm center wavelengths achieved by the AOTF, it is capable of rapid acquisition of combinations of feature wavelengths of samples selected using AutoML algorithms, enabling the rapid output of target substance detection results in the field. The experimental setup was designed and application validation experiments were carried out to verify that the method could significantly reduce the number of NIR sampling points, increase the sampling speed, and improve the accuracy and predictability of NIR data models while simplifying the modelling process and broadening the application scenarios.Entities:
Keywords: AOTF; AutoML; feature wavelength selection; near infrared detection system
Year: 2022 PMID: 35454520 PMCID: PMC9030996 DOI: 10.3390/ma15082826
Source DB: PubMed Journal: Materials (Basel) ISSN: 1996-1944 Impact factor: 3.623
Figure 1Feature wavelength selection using the AutoGluon-Tabular architecture.
Figure 2AOTF spectroscopy schematic. (a) AOTF spectroscopy and wave vector diagram; (b) curve of AOTF driving frequency with diffraction wavelength.
Figure 3AOTF detection system and testing process. (a) schematic diagram of AOTF dual optical path and dual detector NIR spectrometer; P1, P2, polarizer; BS, adjustable beam splitter; D1, D2, InGaAs photodetector; (b) flow chart of AOTF detection based on AutoML.
The main performance parameters of the AOTF.
| Parameters | |
|---|---|
| Material | TeO2 |
| Spectral coverage/nm | 900–2400 |
| FWHM/nm | 3.75–8.4 @ < 1380 nm |
| 4.2–9.6 @ > 1380 nm | |
| RF/MHz | 45.25–128.55 |
| Angular aperture/° | >8 |
| Diffraction angle/° | >7.5 |
| Power/W | ~2 |
The fat contents of the milk samples.
| Brand | Number of Samples | Fat (%) |
|---|---|---|
| Item1 | 5 | 1.6 |
| Item2 | 5 | 0 |
| Item3 | 5 | 3.5 |
| Item4 | 5 | 3.9 |
| Item5 | 5 | 3.5 |
| Item6 | 5 | 3.0 |
| Item7 | 5 | 3.1 |
| Item8 | 5 | 3.0 |
| Item9 | 5 | 3.1 |
| all | 45 | -- |
ABV of baijiu samples.
| Brand | ABV 1 | Number of Samples | ABV Measured |
|---|---|---|---|
| Item1 | 42 | 14 | 42.0 |
| 50 | 49 | 50.2 | |
| 53 | 77 | 53.1 | |
| 56 | 47 | 56.2 | |
| Item2 | 56 | 22 | 55.8 |
| all | -- | 209 | -- |
1 ABV = alcohol by volume.
Figure 4Absorbance spectra of different samples. (a) milk samples; (b) alcohol samples.
Statistics on fat of milk samples.
| Data Sets | Number of Samples | Min | Max | Mean | STD |
|---|---|---|---|---|---|
| Total samples | 45 | 0 | 3.9 | 2.99 | 1.12 |
| Calibration set | 31 | 0 | 3.9 | 2.96 | 1.15 |
| Prediction set | 14 | 0 | 3.9 | 2.76 | 1.32 |
Statistics on ABV of baijiu samples.
| Data Sets | Number of Samples | Min | Max | Mean | STD |
|---|---|---|---|---|---|
| Total samples | 209 | 42 | 56.2 | 52.66 | 3.60 |
| Calibration set | 146 | 42 | 56.2 | 52.49 | 3.80 |
| Prediction set | 63 | 42 | 56.2 | 53.05 | 3.07 |
Figure 5Results of the “score val” (RMSEV) and permutation importance of the two samples. (a) “score val” (RMSEV) of milk samples. We also calculated RMSEP as “score test”. (b) degrees of permutation importance of the milk samples; (c) “score val” (RMSEV) of alcohol samples. We also calculated RMSEP as “score test”. (d) permutation importance of alcohol samples.
Results of feature wavelength selection.
| Samples | Selected Wavelengths (nm) |
|---|---|
| Milk | 1280, 1265, 2170, 1290, 1305, 1260, 1310, 2040 |
| Baijiu | 1250, 1665, 1710, 1405, 1430, 1425, 1485, 1415 |
The performance of the system for two datasets.
| Data Sets | Number of Variables | RMSEV | RMSEP | Sampling Duration |
|---|---|---|---|---|
| milk | 211 | 0.161 | 0.190 | 13.3 s |
| 8 | 0.143 | 0.180 | 0.6 s | |
| baijiu | 211 | 0.529 | 0.614 | 13.3 s |
| 8 | 0.403 | 0.507 | 0.6 s |