| Literature DB >> 35948651 |
Enguang Zuo1, Lei Sun2, Junyi Yan3, Cheng Chen4,5, Chen Chen6, Xiaoyi Lv1,3,7.
Abstract
Fennel contains many antioxidant and antibacterial substances, and it has very important applications in food flavoring and other fields. The kinds and contents of chemical substances in fennel vary from region to region, which can affect the taste and efficacy of the fennel and its derivatives. Therefore, it is of great significance to accurately classify the origin of the fennel. Recently, origin detection methods based on deep networks have shown promising results. However, the existing methods spend a relatively large time cost, a drawback that is fatal for large amounts of data in practical application scenarios. To overcome this limitation, we explore an origin detection method that guarantees faster detection with classification accuracy. This research is the first to use the machine learning algorithm combined with the Fourier transform-near infrared (FT-NIR) spectroscopy to realize the classification and identification of the origin of the fennel. In this experiment, we used Rubberband baseline correction on the FT-NIR spectral data of fennel (Yumen, Gansu and Turpan, Xinjiang), using principal component analysis (PCA) for data dimensionality reduction, and selecting extreme learning machine (ELM), Convolutional Neural Network (CNN), recurrent neural network (RNN), Transformer, generative adversarial networks (GAN) and back propagation neural network (BPNN) classification model of the company realizes the classification of the sample origin. The experimental results show that the classification accuracy of ELM, RNN, Transformer, GAN and BPNN models are above 96%, and the ELM model using the hardlim as the activation function has the best classification effect, with an average accuracy of 100% and a fast classification speed. The average time of 30 experiments is 0.05 s. This research shows the potential of the machine learning algorithm combined with the FT-NIR spectra in the field of food production area classification, and provides an effective means for realizing rapid detection of the food production area, so as to merchants from selling shoddy products as good ones and seeking illegal profits.Entities:
Mesh:
Year: 2022 PMID: 35948651 PMCID: PMC9365781 DOI: 10.1038/s41598-022-17810-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Neural network structure of ELM.
Figure 2The NIR spectrum of the fennel.
Waveband and composition distribution of major NIR peaks of the fennel.
| Wavenumber ( | Assignment | Substances related including in fennel |
|---|---|---|
| 4260 | O–H, N–H and C–O bands | Essential oils |
| 4330 | C–H of Lipids | Hydroxy fatty acids |
| 4830 | Amide | Proteins |
| 5155 | C–H, N–H, O–H of water molecule | Water |
| 5670 | – | Steroids |
| 5808 | – | Polyphenols |
| 6840 | polyamides | Flavonoids |
The contribution rates of the first six features in PCA.
| Principal component | Contribution rate(%) | Cumulative contribution rate(%) |
|---|---|---|
| 1 | 85.93 | 85.93 |
| 2 | 6.22 | 92.14 |
| 3 | 4.80 | 96.95 |
| 4 | 1.45 | 98.40 |
| 5 | 0.51 | 98.91 |
| 6 | 0.35 | 99.26 |
Figure 3Three-dimensional scatter plot of three principal components of the fennel FT-NIR spectroscopy.
Experimental performance of each model comparison results.
| Languages | Model | Sensitivity (%) | Specificity (%) | Accuracy (%) | Run time(s) | AUC |
|---|---|---|---|---|---|---|
| MATLAB | ELM-sig | 97.89 | 99.18 | 98.73 | 0.05 | 0.98 |
| ELM-sin | 97.63 | 99.51 | 98.84 | 0.04 | 0.98 | |
| BPNN- tansig | 95.92 | 97.47 | 96.91 | 0.51 | 0.96 | |
| BPNN- logsig | 98.07 | 97.17 | 97.43 | 0.44 | 0.97 | |
| Python | RNN | 100.00 | 100.00 | 100.00 | 2.01 | 1.00 |
| Transformer | 100.00 | 100.00 | 100.00 | 8.47 | 1.00 | |
| GAN | 100.00 | 100.00 | 100.00 | 126.22 | 1.00 | |
| BPNN | 98.75 | 93.33 | 97.89 | 0.45 | 0.96 | |
| CNN | 62.96 | 65.15 | 64.51 | 2.29 | 0.64 | |
Significant values are in [bold].
Figure 4AUC curves of each model.