| Literature DB >> 34066453 |
Francesca Venturini1,2, Michela Sperti3, Umberto Michelucci2,4, Ivo Herzig1, Michael Baumgartner1, Josep Palau Caballero5, Arturo Jimenez5, Marco Agostino Deriu3.
Abstract
Extra virgin olive oil (EVOO) is the highest quality of olive oil and is characterized by highly beneficial nutritional properties. The large increase in both consumption and fraud, for example through adulteration, creates new challenges and an increasing demand for developing new quality assessment methodologies that are easier and cheaper to perform. As of today, the determination of olive oil quality is performed by producers through chemical analysis and organoleptic evaluation. The chemical analysis requires advanced equipment and chemical knowledge of certified laboratories, and has therefore limited accessibility. In this work a minimalist, portable, and low-cost sensor is presented, which can perform olive oil quality assessment using fluorescence spectroscopy. The potential of the proposed technology is explored by analyzing several olive oils of different quality levels, EVOO, virgin olive oil (VOO), and lampante olive oil (LOO). The spectral data were analyzed using a large number of machine learning methods, including artificial neural networks. The analysis performed in this work demonstrates the possibility of performing the classification of olive oil in the three mentioned classes with an accuracy of 100%. These results confirm that this minimalist low-cost sensor has the potential to substitute expensive and complex chemical analysis.Entities:
Keywords: artificial neural networks; fluorescence sensor; fluorescence spectroscopy; machine learning; olive oil; quality control
Year: 2021 PMID: 34066453 PMCID: PMC8148140 DOI: 10.3390/foods10051010
Source DB: PubMed Journal: Foods ISSN: 2304-8158
The number of olive oils samples in each quality class. EVOO: Extra virgin olive oil, VOO: Virgin olive oil, and LOO: Lampante olive oil.
| Quality | Number of Samples |
|---|---|
| EVOO | 12 |
| VOO | 8 |
| LOO | 7 |
Figure 1Schematics of the minimalist fluorescence sensor. Blue: Excitation light, red: Fluorescence light.
Figure 2Photo of the minimalist fluorescence sensor with olive oil samples in the glass vials and a bottle of olive oil.
List of the machine learning methods used, with implementation parameters and references to the methods description: Support vector machine (SVM), naïve Bayes (NB), multinomial logistic regression (MLR), principal component analysis (PCA) and linear discriminant analysis (LDA), decision tree (DT), random forest (RF), and k-nearest neighbor (k-NN).
| Algorithm | Implementation Details | References |
|---|---|---|
| SVM | Regularization parameter | [ |
| NB | None | [ |
| MLR | regularization penalty = l2, solver algorithm = Newton conjugate gradient | [ |
| PCA + LDA | Number of components used with LDA: 2, 3, 4, 5, 10, 15, 20, 25, and 30 | [ |
| DT | Split quality criterion used = Gini impurity | [ |
| RF | Number of trees = 100, split quality criterion used = Gini impurity | [ |
| k-NN | Numbers of neighbors | [ |
Figure 3Fluorescence emission spectra of selected olive oils. Panel (A) five EVOOs, panel (B) five VOOs, and panel (C) five LOOs. Each curve shows a single spectrum without averaging or smoothing.
Summary of results of the classification given by the average of the accuracy and its standard deviation ; machine learning methods: Support vector machine (SVM), naïve Bayes (NB), multinomial logistic regression (MLR), principal component analysis (PCA) and linear discriminant analysis (LDA), decision tree (DT), random forest (RF), and k-nearest neighbor (k-NN).
| Algorithm | Average Accuracy | Standard Deviation |
|---|---|---|
| SVM | 0.51 | 0.07 |
| NB | 0.64 | 0.05 |
| MLR | 0.88 | 0.03 |
| PCA + LDA | 0.93 | 0.02 |
| (10 PCA Components) | ||
| DT | 0.99 | 0.01 |
| ANN | 0.99 | 0.04 |
| PCA + LDA | 0.999 | 0.006 |
| (30 PCA Components) | ||
| RF | 1.0 | 0.0 |
| k-NN | 1.0 | 0.0 |
Figure 4Evolution of the average of the accuracy and its standard deviation with increasing ANN complexity. For each architecture the points indicate the average of the accuracy of 100 split and train runs, and the error lines indicate the standard deviation.