| Literature DB >> 35062374 |
Dominique Albert-Weiss1, Ahmad Osman1,2.
Abstract
A pivotal topic in agriculture and food monitoring is the assessment of the quality and ripeness of agricultural products by using non-destructive testing techniques. Acoustic testing offers a rapid in situ analysis of the state of the agricultural good, obtaining global information of its interior. While deep learning (DL) methods have outperformed state-of-the-art benchmarks in various applications, the reason for lacking adaptation of DL algorithms such as convolutional neural networks (CNNs) can be traced back to its high data inefficiency and the absence of annotated data. Active learning is a framework that has been heavily used in machine learning when the labelled instances are scarce or cumbersome to obtain. This is specifically of interest when the DL algorithm is highly uncertain about the label of an instance. By allowing the human-in-the-loop for guidance, a continuous improvement of the DL algorithm based on a sample efficient manner can be obtained. This paper seeks to study the applicability of active learning when grading 'Galia' muskmelons based on its shelf life. We propose k-Determinantal Point Processes (k-DPP), which is a purely diversity-based method that allows to take influence on the exploration within the feature space based on the chosen subset k. While getting coequal results to uncertainty-based approaches when k is large, we simultaneously obtain a better exploration of the data distribution. While the implementation based on eigendecomposition takes up a runtime of O(n3), this can further be reduced to O(n·poly(k)) based on rejection sampling. We suggest the use of diversity-based acquisition when only a few labelled samples are available, allowing for better exploration while counteracting the disadvantage of missing the training objective in uncertainty-based methods following a greedy fashion.Entities:
Keywords: active learning; agriculture; deep learning; quality monitoring
Mesh:
Year: 2022 PMID: 35062374 PMCID: PMC8780071 DOI: 10.3390/s22020414
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Spectrogram of a sample (a) before processing and (b) after processing based on spectral subtraction by using a SCNR algorithm with parameters and . As shown in panel (b), a reduction of the low frequency noise is achieved without degrading the signal of interest. A side effect is the generation of artefacts by masking new frequencies at different time periods. As the signal of interest is of short period, the artefacts have little impact. The green box in panel (a) resembles the crop towards the signal of interest.
Overview of the number of data within the dataset split in into training, test and validation sets. Based on the summation of all classes consists of 0.6, of 0.25 and of 0.15 based of the total amount of data.
| Class |
|
|
|
|---|---|---|---|
| 1 | 968 | 429 | 259 |
| 2 | 1086 | 422 | 228 |
| 3 | 582 | 231 | 163 |
| 4 | 1050 | 454 | 272 |
Figure 2Visualisation of the active learning cycle for a DL setting.
Figure 3Visualisation of the DL architecture. The information extraction of the amplitude and phase results based on four convolutional modules with descending filter size towards the output.
Figure 4Active learning curves with error bounds for (a) accuracy, (b) loss, (c) recall and (d) precision.
Figure 5Active learning for k-DPP at values . Results are averaged over five iterations shown with error bounds for (a) accuracy, (b) loss, (c) precision and (d) recall.
Summary of the studied acquisition functions in comparison to the different metrics. Within the table we represent the average and the standard deviation for the results obtained after the last iteration. The best performing acquisition metric is highlighted in by bold caption.
| Acquisition Function | Accuracy | Loss | Precision | Recall |
|---|---|---|---|---|
| BALD | 0.7098 | 0.6935 | 0.7361 | 0.6667 |
| (0.1290) | (0.0228) | (0.0132) | (0.0131) | |
| least confidence | 0.7174 | 0.6760 | 0.7531 |
|
| (0.0083) | (0.0132) | (0.0116) |
| |
| 0.7260 |
| 0.7615 | 0.6504 | |
| (0.0107) |
| (0.0093) | (0.0221) | |
| margin sampling |
| 0.7391 |
| 0.6712 |
|
|
| ( | (0.014) | |
| ratio of confidence | 0.7283 | 0.7283 | 0.7596 | 0.6714 |
| (0.1827) | (0.209) | (0.0164) | (0.0161) | |
| random | 0.7135 | 0.7135 | 0.7509 | 0.6469 |
| (0.0220) | (0.0247) | (0.0277) | (0.0162) |
Summary of all parameters taking impact on the training objective.
| Objective | Parameters | Value |
|---|---|---|
| weight span [g] | 837.2–1555.3 | |
| difference of the room temperature [°C] | 18.4–22.9 | |
| Dataset | difference room humidity [%] | 20.77–49.37 |
| measurements on shelf life | {0, 7, 10, 15, 17, 62} | |
| Augmenation types | horizontal flipping, vertical flipping | |
| Preprocessing | Gain filter parameter | 9 |
| Gain filter parameter | 45 | |
| batch size | 64 | |
| learning rate | 0.005; exponential decay | |
| optimisation algorithm | SGD | |
| Hyperparameters | momentum | 0.9 |
| nestorov | activated | |
| clipping norm | 1.0 | |
| gradient clipping | 0.5 | |
| initial set | 30 | |
| Active Learning | initial set | 50 |
| {1, 40, 200} |