| Literature DB >> 31878107 |
Andrzej Szczurek1, Monika Maciejewska1, Beata Bąk2, Jakub Wilk2, Jerzy Wilde2, Maciej Siuda2.
Abstract
The study focused on a method of detection for bee colony infestation with the Varroa destructor mite, based on the measurements of the chemical properties of beehive air. The efficient detection of varroosis was demonstrated. This method of detection is based on a semiconductor gas sensor array and classification module. The efficiency of detection was characterized by the true positive rate (TPR) and true negative rate (TNR). Several factors influencing the performance of the method were determined. They were: (1) the number and kind of sensors, (2) the classifier, (3) the group of bee colonies, and (4) the balance of the classification data set. Gas sensor array outperformed single sensors. It should include at least four sensors. Better results of detection were attained with a support vector machine (SVM) as compared with the k-nearest neighbors (k-NN) algorithm. The selection of bee colonies was important. TPR and TNR differed by several percent for the two examined groups of colonies. The balance of the classification data was crucial. The average classification results were, for the balanced data set: TPR = 0.93 and TNR = 0.95, and for the imbalanced data set: TP = 0.95 and FP = 0.53. The selection of bee colonies and the balance of classification data set have to be controlled in order to attain high performance of the proposed detection method.Entities:
Keywords: Varroa destructor; classification; gas sensor; honey bee
Year: 2019 PMID: 31878107 PMCID: PMC6983005 DOI: 10.3390/s20010117
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Bee colonies examined in the study, their Varroa destructor infestation rates and the number of valid measurement results. Two identical measurement devices were used, one instrument per one group of bee colonies (A and B).
| Bee Colonies Group A | Bee Colonies Group B | ||||
|---|---|---|---|---|---|
| ID | Number of Valid Measurement Results | ID | Number of Valid Measurement Results | ||
| 1A | 24.76 | 45 | 1B | 2.36 | 29 |
| 2A | 0.00 | 47 | 2B | 0.00 | 6 |
| 3A | 3.45 | 6 | 3B | 6.50 | 23 |
| 4A | 2.20 | 123 | 4B | 2.04 | 22 |
| 5A | 0.00 | 33 | 5B | 0.00 | 19 |
| 6A | 1.30 | 31 | 6B | 1.16 | 14 |
| 7A | 2.21 | 70 | 7B | 3.40 | 50 |
| 8A | 0.00 | 66 | 8B | 0.00 | 39 |
| 9A | 2.00 | 45 | 9B | 1.13 | 92 |
Figure 1The instrument based on gas sensors. (a) General view, in casing; (b) Block diagram of multichannel recorder of gas sensor signals, model MCA-8; (c) Block diagram of power supply.
Figure 2(a) Sampling probe; (b) sampling probe placement in the upper part of the beehive—top view.
Confusion matrix [28].
| Predicted Negative | Predicted Positive | |
|---|---|---|
| Actual Negative | True negative (TN) | False positive (FP) |
| Actual Positive | False negative (FN) | True positive (TP) |
Figure 3Gas sensor signals recorded during measurements of beehives occupied by two exemplary bee colonies. (a) Colony A1—not infested by Varroa destructor (see Table 1); (b) colony A2—infested at the rate of 25.76% (see Table 1). Differential baseline correction was applied.
True positive rate (TPR) and true negative rate (TNR) for various options of classification. For each option of classification, we averaged the results obtained using different combinations of sensors as the source of input data.
| Option of Classification | Group A of Bee Colonies | Group B of Bee Colonies | ||
|---|---|---|---|---|
| Mean TPR | Mean TNR | Mean TPR | Mean TNR | |
| SVM, balanced | 0.8701 | 0.9069 | 0.9362 | 0.9506 |
| k-NN, balanced | 0.7953 | 0.8924 | 0.8604 | 0.9513 |
| SVM, imbalanced | 0.9505 | 0.4993 | 0.9889 | 0.4507 |
| k-NN, imbalanced | 0.8886 | 0.5756 | 0.9543 | 0.5274 |
p-values resulting from of the analysis of variance, which examined the significance of the kind of classifier as a factor that influences classification performance.
| Compared Options of Classification | Group A of Bee Colonies | Group B of Bee Colonies | ||
|---|---|---|---|---|
| SVM, balanced vs. k-NN, balanced | 6.71 × 10−25 | 2.67 × 10−3 | 6.71 × 10−25 | 2.67 × 10−3 |
| SVM, imbalanced vs. k-NN, imbalanced | 2.31 × 10−38 | 8.35 × 10−7 | 2.31 × 10−38 | 8.35 × 10−7 |
p-values resulting from of the analysis of variance, which examined the significance of the balance of classification data set as a factor that influences classification performance.
| Compared Options of Classification | Group A of Bee Colonies | Group B of Bee Colonies | ||
|---|---|---|---|---|
| SVM, balanced vs. SVM, imbalanced | 2.17 × 10−36 | 5.17 × 10−65 | 2.17 × 10−36 | 5.17 × 10−65 |
| k-NN, balanced vs. k-NN, imbalanced | 8.65 × 10−39 | 4.83 × 10−62 | 8.65 × 10−39 | 4.83 × 10−62 |
p-values resulting from of the analysis of variance, which examined the significance of the group of bee colonies, for which the measurement data was collected as a factor that influences classification performance.
| Option of Classification Compared Between Groups of Bee Colonies | ||
|---|---|---|
| SVM, balanced | 2.47 × 10−19 | 2.87 × 10−17 |
| k-NN, balanced | 1.89 × 10−12 | 1.11 × 10−28 |
| SVM, imbalanced | 4.67 × 10−35 | 3.53 × 10−2 |
| k-NN, imbalanced | 4.58 × 10−31 | 2.06 × 10−3 |
Figure 4(a) True positive rate (TPR) and true negative rate (TNR) for classification options using the measurement data collected for group A of bee colonies; (b) TPR and TNR for classification options using the measurement data collected for group B of bee colonies. A mean ± standard deviation was shown for each group of models utilizing various combinations of sensors as the source of input data, while the size of combination was fixed. TPR is displayed with a blue line and TNR is displayed with a red line.
p-values resulting from multiple comparisons, which examined the significance of the number of sensors used as sources of classification data, as a factor that influences classification performance.
| Compared Numbers of Sensors | Group A of Bee Colonies | Group B of Bee Colonies | ||
|---|---|---|---|---|
| 1 vs. 2 | 0.002 | 0.000 | 0.000 | 0.130 |
| 1 vs. 3 | 0.003 | 0.000 | 0.000 | 0.000 |
| 1 vs. 4 | 0.000 | 0.000 | 0.000 | 0.000 |
| 1 vs. 5 | 0.000 | 0.000 | 0.000 | 0.001 |
| 2 vs. 3 | 0.997 | 0.005 | 0.000 | 0.052 |
| 2 vs. 4 | 0.000 | 0.010 | 0.000 | 0.000 |
| 2 vs. 5 | 0.000 | 0.000 | 0.000 | 0.060 |
| 3 vs. 4 | 0.000 | 1.000 | 0.927 | 0.059 |
| 3 vs. 5 | 0.000 | 0.146 | 0.099 | 0.932 |
| 4 vs. 5 | 0.770 | 0.170 | 0.348 | 0.761 |
Figure 5Relationship between the true positive rate and the false positive rate for classification of bee colony infestation by Varroa destructor based on gas sensor array responses, using support-vector machine and balanced data sets. Various combinations of sensors were considered, their size was n. (a) Results based on the measurement data collected for bee colony group A. (b) Results based on the measurement data collected for bee colony group B.