| Literature DB >> 31430909 |
Tao Liu1, Dongqi Li2, Jianjun Chen3, Yanbing Chen2, Tao Yang2, Jianhua Cao2.
Abstract
Drift correction is an important concern in Electronic noses (E-nose) for maintaining stable performance during continuous work. A large number of reports have been presented for dealing with E-nose drift through machine-learning approaches in the laboratory. In this study, we aim to counter the drift effect in more challenging situations in which the category information (labels) of the drifted samples is difficult or expensive to obtain. Thus, only a few of the drifted samples can be used for label querying. To solve this problem, we propose an innovative methodology based on Active Learning (AL) that selectively provides sample labels for drift correction. Moreover, we utilize a dynamic clustering process to balance the sample category for label querying. In the experimental section, we set up two E-nose drift scenarios-a long-term and a short-term scenario-to evaluate the performance of the proposed methodology. The results indicate that the proposed methodology is superior to the other state-of-art methods presented. Furthermore, the increasing tendencies of parameter sensitivity and accuracy are analyzed. In addition, the Label Efficiency Index (LEI) is adopted to measure the efficiency and labelling cost of the AL methods. The LEI values indicate that our proposed methodology exhibited better performance than the other presented AL methods in the online drift correction of E-noses.Entities:
Keywords: active learning; drift counteraction; dynamic clustering; electronic nose
Year: 2019 PMID: 31430909 PMCID: PMC6721181 DOI: 10.3390/s19163601
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Diagram of active learning.
Figure 2(a) Effects of traditional active learning; (b) effects of proposed AL-DC.
Brief of drift dataset.
| Batch IDs | Recording Time | Sample Category | Dimension | Sample Number |
|---|---|---|---|---|
| Batch 1 | Month 1–2 | 6 | 128 | 445 |
| Batch 2 | Month 3–4 and 8–10 | 6 | 128 | 1244 |
| Batch 3 | Month 11–13 | 5 | 128 | 1586 |
| Batch 4 | Month 14–15 | 5 | 128 | 161 |
| Batch 5 | Month 16 | 5 | 128 | 197 |
| Batch 6 | Month 17–20 | 6 | 128 | 2300 |
| Batch 7 | Month 21 | 6 | 128 | 3613 |
| Batch 8 | Month 22–23 | 6 | 128 | 294 |
| Batch 9 | Month 24 and 30 | 6 | 128 | 470 |
| Batch 10 | Month 36 | 6 | 128 | 3600 |
Figure 3(a) Data distribution of Batch 1; (b) data distribution of Batch 2; (c) data distribution of Batch 3; (d) data distribution of Batch 4&5; (e) data distribution of Batch 6; (f) data distribution of Batch 7; (g) data distribution of Batch 8; (h) data distribution of Batch 9; (i) data distribution of Batch 10.
Data allocation.
| Size of the Data Pool | Size of the Testing Samples | |
|---|---|---|
| Batch 2 | 623 | 1244 |
| Batch 3 | 794 | 1586 |
| Batch 4 | 81 | 161 |
| Batch 5 | 99 | 197 |
| Batch 6 | 1151 | 2300 |
| Batch 7 | 1807 | 3613 |
| Batch 8 | 148 | 294 |
| Batch 9 | 237 | 470 |
| Batch 10 | 1740 | 3480 |
Comparison of recognition accuracy in long-term drift (%).
| Type | Method | Batch 2 | Batch 3 | Batch 4 | Batch 5 | Batch 6 | Batch 7 | Batch 8 | Batch 9 | Batch 10 | Average |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Component correction | CC-LDA | 47.27 | 57.76 | 50.93 | 62.44 | 41.48 | 37.42 | 68.37 | 52.34 | 31.17 | 49.91 |
| CC-OSC | 88.10 | 66.71 | 54.66 | 53.81 | 65.13 | 63.71 | 36.05 | 40.21 | 40.08 | 56.50 | |
| Instancecorrection | TCTL+SEMI |
| 96.85 | 91.30 | 98.98 | 86.78 | 82.51 | 86.05 | 83.19 | 65.75 | 87.60 |
| DAELM-T (40) | 83.52 | 96.34 | 88.20 |
| 78.43 | 80.93 | 87.42 |
| 56.25 | 85.62 | |
| Label free correction | SVM-comgfk | 74.47 | 70.15 | 59.78 | 75.09 | 73.99 | 54.59 | 55.88 | 70.23 | 41.85 | 64.00 |
| ML-comgfk | 80.25 | 74.99 | 78.79 | 67.41 | 77.82 | 71.68 | 49.96 | 50.79 | 53.79 | 67.28 | |
| MFKS (20) | 85.45 | 77.96 | 88.65 | 83.61 | 89.38 | 68.80 | 84.67 | 78.66 | 42.54 | 77.75 | |
| DRCA | 89.15 | 92.69 | 87.58 | 95.94 | 86.52 | 60.25 | 62.24 | 72.34 | 52.00 | 77.63 | |
| Active learning | AL-ACR (10) | 90.03 | 83.67 | 92.55 | 98.48 | 78.09 | 74.54 | 92.86 | 74.89 | 62.16 | 83.03 |
| AL-ACR (20) | 89.23 | 84.17 | 96.89 |
| 76.43 | 62.66 | 89.46 | 95.11 | 64.89 | 84.26 | |
| AL-DC-US (10) | 90.35 |
|
|
| 78.83 | 71.99 | 92.18 | 81.06 | 65.80 | 86.54 | |
| AL-DC-QBC (10) | 90.03 | 82.41 | 77.02 | 74.62 | 73.43 | 53.47 | 29.93 | 65.74 | 63.13 | 67.76 | |
| AL-DC-ER (10) | 90.68 | 98.42 | 96.27 | 98.48 | 90.30 | 81.87 | 90.14 | 75.96 | 62.67 | 87.20 | |
| AL-DC-US (20) | 90.59 | 97.79 |
|
| 79.30 |
|
| 96.60 | 68.59 | 90.47 | |
| AL-DC-QBC (20) | 90.27 | 82.47 | 62.73 | 75.63 | 73.65 | 74.81 | 84.35 | 63.19 | 67.21 | 74.93 | |
| AL-DC-ER (20) | 90.19 | 98.87 |
|
|
| 87.30 | 88.44 | 95.96 |
|
|
Note: the bold in each column of the table denotes the highest recognition accuracy to certain batch, the same below.
Comparison of recognition accuracy in short-term drift (%).
| Framework | Method | 1→2 | 2→3 | 3→4 | 4→5 | 5→6 | 6→7 | 7→8 | 8→9 | 9→10 | Average |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Component correction | CC-LDA | 47.27 | 46.72 | 70.81 | 85.28 | 48.87 | 75.15 | 77.21 | 62.77 | 30.25 | 60.48 |
| Instance correction | DAELM-T (40) | 83.52 | 96.41 | 81.36 | 96.45 | 85.13 | 80.49 | 85.71 |
| 56.81 | 85.10 |
| Label free correction | SVM-comgfk | 74.47 | 73.75 | 78.51 | 64.26 | 69.97 | 77.69 | 82.69 | 85.53 | 17.76 | 69.40 |
| ML-comgfk | 80.25 | 98.55 | 84.89 | 89.85 | 75.53 | 91.17 | 61.22 | 95.53 | 39.56 | 79.62 | |
| DRCA | 89.15 | 98.11 | 95.03 | 69.54 | 50.87 | 78.94 | 65.99 | 84.04 | 36.31 | 74.22 | |
| Active learning | AL-ACR (10) | 90.03 | 99.12 | 85.71 | 98.98 | 54.17 | 97.15 | 88.78 | 95.32 | 52.59 | 84.65 |
| AL-ACR (20) | 89.87 | 98.87 | 85.73 | 98.98 | 76.27 | 97.23 | 94.22 | 87.44 |
| 88.84 | |
| AL-DC-US (10) |
| 99.31 | 98.76 | 98.98 | 96.96 | 98.42 | 94.56 | 87.66 | 50.34 | 90.66 | |
| AL-DC-QBC (10) | 90.59 | 99.12 | 98.76 | 98.48 | 75.48 | 93.16 | 92.18 | 78.72 | 51.58 | 86.45 | |
| AL-DC-ER (10) | 90.68 | 99.05 | 98.76 |
| 95.13 | 98.62 | 90.82 | 96.81 | 42.67 | 90.22 | |
| AL-DC-US (20) | 90.84 |
| 98.76 |
|
|
|
| 88.30 | 62.53 | 92.53 | |
| AL-DC-QBC (20) | 90.59 | 99.12 | 98.76 | 98.48 | 73.61 | 97.84 | 93.54 | 79.15 | 59.20 | 87.81 | |
| AL-DC-ER (20) | 90.59 | 99.31 |
|
| 98.09 | 98.84 |
| 98.30 | 64.14 |
|
Figure 4(a) Accuracy of AL-US-type methods in Setting 1; (b) accuracy of AL-QBC-type methods in Setting 1; (c) accuracy of AL-ER-type methods in Setting 1; (d) accuracy of AL-US-type methods in Setting 2; (e) accuracy of AL-QBC-type methods in Setting 2; (f) accuracy of AL-ER-type methods in Setting 2.
Figure 5(a) Accuracy fluctuation on US with ELM; (b) accuracy fluctuation on QBC with ELM; (c) accuracy fluctuation on ER with ELM; (d) accuracy fluctuation on US with SVM; (e) accuracy fluctuation on QBC with SVM; (f) accuracy fluctuation on ER with SVM.
LEI (%, Setting 1).
| Method | N = 11 | N = 12 | N = 13 | N = 14 | N = 15 | N = 16 | N = 17 | N = 18 | N = 19 | N = 20 | Optimal |
|---|---|---|---|---|---|---|---|---|---|---|---|
| AL-US | 38.46 | 41.61 | 44.33 | 42.20 | 43.82 | 44.74 | 44.46 |
| 44.30 | 44.46 | 45.00 ( |
| AL-DC-US | 43.83 | 43.15 | 44.22 | 45.05 | 44.01 | 46.43 | 43.80 | 46.15 | 45.79 |
| 46.53 ( |
| AL-QBC | 24.59 | 25.03 | 24.32 | 23.88 | 25.74 | 25.86 | 25.30 | 26.08 |
| 29.20 | 33.77 ( |
| AL-DC-QBC | 35.19 | 36.26 | 39.87 |
| 33.12 | 36.60 | 34.45 | 37.87 | 39.95 | 37.40 | 40.53 ( |
| AL-ER | 38.40 | 38.44 | 38.44 | 38.43 | 38.62 | 38.61 | 38.54 | 38.93 | 38.97 |
| 38.97 ( |
| AL-DC-ER | 43.61 | 44.80 | 44.12 | 45.64 | 44.87 |
| 44.32 | 43.89 | 44.92 | 43.42 | 46.42 ( |
LEI (%, Setting 2).
| Method | N = 11 | N = 12 | N = 13 | N = 14 | N = 15 | N = 16 | N = 17 | N = 18 | N = 19 | N = 20 | Optimal |
|---|---|---|---|---|---|---|---|---|---|---|---|
| AL-US | 43.07 | 45.57 | 45.59 | 45.55 | 46.27 | 46.37 | 46.44 | 46.42 |
| 46.53 | 46.68 ( |
| AL-DC-US | 45.15 | 45.16 | 45.36 | 45.53 | 46.11 | 46.40 | 46.74 |
| 46.73 | 46.73 | 46.82 ( |
| AL-QBC | 40.72 | 40.74 | 40.72 | 40.55 | 40.42 | 40.83 | 41.16 |
| 40.97 | 40.56 | 41.25 ( |
| AL-DC-QBC |
| 42.93 | 42.84 | 43.01 | 42.46 | 42.33 | 42.34 | 41.57 | 41.93 | 41.89 | 43.06 ( |
| AL-ER | 41.02 | 41.67 | 41.64 | 41.67 | 41.66 | 41.67 | 41.66 | 41.48 | 41.53 |
| 42.66 ( |
| AL-DC-ER | 45.29 | 46.36 | 46.29 | 46.60 | 46.62 |
| 46.59 | 46.54 | 46.58 | 46.58 | 46.63 ( |