| Literature DB >> 27420074 |
You Wang1, Jiacheng Miao2, Xiaofeng Lyu3, Linfeng Liu4, Zhiyuan Luo5, Guang Li6.
Abstract
In the application of electronic noses (E-noses), probabilistic prediction is a good way to estimate how confident we are about our prediction. In this work, a homemade E-nose system embedded with 16 metal-oxide semi-conductive gas sensors was used to discriminate nine kinds of ginsengs of different species or production places. A flexible machine learning framework, Venn machine (VM) was introduced to make probabilistic predictions for each prediction. Three Venn predictors were developed based on three classical probabilistic prediction methods (Platt's method, Softmax regression and Naive Bayes). Three Venn predictors and three classical probabilistic prediction methods were compared in aspect of classification rate and especially the validity of estimated probability. A best classification rate of 88.57% was achieved with Platt's method in offline mode, and the classification rate of VM-SVM (Venn machine based on Support Vector Machine) was 86.35%, just 2.22% lower. The validity of Venn predictors performed better than that of corresponding classical probabilistic prediction methods. The validity of VM-SVM was superior to the other methods. The results demonstrated that Venn machine is a flexible tool to make precise and valid probabilistic prediction in the application of E-nose, and VM-SVM achieved the best performance for the probabilistic prediction of ginseng samples.Entities:
Keywords: Venn machine; electronic nose; ginseng; probabilistic prediction; support vector machine
Mesh:
Year: 2016 PMID: 27420074 PMCID: PMC4970134 DOI: 10.3390/s16071088
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Details of the ginseng samples.
| No. | Ginseng Species | Places of Production |
|---|---|---|
| 1 | Chinese red ginseng | Ji’an |
| 2 | Chinese red ginseng | Fusong |
| 3 | Korean red ginseng | Ji’an |
| 4 | Chinese white ginseng | Ji’an |
| 5 | Chinese white ginseng | Fusong |
| 6 | American ginseng | Fusong |
| 7 | American ginseng | USA |
| 8 | American ginseng | Canada |
| 9 | American ginseng | Tonghua |
Figure 1Typical responses of 16 sensors to ginseng samples.
Classification rates and assessment criteria of validity of probabilistic prediction results for ginseng samples by each method.
| Methods | Classification Rate | Assessment Criteria of Validity | ||
|---|---|---|---|---|
| dln | dsq | d1 | ||
| VM-SVM | 86.35% | |||
| Platt’s method | 0.3876 | 0.3439 | 0.1480 | |
| VM-SR | ||||
| SR | 76.19% | Inf a | 0.4376 | 0.1853 |
| VM-NB | ||||
| NB | 40.32% | 0.5851 | 0.4510 | 0.0332 |
a In the prediction result of SR, predicted probability value of certain sample was 1, whereas the prediction was wrong, which lead this criteria to be infinite.
Sensitivity and specificity for each category with each method.
| Method/Category | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | |
|---|---|---|---|---|---|---|---|---|---|---|
| VM-SVM | Sensitivity | 0.9714 | 0.8857 | 1 | 0.8571 | 0.8286 | 1 | 0.6857 | 0.8286 | 0.7143 |
| Specificity | 0.9857 | 0.9893 | 0.9964 | 0.9821 | 0.9786 | 1 | 0.9536 | 0.9857 | 0.9750 | |
| Platt’s method | Sensitivity | 0.9714 | 0.8857 | 1 | 0.8571 | 0.8857 | 1 | 0.7143 | 0.8571 | 0.7714 |
| Specificity | 0.9857 | 0.9893 | 0.9964 | 0.9893 | 0.9821 | 1 | 0.9607 | 0.9857 | 0.9786 | |
| VM-SR | Sensitivity | 0.8000 | 0.8286 | 0.9429 | 0.7429 | 0.5714 | 1 | 0.5143 | 0.8000 | 0.8000 |
| Specificity | 0.9714 | 0.9714 | 1 | 0.9536 | 0.9643 | 0.9964 | 0.9571 | 0.9750 | 0.9607 | |
| SR | Sensitivity | 0.7429 | 0.7429 | 0.9714 | 0.7143 | 0.6000 | 1 | 0.5429 | 0.8000 | 0.7429 |
| Specificity | 0.9714 | 0.9714 | 1 | 0.9536 | 0.9643 | 0.9964 | 0.9571 | 0.9750 | 0.9607 | |
| VM-NB | Sensitivity | 0.8000 | 0 | 0.9714 | 0.6571 | 0.4286 | 0.9429 | 0.4286 | 0.4857 | 0.7143 |
| Specificity | 0.8464 | 0.9786 | 0.9929 | 0.9500 | 0.9321 | 1 | 0.9429 | 0.9714 | 0.9393 | |
| NB | Sensitivity | 0 | 0.3143 | 0.9714 | 0.2857 | 0.286 | 0.6857 | 0.571 | 0.3714 | 0.1429 |
| Specificity | 1 | 0.9607 | 0.9929 | 0.8750 | 0.8786 | 0.9107 | 0.8929 | 0.8679 | 0.9500 |
Figure 2Validity of probabilistic predictions by (a) VM-SVM; (b) VM-SR; (c) VM-NB; (d) Platt’s method; (e) Softmax Regression; and (f) Naïve Bayes in offline mode.
Figure 3Validity of probabilistic predictions by (a) VM-SVM (Venn machine based on SVM); (b) VM-SR (Venn machine based on Softmax Regression); (c) VM-NB (Venn machine based on Naïve Bayes) in online mode.
Figure 4Change of precision of predicted probability intervals for samples from category (1–9) during online prediction process with VM-SVM.