| Literature DB >> 30181445 |
Xianghao Zhan1, Xiaoqing Guan2, Rumeng Wu3, Zhan Wang4, You Wang5, Guang Li6.
Abstract
As alternative herbal medicine gains soar in popularity around the world, it is necessary to apply a fast and convenient means for classifying and evaluating herbal medicines. In this work, an electronic nose system with seven classification algorithms is used to discriminate between 12 categories of herbal medicines. The results show that these herbal medicines can be successfully classified, with support vector machine (SVM) and linear discriminant analysis (LDA) outperforming other algorithms in terms of accuracy. When principal component analysis (PCA) is used to lower the number of dimensions, the time cost for classification can be reduced while the data is visualized. Afterwards, conformal predictions based on 1NN (1-Nearest Neighbor) and 3NN (3-Nearest Neighbor) (CP-1NN and CP-3NN) are introduced. CP-1NN and CP-3NN provide additional, yet significant and reliable, information by giving the confidence and credibility associated with each prediction without sacrificing of accuracy. This research provides insight into the construction of a herbal medicine flavor library and gives methods and reference for future works.Entities:
Keywords: conformal prediction; electronic nose; herbal medicine; reliability; support vector machine
Mesh:
Year: 2018 PMID: 30181445 PMCID: PMC6165400 DOI: 10.3390/s18092936
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Research steps.
Figure 2Physical appearances of the 12 categories of herbal medicine.
The response characteristics of sensors.
| No. | Sensor Type | Specific Response Sensitivity |
|---|---|---|
| 1 | TGS800 | Carbon monoxide, ethanol, methane, hydrogen, ammonia |
| 2 | TGS813 | Carbon monoxide, ethanol, methane, hydrogen, isobutane |
| 3 | TGS813 | Carbon monoxide, ethanol, methane, hydrogen, isobutane |
| 4 | TGS816 | Carbon monoxide, ethanol, methane, hydrogen, isobutane |
| 5 | TGS821 | Carbon monoxide, ethanol, methane, hydrogen |
| 6 | TGS822 | Carbon monoxide, ethanol, methane, acetone, n-hexane, |
| benzene, isobutane | ||
| 7 | TGS822 | Carbon monoxide, ethanol, methane, acetone, |
| n-Hexane, benzene, isobutane | ||
| 8 | TGS826 | Ammonia, trimethyl amine |
| 9 | TGS830 | Ethanol, R-12, R-11, R-22, R-113 |
| 10 | TGS832 | R-134a, R-12 and R-22, ethanol |
| 11 | TGS880 | Carbon monoxide, ethanol, methane, hydrogen, isobutane |
| 12 | TGS2620 | Methane, Carbon monoxide, isobutane, hydrogen |
| 13 | TGS2600 | Carbon monoxide, hydrogen |
| 14 | TGS2602 | Hydrogen, ammonia ethanol, hydrogen sulfide, toluene |
| 15 | TGS2610 | Ethanol, hydrogen, methane, isobutane/propane |
| 16 | TGS2611 | Ethanol, hydrogen, isobutane, methane |
Figure 3Layout of self-assembled E-nose system.
Figure 4Experiment process for one single sample.
Figure 5Sensor responses to Astragalus given by the E-nose system (voltage (v) versus time (0.01 s)).
Classification performances of different algorithms.
| Prediction Tasks and Algorithms | DT | KNN | LDA | SVM | NB | BP (Back Propagation) |
|---|---|---|---|---|---|---|
| 12 Categories of herbal medicine | 92.17% | 91.67% | 98.33% | 98.94% | 91.33% | 90.83% |
Prediction accuracy of SVM with different kernels in offline mode.
| Task and SVM Kernel | Linear | Quadratic | MLP (Multilayer Perceptron Kernel) | RBF (Radial Basis Function) |
|---|---|---|---|---|
| 12 TCM discrimination | 98.94% | 98.92% | 82.51% | 93.69% |
Prediction accuracy of KNN with parameter k in offline mode.
| The K of KNN | 1 | 3 | 5 | 7 | 9 |
|---|---|---|---|---|---|
| 12 TCM discrimination | 91.67% | 91.50% | 90.17% | 90.00% | 88.50% |
PCA analysis in terms of accuracy and time cost.
|
|
|
|
|
|
|
|
| Accuracy | 92.17% | 91.67% | 91.50% | 98.33% | 98.94% | 91.33% |
| Time(s) | 36.605 | 0.277 | 0.293 | 37.987 | 967.555 | 166.992 |
|
|
|
|
|
|
|
|
| Accuracy | 81.83% | 91.17% | 90.67% | 95.50% | 97.64% | 87.50% |
| Time(s) | 15.208 | 0.122 | 0.152 | 31.759 | 695.299 | 48.531 |
|
|
|
|
|
|
|
|
| Accuracy | 82.33% | 87.67% | 87.67% | 85.00% | 87.32% | 84.50% |
| Time(s) | 6.984 | 0.081 | 0.084 | 29.778 | 252.202 | 17.679 |
Figure 6Distribution of samples after PCA.
Conformal prediction accuracy in offline mode.
| Prediction Tasks | CP-1NN | CP-3NN | 1NN | 3NN |
|---|---|---|---|---|
| 12 categories of herbal medicines | 91.50% | 92.17% | 91.67% | 91.50% |
Five typical individual predictions for 12 herbal medicine classifications with CP-1NN.
| Sample Index | True Label | Forced Prediction | Confidence | Credibility |
|---|---|---|---|---|
| 5 | 1 (Astragalus) | 1 (Astragalus) | 0.9950 | 0.7433 |
| 233 | 5 (Radix Angelicae Pubescentis) | 5 (Radix Angelicae Pubescentis) | 0.9883 | 0.4650 |
| 384 | 8 (Codonopsis Pilosula) | 10 (Ligusticum Chuanxiong Hort) | 0.9400 | 0.1317 |
| 478 | 10 (Ligusticum Chuanxiong Hort) | 8 (Codonopsis Pilosula) | 0.9183 | 0.0867 |
| 512 | 11 (Radix Peucedani) | 11 (Radix Peucedani) | 0.9950 | 0.7383 |
Figure 7Confidence values and credibility levels for 12 herbal medicine classifications with CP-1NN.
Figure 8Confidence and credibility for 12 herbal medicine classifications with CP-3NN.