| Literature DB >> 28880228 |
Yungang Zhu1, Dayou Liu2, Radu Grosu3, Xinhua Wang4,5, Hongying Duan6, Guodong Wang7.
Abstract
Disease diagnosis can be performed based on fusing the data acquired by multiple medical sensors from patients, and it is a crucial task in sensor-based e-healthcare systems. However, it is a challenging problem that there are few effective diagnosis methods based on sensor data fusion for atrial hypertrophy disease. In this article, we propose a novel multi-sensor data fusion method for atrial hypertrophy diagnosis, namely, characterized support vector hyperspheres (CSVH). Instead of constructing a hyperplane, as a traditional support vector machine does, the proposed method generates "hyperspheres" to collect the discriminative medical information, since a hypersphere is more powerful for data description than a hyperplane. In detail, CSVH constructs two characterized hyperspheres for the classes of patient and healthy subject, respectively. The hypersphere for the patient class is developed in a weighted version so as to take the diversity of patient instances into consideration. The hypersphere for the class of healthy people keeps furthest away from the patient class in order to achieve maximum separation from the patient class. A query is labelled by membership functions defined based on the two hyperspheres. If the query is rejected by the two classes, the angle information of the query to outliers and overlapping-region data is investigated to provide the final decision. The experimental results illustrate that the proposed method achieves the highest diagnosis accuracy among the state-of-the-art methods.Entities:
Keywords: computer-aided diagnosis; multi-sensor data fusion; support vector hypersphere; trial hypertrophy
Mesh:
Year: 2017 PMID: 28880228 PMCID: PMC5620658 DOI: 10.3390/s17092049
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 12 dimensions of all the dimensions to briefly show the outliers or overlaps in the atrial hypertrophy datasets.
Figure 2Exclusive OR dataset.
Figure 3Cross plane dataset.
Classification accuracy of the two datasets (%).
| Classifier | XOR Dataset | Crossing Line | ||
|---|---|---|---|---|
| Training | Testing | Training | Testing | |
| NN1 | 95.2 | 93.5 | 97.7 | 97.6 |
| SVM | 95.7 | 93.6 | 96.9 | 96.2 |
| LDSVM | 96.1 | 94.2 | 97.1 | 96.5 |
| MVSVM | 96.3 | 93.9 | 97.4 | 96.0 |
| TWSVM | 97.0 | |||
| CSVH | 96.9 | 98.1 | ||
Figure 4Characterized Support Vector Hyperspheres: membership values.
Figure 5Multi-weight Vector projection Support Vector Machines: Distance values.
Figure 6Twin Support Vector Machines: Distance values.
Comparison of accuracy (%).
| Dataset | NN1 | SVM | MVSVM | TWSVM | LDSVM | CSVH |
|---|---|---|---|---|---|---|
| Blood transfusion | 81.5 ± 3.9 | 82.0 ± 3.1 | 83.1 ± 4.2 | 84.0 ± 2.6 | 83.7 ± 3.7 | |
| Ionosphere | 92.6 ± 3.1 | 94.8 ± 4.7 | 94.1 ± 5.1 | 95.0 ± 4.1 | 95.3 ± 3.0 | |
| Breast Cancer | 95.3 ± 2.8 | 96.0 ± 3.6 | 95.7 ± 4.3 | 96.5 ± 3.4 | 96.6 ± 3.8 | |
| SPECTF heart | 93.1 ± 3.6 | 93.2 ± 4.1 | 93.7 ± 3.1 | 97.2 ± 3.0 | 95.1 ± 4.4 | |
| Liver | 72.6 ± 3.8 | 74.6 ± 4.2 | 74.1 ± 4.2 | 75.0 ± 2.9 | 74.4 ± 2.8 | |
| Australian | 85.2 ± 4.1 | 86.2 ± 3.3 | 87.0 ± 3.6 | 87.3 ± 4.3 | 87.7 ± 3.5 | |
| Diabetes | 75.1 ± 3.5 | 75.3 ± 3.9 | 76.2 ± 2.6 | 77.2 ± 3.6 | 78.1 ± 3.1 |
F values of the Friedman test among the accuracies for each dataset (significance level α = 0.05).
| Dataset | |
|---|---|
| Blood transfusion | 44.8571 |
| Ionosphere | 36.1714 |
| Breast Cancer | 39.9429 |
| SPECTF heart | 45.8857 |
| Liver | 41.0286 |
| Australian | 32.7429 |
| Diabetes | 41.2571 |
Diagnosis accuracy on atrial hypertrophy data (%).
| Training Subset Ratio | NN | SVM | MVSVM | TWSVM | LDSVM | CSVH |
|---|---|---|---|---|---|---|
| 75.0 ± 2.7 | 77.5 ± 3.7 | 77.7 ± 3.5 | 78.1 ± 3.0 | 78.7 ± 3.6 | ||
| 75.2 ± 3.2 | 77.8 ± 4.0 | 77.2 ± 2.8 | 78.3 ± 4.2 | 78.5 ± 3.0 | ||
| 74.1 ± 3.9 | 76.2 ± 4.3 | 76.0 ± 4.3 | 77.9 ± 3.5 | 78.0 ± 4.1 |
p-Values of the Wilcoxon signed-ranks test between CSVH and other classifiers (significance level α = 0.05).
| Training Subset Ratio | CSVH vs. NN | CSVH vs. SVM | CSVH vs. MVSVM | CSVH vs. TWSVM | CSVH vs. LDSVM |
|---|---|---|---|---|---|
| 0.002 | 0.002 | 0.0039 | 0.0059 | 0.0156 | |
| 0.0039 | 0.0098 | 0.0059 | 0.0254 | 0.0273 | |
| 0.002 | 0.0237 | 0.0098 | 0.0195 | 0.0488 |
Imbalanced training dataset details.
| # Patient | # Health | |
|---|---|---|
| T1 | 5 | 20 |
| T2 | 5 | 25 |
| T3 | 5 | 30 |
| T4 | 5 | 35 |
| T5 | 30 | 5 |
| T6 | 45 | 5 |
| T7 | 60 | 5 |
| T8 | 75 | 5 |
Figure 7Classification accuracy of T1, T2, T3, and T4.
Figure 8Classification accuracy of T5, T6, T7, and T8.
The standard deviations of the classifiers on T1-T8.
| Dataset | NN1 | SVM | MVSVM | TWSVM | CSVH |
|---|---|---|---|---|---|
| T1 | 3.1 | 4.2 | 3.9 | 4.0 | 3.2 |
| T2 | 2.8 | 4.1 | 3.2 | 3.5 | 3.0 |
| T3 | 3.7 | 3.8 | 3.7 | 4.5 | 2.7 |
| T4 | 3.3 | 4.0 | 4.3 | 4.1 | 3.6 |
| T5 | 3.6 | 4.2 | 2.9 | 4.4 | 3.7 |
| T6 | 2.6 | 4.0 | 3.4 | 3.6 | 3.2 |
| T7 | 3.7 | 3.9 | 3.3 | 4.0 | 2.8 |
| T8 | 3.8 | 4.6 | 3.7 | 3.8 | 3.6 |