| Literature DB >> 28386181 |
Weiping Zhang1, Jingzhi Yang2, Yanling Fang3, Huanyu Chen3, Yihua Mao4, Mohit Kumar3.
Abstract
The assessment of the physiological state of an individual requires an objective evaluation of biological data while taking into account both measurement noise and uncertainties arising from individual factors. We suggest to represent multi-dimensional medical data by means of an optimal fuzzy membership function. A carefully designed data model is introduced in a completely deterministic framework where uncertain variables are characterized by fuzzy membership functions. The study derives the analytical expressions of fuzzy membership functions on variables of the multivariate data model by maximizing the over-uncertainties-averaged-log-membership values of data samples around an initial guess. The analytical solution lends itself to a practical modeling algorithm facilitating the data classification. The experiments performed on the heartbeat interval data of 20 subjects verified that the proposed method is competing alternative to typically used pattern recognition and machine learning algorithms.Entities:
Keywords: Fuzzy membership functions; Modeling; Variational optimization
Year: 2017 PMID: 28386181 PMCID: PMC5372457 DOI: 10.1016/j.sjbs.2017.01.027
Source DB: PubMed Journal: Saudi J Biol Sci ISSN: 2213-7106 Impact factor: 4.219
Figure 1An uncertain signal model for a scalar y.
Figure 2A few examples of Gamma membership functions (Kumar et al., 2016a, Kumar et al., 2016b).
Figure 3An example of the model learned from 2-dimensional data samples using Algorithm 1 (with β = 0.5).
Figure 4An example of the comparison between the Gaussian mixture models and Algorithm 1 (with β = 0.5).
A comparison of different classification algorithms with the proposed method in term of classification accuracy on testing data.
| Method | Dataset 1 | Dataset 2 | Dataset 3 |
|---|---|---|---|
| Nearest neighbors | 100% | 100% | 75% |
| Linear SVM | 91% | 46% | 51% |
| RBF SVM | 90% | 100% | 59% |
| Decision tree | 98% | 100% | 80% |
| Random forest | 98% | 100% | 73% |
| AdaBoost | 93% | 97% | 80% |
| Naive Bayes | 92% | 97% | 57% |
| LDA | 90% | 29% | 52% |
| QDA | 90% | 96% | 57% |
| Analytical fuzzy ( | 100% | 100% | 82% |
A The median accuracy (in %) of different algorithms in classifying the testing heartbeat intervals between two tasks performed by subjects.
| Method | Median of % accuracy ( | Median of % accuracy ( | Median of % accuracy ( | Median of % accuracy ( |
|---|---|---|---|---|
| Nearest neighbors | 87.11 | 90.33 | 91.08 | 92.65 |
| Linear SVM | 87.11 | 89.24 | 90.64 | 91.58 |
| RBF SVM | 84.07 | 84.17 | 86.99 | 90.11 |
| Decision tree | 84.95 | 87.22 | 88.83 | 89.57 |
| Random forest | 86.75 | 88.93 | 90.84 | 92.51 |
| AdaBoost | 88.36 | 90.72 | 91.87 | 92.60 |
| Naive Bayes | 87.40 | 89.27 | 91.05 | 92.18 |
| LDA | 88.67 | 90.70 | 91.59 | 92.99 |
| QDA | 88.04 | 88.46 | 90.08 | 90.97 |
| Analytical fuzzy ( | 88.75 | 91.16 | 92.14 | 93.14 |