| Literature DB >> 29692863 |
Anam Mustaqeem1, Syed Muhammad Anwar1, Muahammad Majid2.
Abstract
Arrhythmia is considered a life-threatening disease causing serious health issues in patients, when left untreated. An early diagnosis of arrhythmias would be helpful in saving lives. This study is conducted to classify patients into one of the sixteen subclasses, among which one class represents absence of disease and the other fifteen classes represent electrocardiogram records of various subtypes of arrhythmias. The research is carried out on the dataset taken from the University of California at Irvine Machine Learning Data Repository. The dataset contains a large volume of feature dimensions which are reduced using wrapper based feature selection technique. For multiclass classification, support vector machine (SVM) based approaches including one-against-one (OAO), one-against-all (OAA), and error-correction code (ECC) are employed to detect the presence and absence of arrhythmias. The SVM method results are compared with other standard machine learning classifiers using varying parameters and the performance of the classifiers is evaluated using accuracy, kappa statistics, and root mean square error. The results show that OAO method of SVM outperforms all other classifiers by achieving an accuracy rate of 81.11% when used with 80/20 data split and 92.07% using 90/10 data split option.Entities:
Mesh:
Year: 2018 PMID: 29692863 PMCID: PMC5859855 DOI: 10.1155/2018/7310496
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1An overview of the steps involved in the proposed classification model.
Figure 2The steps involved in WFS method.
Classification accuracy of SVM methods for different data splits.
| SVM methods | Accuracy with respect to dataset splits percentages | ||||
|---|---|---|---|---|---|
| 50/50 | 60/40 | 70/30 | 80/20 | 90/10 | |
| OAA | 60.18 | 60.22 | 67.65 | 72.22 | 80.00 |
| OAO | 73.45 | 74.59 | 77.20 | 81.11 | 92.07 |
| ECC | 69.91 | 71.82 | 76.47 | 77.78 | 88.89 |
Figure 3A comparison of classification performances for different SVM kernels.
Performance comparison of various classifiers with and without feature selection.
| Feature selection | Performance metrics | Machine learning algorithms | ||||
|---|---|---|---|---|---|---|
| SVM | NB | MLP | RF | KNN | ||
| Yes | Accuracy (%) | 81.11 | 77.78 | 78 | 80 | 77.78 |
| Kappa statistics | 0.72 | 0.62 | 0.65 | 0.66 | 0.61 | |
| Root mean square error | 0.17 | 0.26 | 0.27 | 0.25 | 0.26 | |
|
| ||||||
| No | Accuracy (%) | 64.40 | 75.50 | 73.00 | 72.22 | 62.22 |
| Kappa statistics | 0.52 | 0.58 | 0.47 | 0.47 | 0.21 | |
| Root mean square error | 0.22 | 0.19 | 0.18 | 0.18 | 0.20 | |
Figure 4Accuracy of KNN with varying K values.
Performance comparison of feature selection methods.
| Feature selection method | Number of selected features | Accuracy (%) | Kappa statistics | RMSE |
|---|---|---|---|---|
| PCA | 79 | 76.67 | 0.67 | 0.18 |
| IGFS | 23 | 68.89 | 0.55 | 0.20 |
| Correlation based selection | 91 | 60.00 | 0.45 | 0.22 |
| Wrapper method | 94 | 81.11 | 0.72 | 0.17 |
Figure 5ROC curve for different SVM methods used in this study.
Comparison of the proposed model with state-of-the-art methods.
| Method | Train-test split | Number of selected features | Accuracy (%) |
|---|---|---|---|
|
|
|
|
|
| IBPLN + LM [ | 68-32 | 18 | 87.71% |
| Modular NN [ | 90-10 | 198 | 78.89% |
| SVM [ | 75-25 | 60 | 84% |
| NB [ | 70-30 | 205 | 70.50% |
| KNN [ | 20-fold CV | 148 | 73.80% |
| MLP NN [ | 3-fold CV | - | 88.24% |
| NN [ | 90-10 | 79 | 76.67% |