| Literature DB >> 35957162 |
Saad Irfan1, Nadeem Anjum1, Turke Althobaiti2, Abdullah Alhumaidi Alotaibi3, Abdul Basit Siddiqui1, Naeem Ramzan4.
Abstract
Cardiac arrhythmias pose a significant danger to human life; therefore, it is of utmost importance to be able to efficiently diagnose these arrhythmias promptly. There exist many techniques for the detection of arrhythmias; however, the most widely adopted method is the use of an Electrocardiogram (ECG). The manual analysis of ECGs by medical experts is often inefficient. Therefore, the detection and recognition of ECG characteristics via machine-learning techniques have become prevalent. There are two major drawbacks of existing machine-learning approaches: (a) they require extensive training time; and (b) they require manual feature selection. To address these issues, this paper presents a novel deep-learning framework that integrates various networks by stacking similar layers in each network to produce a single robust model. The proposed framework has been tested on two publicly available datasets for the recognition of five micro-classes of arrhythmias. The overall classification sensitivity, specificity, positive predictive value, and accuracy of the proposed approach are 98.37%, 99.59%, 98.41%, and 99.35%, respectively. The results are compared with state-of-the-art approaches. The proposed approach outperformed the existing approaches in terms of sensitivity, specificity, positive predictive value, accuracy and computational cost.Entities:
Keywords: ECG classification; cardiac arrhythmia; deep learning; feature extraction; hybrid models
Mesh:
Year: 2022 PMID: 35957162 PMCID: PMC9370835 DOI: 10.3390/s22155606
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Literature Overview.
| Approach | Algorithm(s) | Accuracy (%) | Arrhythmias Recognized | Limitations |
|---|---|---|---|---|
| [ | SVM + RF | 77.40 | 5 | A conventional hybrid machine-learning model with average accuracy and high computational cost |
| [ | OPF | 90.09 | 5 | An efficient classifier that is used in conjunction with other ML algorithms but requires extensive data preprocessing to produce optimal results |
| [ | ICCA + ANN | 99.60 | 8 | Achieved a very high accuracy on the cost of exponentially longer training time |
| [ | SVM + GB | 84.82 | 16 | Portrayed average accuracy on a redundant dataset with only 500 records against 16 classes |
| [ | Random Forest + BFS | 85.58 | 16 | Redundant dataset used with only 500 records against 16 classes |
| [ | Echo State Network | 98.60 | 16 | Requires too much computational resources leading to a high cost |
| [ | LSTM | 95.80 | 8 | Model requires longer training time to produce substantial results |
| [ | LSTM | 95.00 | 5 | Only accuracy considered to be a performance metric, not enough to benchmark an approach |
| [ | BiLSTM | 95.00 | 5 | A deep LSTM-BiLSTM model which takes too long to train thus increasing the computational cost |
| [ | CNN, RNN, Auto-Encoder, DBN | - | 16 | A survey paper providing an outline with respect to the deep-learning models used in heartbeat classification. No such limitations |
| [ | Deep CNN | 94.00 | 5 | A deep layered CNN with no such weakness except a deep structure that requires relatively long time to train |
| [ | DNN | 94.00 | 5 | A DNN combined with a genetic algorithm with high computational cost |
| [ | Deep 1D-CNN | 97.00 | 9 | A deep CNN with high accuracy but low F1 score |
| [ | DCNN + TFCV | 97.40 | 5 | Too much training time required considering the amount of preprocessing performed on the dataset |
Figure 1The Proposed Framework.
Figure 2The Architecture of the CNN model.
Hyperparameters for the CNN layers.
| Parameter | Value—D1 and D2 |
|---|---|
| Input size |
|
| Stride |
|
| Kernel number |
|
| Kernel size—Conv1 |
|
| Kernel size—Conv2 |
|
| Kernel size—Conv3 |
|
| Pool size |
|
| Activation |
|
| Padding | same |
Figure 3Architecture of the LSTM Cell.
Hyperparameters for the LSTM layers.
| Parameter | Value—D1 | Value—D2 |
|---|---|---|
|
| 1 × 50 | 1 × 200 |
|
| 64 | 64 |
|
| LeakyReLu | ReLu |
| Optimizer | Adam | Adam |
Figure 4Architecture of the Merger after Compilation.
Parameters for the Merger.
| Parameter | Value—D1 | Value—D2 |
|---|---|---|
|
| 500 | 25 |
|
| 25 | 180 |
| Metric | Accuracy | Accuracy |
| Loss function | Categorical Crossentropy | Categorical Crossentropy |
| Optimizer | Adam | Adam |
UCI Arrhythmia Dataset (D1) Class-Instance Distribution.
| Super-Class | Annotations | Total Instances |
|---|---|---|
| Normal heartbeat | N | 245 |
| Ischemic Changes | IC | 44 |
| Anterior Myocardial Infarction | AM | 15 |
| Inferior Myocardial Infarction | IM | 15 |
| Sinus tachycardia | ST | 13 |
| Sinus bradycardia | SB | 25 |
| Ventricular Premature Contraction | V | 3 |
| Supraventricular Premature Contraction | S | 2 |
| Left bundle branch block | L | 9 |
| Right bundle branch block | R | 50 |
| Left ventricle hypertrophy | LV | 4 |
| Atrial Fibrillation or Flutter | A | 5 |
| Other heartbeats | Q | 22 |
Training and Testing Data Division—D1.
| Class | No. Test Instances | No. Training Instances |
|---|---|---|
| N | 96 | 149 |
| IC | 14 | 30 |
| AM | 7 | 8 |
| IM | 4 | 11 |
| ST | 6 | 7 |
| SB | 12 | 13 |
| V | 2 | 1 |
| S | 1 | 2 |
| L | 2 | 6 |
| R | 24 | 26 |
| LV | 3 | 1 |
| A | 1 | 4 |
| Q | 9 | 13 |
MIT-BIH Arrhythmia Dataset (D2) Sub-class Distribution.
| Super-Class | Annotations | Sub-Classes |
|---|---|---|
| Normal heartbeat | N | e, j, N, L, R |
| Supraventricular ectopic heartbeat | S | a, A, J, S |
| Ventricular ectopic heartbeat | V | E, V |
| Fusion heartbeat | F | F |
| Unclassified heartbeat | Q | f, P, Q |
Figure 5Raw ECG Signal.
Figure 6Denoised ECG Signal.
Imbalanced Instances in D2.
| Class | No. of Instances |
|---|---|
| N | 75,011 |
| L | 8071 |
| R | 7255 |
| A | 2546 |
| V | 7129 |
Figure 7Class distribution (a) before resampling and (b) after resampling in the dataset D2.
Figure 8Normalized ECG Signal.
Training and Testing Data Division—D2.
| Class | No. of Test Instances | No. of Training Instances |
|---|---|---|
| N | 2113 | 7887 |
| L | 2022 | 7978 |
| R | 1967 | 8033 |
| A | 1913 | 8087 |
| V | 1985 | 8015 |
Evaluation of Performance Metrics on D1.
| Heartbeat | Sensitivity (%) | Specificity (%) | PPV (%) | Accuracy (%) |
|---|---|---|---|---|
| N | 98.95 | 97.65 | 97.22 | 98.33 |
| IC | 86.67 | 100.00 | 100.00 | 98.88 |
| AM | 83.33 | 98.85 | 71.43 | 98.33 |
| IM | 80.00 | 100.00 | 100.00 | 99.44 |
| ST | 100.00 | 99.42 | 83.33 | 99.44 |
| SB | 81.82 | 98.17 | 75.00 | 97.14 |
| V | 100.00 | 100.00 | 100.00 | 100.00 |
| S | 100.00 | 100.00 | 100.00 | 100.00 |
| L | 50.00 | 99.43 | 66.67 | 98.33 |
| R | 100.00 | 98.75 | 91.67 | 98.90 |
| LV | 100.00 | 100.00 | 100.00 | 100.00 |
| A | 100.00 | 100.00 | 100.00 | 100.00 |
| Q | 77.78 | 100.00 | 100.00 | 98.89 |
| Average | 89.11 | 99.40 | 91.17 | 99.05 |
Confusion Matrix of the Classified Arrhythmia Heartbeats—D1.
| Predicted | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
| Actual | N | 94 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| IC | 1 | 13 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| AM | 0 | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | |
| IM | 1 | 0 | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| ST | 0 | 0 | 0 | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| SB | 0 | 0 | 2 | 0 | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| V | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | |
| S | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | |
| L | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | |
| R | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 | 0 | 0 | 0 | |
| LV | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | |
| A | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | |
| Q | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 7 | |
Evaluation of Performance Metrics on D2.
| Heartbeat | Sensitivity (%) | Specificity (%) | PPV (%) | Accuracy (%) |
|---|---|---|---|---|
| N | 98.54 | 98.82 | 95.55 | 98.76 |
| L | 99.56 | 99.91 | 99.65 | 99.84 |
| R | 99.13 | 99.73 | 98.88 | 99.61 |
| A | 96.03 | 99.70 | 98.75 | 98.98 |
| V | 98.60 | 99.81 | 99.24 | 99.57 |
| Average | 98.37 | 99.59 | 98.41 | 99.35 |
Confusion Matrix of the Classified Arrhythmia Heartbeats—D2.
| Predicted | ||||||
|---|---|---|---|---|---|---|
|
|
|
|
|
| ||
| Actual | N | 2019 | 3 | 4 | 14 | 9 |
| L | 3 | 2015 | 0 | 1 | 5 | |
| R | 10 | 0 | 1945 | 7 | 0 | |
| A | 60 | 0 | 17 | 1889 | 1 | |
| V | 21 | 4 | 1 | 2 | 1970 | |
Overall Accuracy of the Proposed Approach and Referenced Deep-Learning Approaches on D2.
| Approach | Overall Accuracy (%) |
|---|---|
| Deep LSTM [ | 95.80 |
| LSTM [ | 95.00 |
| BiLSTM [ | 95.00 |
| DCNN [ | 94.00 |
| DNN [ | 94.00 |
| Deep 1D-CNN [ | 97.00 |
| DCNN + TFCV [ | 97.40 |
| Proposed Approach | 99.35 |
Overall Results of the Performance Metrics on D2.
| Performance Metric | Deep CNN + TFCV | The Proposed Approach |
|---|---|---|
| Sensitivity (%) | 97.05 | 98.37 |
| Specificity (%) | 99.35 | 99.59 |
| PPV (%) | 97.22 | 98.41 |
| Accuracy (%) | 97.40 | 99.35 |
Comparison of the Training Time with DCNN+TFCV Approach.
| Approach | Training Time(m) |
|---|---|
| DCNN + TFCV | 120 |
| Proposed Approach | 17 |