| Literature DB >> 35877319 |
Jie Yang1,2, Jinfeng Li2, Kun Lan3, Anruo Wei2, Han Wang4,5,6, Shigao Huang7, Simon Fong1,6.
Abstract
There are three primary challenges in the automatic diagnosis of arrhythmias by electrocardiogram (ECG): the significant variation among individual patients, the multiple pathologies in the ECG signal and the high cost in annotating clinical ECG with the corresponding labels. Traditional ECG processing approaches rely heavily on prior knowledge, such as those from feature extraction and waveform analysis. The preprocessing for prior knowledge incurs computational overhead. Furthermore, standard deep learning methods do not fully consider the dynamic temporal, spatial and multi-labeling characteristics of ECG data. In clinical ECG waveforms, it is common to see multi-labeling in which a patient is labeled with multiple classes of arrhythmias. However, multiclass approaches in current research mainly solve the multi-label machine learning problem, ignoring the correlation between diseases, resulting in information loss. In this paper, an arrhythmia detection and classification scheme called multi-label fusion deep learning is proposed. The objective is to build a unified system with automatic feature learning which supports effective multi-label classification. First, a multi-label ECG-based feature selection method is combined with a matrix decomposition and sparse learning theory. The optimal feature subset is selected as a preprocessing algorithm for ECG data. A multi-label classifier is then constructed by fusing CNN and RNN networks to fully exploit the interactions and features of the time and space dimensions. The experimental result demonstrates that the proposed method can achieve a state-of-the-art performance compared to other algorithms in multi-label database experiments.Entities:
Keywords: arrhythmia recognition; electrocardiogram signals; fusion learning; multi-label attribute selection
Year: 2022 PMID: 35877319 PMCID: PMC9312290 DOI: 10.3390/bioengineering9070268
Source DB: PubMed Journal: Bioengineering (Basel) ISSN: 2306-5354
Figure 1ECG signal wave.
Selected ECG data characteristics.
| Type | Prevalence Rate | Number of Records |
|---|---|---|
| Normal | N/A | 914 |
| AF | 11–15% | 1219 |
| PVC | 14–16% | 711 |
| PAC | 5–7% | 609 |
| LBBB | 5–7% | 254 |
| RBBB | 5–7% | 1828 |
Figure 2Multi-label ECG signals with Sinus Tachycardia, Right Atrial Abnormality and Atrial Premature Beat Trigeminy.
Characteristics of the extracted attributes.
| Type of Features | No. | Overview | Specific Content |
|---|---|---|---|
| Time domain features | 27 | A statistical feature is extracted from the RR interval of the ECG signal | the minimum and maximum values of the RR intervals, the median heart rate and the root mean square of the difference between adjacent RR intervals, etc. |
| Frequency domain features | 35 | Mainly based on the features of the ECG signal with windows | Calculation of window signal spectrum parameters. Including spectral center, center-of-mass frequency, wavelet transform coefficient, normalized low frequency power and normalized high frequency power, etc. |
| Morphological features | 30 | Morphological change features | Calculate the depth of S-wave and Q-wave and R-wave, ST slope, width of QRS, etc. according to the position and amplitude of P-wave, Q-wave, R-wave, S-wave and T-wave. |
| Nonlinear features | 26 | Other features | Calculated by nonlinear methods, such as sample entropy, approximate entropy, fuzzy entropy, etc. |
Figure 3Structure diagram of multi-label attribute selection and classification model for arrhythmia detection.
Figure 4Description of CNN for feature extraction of ECG signal data.
Figure 5Structural diagram of the proposed CNN-GRU model.
Configurations the proposed CNN-GRU models.
| Types | Activation Function | Output Shapes | Kernel Size | No. of Filters | Stride | Trainable Parameters |
|---|---|---|---|---|---|---|
| Input | – | 1000 × 1 | – | – | – | 0 |
| Full convolution | ReLU | 1008 × 3 | 20 × 1 | 3 | 1 | 50 |
| Max-pooling | – | 504 × 3 | 2 × 1 | 3 | 2 | 0 |
| Full convolution | ReLU | 520 × 6 | 10 × 1 | 6 | 1 | 160 |
| Max-pooling | – | 260 × 6 | 2 × 1 | 6 | 2 | 0 |
| Full convolution | ReLU | 263 × 6 | 5 × 1 | 6 | 1 | 160 |
| Max-pooling | – | 132 × 6 | 2 × 1 | 6 | 2 | 0 |
| GRU | 20 | – | – | – | 1280 | |
| Fully-connected | ReLU | 20 | – | – | – | 400 |
| Fully-connected | ReLU | 10 | – | – | – | 200 |
| Fully-connected | Softmax | 5 | – | – | – | 55 |
Figure 6Top 20 important attributes ranked using proposed method.
Classification results based on different multi-label classification methods.
| Methods | Accuracy Score | Hamming Loss | Jaccard Similarity | Precision | Recall | F1 |
|---|---|---|---|---|---|---|
| BRSVM | 0.411 | 0.116 | 0.447 | 0.519 | 0.353 | 0.364 |
| MLKNN | 0.560 | 0.115 | 0.588 | 0.72 | 0.515 | 0.561 |
| MLHARAM | 0.487 | 0.149 | 0.625 | 0.567 | 0.637 | 0.552 |
| MLTSVM | 0.261 | 0.143 | 0.327 | 0.582 | 0.369 | 0.439 |
| Label Powerset | 0.718 | 0.137 | 0.752 | 0.854 | 0.661 | 0.717 |
| Classifer Chain | 0.659 | 0.068 | 0.694 | 0.893 | 0.584 | 0.683 |
| LSPC | 0.381 | 0.27 | 0.376 | 0.366 | 0.735 | 0.486 |
| EEMD + FFT + BP * | 0.745 | 0.072 | 0.757 | 0.784 | 0.736 | 0.712 |
| CNN + LSTM | 0.761 | 0.07 | 0.787 | 0.818 | 0.745 | 0.753 |
| FusionGC | 0.763 | 0.06 | 0.788 | 0.815 | 0.748 | 0.754 |
| AS+FusionGC * | 0.774 | 0.062 | 0.795 | 0.839 | 0.734 | 0.773 |
Figure 7Confusion matrix of the classified performance (left). ROC curve of different labels (right). The ROC curves for the different disease labels are marked in different colours. (N = Normal, A = AF, L = LBBB, R = RBBB, PA = PAC, PV = PVC).