| Literature DB >> 34079470 |
Hua Zhang1, Chengyu Liu2, Zhimin Zhang3, Yujie Xing4, Xinwen Liu1, Ruiqing Dong5, Yu He1, Ling Xia6, Feng Liu1.
Abstract
The present study addresses the cardiac arrhythmia (CA) classification problem using the deep learning (DL)-based method for electrocardiography (ECG) data analysis. Recently, various DL techniques have been utilized to classify arrhythmias, with one typical approach to developing a one-dimensional (1D) convolutional neural network (CNN) model to handle the ECG signals in the time domain. Although the CA classification in the time domain is very prevalent, current methods' performances are still not robust or satisfactory. This study aims to develop a solution for CA classification in two dimensions by introducing the recurrence plot (RP) combined with an Inception-ResNet-v2 network. The proposed method for nine types of CA classification was tested on the 1st China Physiological Signal Challenge 2018 dataset. During implementation, the optimal leads (lead II and lead aVR) were selected, and then 1D ECG segments were transformed into 2D texture images by the RP approach. These RP-based images as input signals were passed into the Inception-ResNet-v2 for CA classification. In the CPSC, Georgia, and the PTB_XL ECG databases of the PhysioNet/Computing in Cardiology Challenge 2020, the RP-based method achieved an average F1-score of 0.8521, 0.8529, and 0.8862, respectively. The results suggested the excellent generalization ability of the proposed method. To further assess the performance of the proposed method, we compared the 2D RP-image-based solution with the published 1D ECG-based works on the same dataset. Also, it was compared with two traditional ECG transform into 2D image methods, including the time waveform of the ECG recordings and time-frequency images based on continuous wavelet transform (CWT). The proposed method achieved the highest average F1-score of 0.844, with only two leads of the 12-lead ECG original data, which outperformed other works. Therefore, the promising results indicate that the 2D RP-based method has a high clinical potential for CA classification using fewer lead ECG signals.Entities:
Keywords: ECG; Inception-ResNet-v2; cardiac arrhythmia classification; deep learning; recurrence plot
Year: 2021 PMID: 34079470 PMCID: PMC8165394 DOI: 10.3389/fphys.2021.648950
Source DB: PubMed Journal: Front Physiol ISSN: 1664-042X Impact factor: 4.566
FIGURE 1The architecture of the Inception-ResNet-v2.
FIGURE 2The architecture of the Inception-ResNet A,B,C and Reduction A,B.
Data profile for the ECG dataset.
| Type | Recording | Time length (s) | ||||
| Mean | SD | Min | Median | Max | ||
| Normal | 918 | 15.43 | 7.61 | 10.00 | 13.00 | 60.00 |
| AF | 1,098 | 15.01 | 8.39 | 9.00 | 11.00 | 60.00 |
| I-AVB | 704 | 14.32. | 7.21 | 10.00 | 11.27 | 60.00 |
| LBBB | 207 | 14.92 | 8.09 | 9.00 | 12.00 | 60.00 |
| RBBB | 1,695 | 14.42 | 7.60 | 10.00 | 11.19 | 60.00 |
| PAC | 574 | 19.46 | 12.36 | 9.00 | 14.00 | 60.00 |
| PVC | 653 | 20.21 | 12.85 | 6.00 | 15.00 | 60.00 |
| STD | 826 | 15.13 | 6.82 | 8.00 | 12.78 | 60.00 |
| STE | 202 | 17.15 | 10.72 | 10.00 | 11.89 | 60.00 |
| Total | 6,877 | 15.79 | 9.04 | 6.00 | 12.00 | 60.00 |
FIGURE 3The Normal ECG time waveform and its corresponding RP-based image.
FIGURE 5The ECG time waveforms and their corresponding RP-based images of PAC, PVC, STD and STE.
FIGURE 6The workflow of the proposed approach for CA classification.
Classification performance of Inception-ResNet-v2 based on RP images.
| CA Type | Precision | Recall | F1-score |
| Normal | 0.797 | 0.827 | 0.812 |
| AF | 0.852 | 0.898 | 0.875 |
| I-AVB | 0.916 | 0.930 | 0.923 |
| LBBB | 0.933 | 0.924 | 0.929 |
| RBBB | 0.777 | 0.776 | 0.776 |
| PAC | 0.774 | 0.733 | 0.753 |
| PVC | 0.865 | 0.731 | 0.793 |
| STD | 0.808 | 0.867 | 0.837 |
| STE | 0.899 | 0.901 | 0.900 |
| Avg/total | 0.847 | 0.847 | 0.844 |
FIGURE 7The confusion matrix of the proposed method for CA classification.
Classification performance of different reference models.
| Type | F1-score | |||||||
| Xception | Resnet 50 | Densenet | Resnext | Inception-ResNet-v1 | Inception-v3 | Inception-v4 | Inception-ResNet-v2 | |
| Normal | 0.77 | 0.76 | 0.75 | 0.75 | 0.78 | 0.72 | 0.73 | 0.812 |
| AF | 0.85 | 0.85 | 0.85 | 0.88 | 0.86 | 0.86 | 0.85 | 0.875 |
| I-AVB | 0.88 | 0.88 | 0.87 | 0.88 | 0.89 | 0.87 | 0.87 | 0.923 |
| LBBB | 0.89 | 0.90 | 0.92 | 0.91 | 0.93 | 0.91 | 0.91 | 0.929 |
| RBBB | 0.70 | 0.72 | 0.70 | 0.72 | 0.72 | 0.71 | 0.71 | 0.776 |
| PAC | 0.67 | 0.65 | 0.64 | 0.68 | 0.64 | 0.60 | 0.58 | 0.753 |
| PVC | 0.72 | 0.72 | 0.71 | 0.74 | 0.76 | 0.68 | 0.65 | 0.793 |
| STD | 0.79 | 0.79 | 0.76 | 0.79 | 0.79 | 0.77 | 0.75 | 0.837 |
| STE | 0.85 | 0.83 | 0.86 | 0.85 | 0.90 | 0.87 | 0.86 | 0.900 |
| Avg/total | 0.80 | 0.79 | 0.79 | 0.80 | 0.81 | 0.78 | 0.77 | 0.844 |
Classification performance of different 2D images-based input data.
| Type | F1-score | |||||||
| Wavelet time-frequency images | Time waveform | RP-based images | ||||||
| Cgau8 | Cmor | Fbsp | Gaus8 | Mexh | Morl | |||
| Normal | 0.56 | 0.56 | 0.57 | 0.57 | 0.61 | 0.56 | 0.61 | 0.812 |
| AF | 0.76 | 0.77 | 0.75 | 0.81 | 0.75 | 0.77 | 0.80 | 0.875 |
| I-AVB | 0.82 | 0.80 | 0.81 | 0.77 | 0.80 | 0.78 | 0.81 | 0.923 |
| LBBB | 0.71 | 0.74 | 0.71 | 0.68 | 0.67 | 0.72 | 0.87 | 0.929 |
| RBBB | 0.56 | 0.53 | 0.57 | 0.54 | 0.54 | 0.53 | 0.60 | 0.776 |
| PAC | 0.49 | 0.39 | 0.43 | 0.40 | 0.43 | 0.44 | 0.52 | 0.753 |
| PVC | 0.58 | 0.56 | 0.56 | 0.55 | 0.56 | 0.52 | 0.58 | 0.793 |
| STD | 0.60 | 0.63 | 0.62 | 0.64 | 0.61 | 0.61 | 0.62 | 0.837 |
| STE | 0.53 | 0.41 | 0.51 | 0.44 | 0.47 | 0.56 | 0.81 | 0.900 |
| Avg/total | 0.63 | 0.61 | 0.62 | 0.62 | 0.62 | 0.61 | 0.70 | 0.844 |
The performance of the published 1D ECG-based works and the proposed method.
| Rank | Team | Normal | AF | I-AVB | LBBB | RBBB | PAC | PVC | STD | STE | Avg/total |
| 1 | He et al. | 0.748 | 0.920 | 0.882 | 0.889 | 0.883 | 0.787 | 0.851 | 0.780 | 0.780 | 0.836 |
| 2 | Cai et al. | 0.765 | 0.927 | 0.887 | 0.886 | 0.880 | 0.812 | 0.800 | 0.784 | 0.753 | 0.833 |
| 3 | Chen et al. | 0.752 | 0.930 | 0.871 | 0.915 | 0.839 | 0.832 | 0.833 | 0.800 | 0.667 | 0.823 |
| Yao et al. | 0.789 | 0.920 | 0.850 | 0.872 | 0.933 | 0.736 | 0.861 | 0.789 | 0.556 | 0.812 | |
| He et al. | 0.755 | 0.846 | 0.870 | 0.869 | 0.780 | 0.751 | 0.829 | 0.790 | 0.704 | 0.799 | |
| Chen et al. | 0.795 | 0.897 | 0.865 | 0.821 | 0.911 | 0.734 | 0.852 | 0.788 | 0.509 | 0.797 | |
| RP-based | 0.812 | 0.875 | 0.923 | 0.929 | 0.776 | 0.753 | 0.793 | 0.837 | 0.900 | 0.844 | |
Comparison of the published 1D ECG-based works with the proposed method.
| Team | Input signal | ECG leads | Network | Parameters | Avg/total F1-score |
| Yao et al. | 1D ECG | 12 leads | ResNet+BiLSTM-GMP | 4,984,640 | 0.812 |
| He et al. | 1D ECG | 12 leads | ATI-CNN | No report | 0.799 |
| Chen et al. | 1D ECG | 12 leads | CNN+BRNN+Attention | 28,035 | 0.797 |
| RP-based | 2D RP images | 2 leads (lead II and aVR) | Inception-resnet-v2 | 46,964,673 | 0.844 |
Data profile for the CPSC, PTB_XL, and Georgia ECG dataset.
| Database | Sample frequency | Mean duration | Number of subjects | |||||
| Normal | AF | I-AVB | LBBB | RBBB | PAC | |||
| CPSC | 500 Hz | 16.2 s | 918 | 1,221 | 722 | 236 | 1,857 | 616 |
| PTB_XL | 500 Hz | 10.0 s | 18,092 | 1,514 | 797 | 536 | 0 | 398 |
| Georgia | 500 Hz | 10.0 s | 1,752 | 570 | 769 | 231 | 542 | 639 |
Classification performance of different ECG datasets.
| Database | Avg/total F1-score | Classification of subjects F1-score | |||||
| Normal | AF | I-AVB | LBBB | RBBB | PAC | ||
| CPSC | 0.8521 | 0.7905 | 0.9269 | 0.8921 | 0.8825 | 0.8942 | 0.7266 |
| PTB_XL | 0.8862 | 0.9417 | 0.9167 | 0.8644 | 0.9246 | 0 | 0.7837 |
| Georgia | 0.8529 | 0.9237 | 0.8197 | 0.8706 | 0.8767 | 0.8629 | 0.7639 |