| Literature DB >> 35936913 |
Hua Zhang1, Chengyu Liu2, Fangfang Tang1, Mingyan Li1, Dongxia Zhang3, Ling Xia4, Nan Zhao1, Sheng Li5, Stuart Crozier1, Wenlong Xu6, Feng Liu1.
Abstract
Artificial intelligence (AI) aided cardiac arrhythmia (CA) classification has been an emerging research topic. Existing AI-based classification methods commonly analyze electrocardiogram (ECG) signals in lower dimensions, using one-dimensional (1D) temporal signals or two-dimensional (2D) images, which, however, may have limited capability in characterizing lead-wise spatiotemporal correlations, which are critical to the classification accuracy. In addition, existing methods mostly assume that the ECG data are linear temporal signals. This assumption may not accurately represent the nonlinear, nonstationary nature of the cardiac electrophysiological process. In this work, we have developed a three-dimensional (3D) recurrence plot (RP)-based deep learning algorithm to explore the nonlinear recurrent features of ECG and Vectorcardiography (VCG) signals, aiming to improve the arrhythmia classification performance. The 3D ECG/VCG images are generated from standard 12 lead ECG and 3 lead VCG signals for neural network training, validation, and testing. The superiority and effectiveness of the proposed method are validated by various experiments. Based on the PTB-XL dataset, the proposed method achieved an average F1 score of 0.9254 for the 3D ECG-based case and 0.9350 for the 3D VCG-based case. In contrast, recently published 1D and 2D ECG-based CA classification methods yielded lower average F1 scores of 0.843 and 0.9015, respectively. Thus, the improved performance and visual interpretability make the proposed 3D RP-based method appealing for practical CA classification.Entities:
Keywords: cardiac arrhythmia classification; deep learning; electrocardiogram; recurrence plot; vectorcardiography
Year: 2022 PMID: 35936913 PMCID: PMC9352947 DOI: 10.3389/fphys.2022.956320
Source DB: PubMed Journal: Front Physiol ISSN: 1664-042X Impact factor: 4.755
FIGURE 1Frank’s three leads signal Vx, Vy, and Vz of four types of VCG waveforms (top) and corresponding 3D dynamic feature plots (bottom).
FIGURE 2The architecture of 3DRP Inception ResNet (Stem, Inception ResNet models A-C, Reduction models A and B, and Prediction layers).
Data profile for the ECG dataset.
| CA types | Number of data | Single-label data | Experiment segments | 80% | 20% | |
|---|---|---|---|---|---|---|
| Training | Validation | Test | ||||
| NSR | 18092 | 16801 | 1200 | 768 | 192 | 240 |
| AF | 1514 | 1396 | 1200 | 768 | 192 | 240 |
| LBBB | 536 | 370 | 1110 | 710 | 178 | 222 |
| I-AVB | 797 | 689 | 1378 | 883 | 221 | 274 |
FIGURE 3The flow chart of CA classification experiments.
FIGURE 4The 3DRP image reconstructed based on ECG.
FIGURE 5The 3DRP images of NSR/AF/LBBB/I-AVB based on ECG.
FIGURE 6The 3DRP image reconstructed based on VCG.
FIGURE 7The 3DRP images of NSR/AF/LBBB/I-AVB based on VCG.
Classification performance based on ECG and VCG 3DRP methods with No/Min-max/Z-score normalization datasets.
| Experiments | RP normalization | Avg F1-score | Classification of types of F1 score | |||
|---|---|---|---|---|---|---|
| NSR | AF | LBBB | I-AVB | |||
| ECG-based | No | 0.9228 | 0.8847 | 0.9565 | 0.9775 | 0.8723 |
| Min-max | 0.9247 | 0.8986 | 0.9407 | 0.9795 | 0.8799 | |
| Z-score | 0.9254 | 0.8954 | 0.9472 | 0.9843 | 0.8748 | |
| VCG-based | No | 0.9301 | 0.9049 | 0.9610 | 0.9736 | 0.8810 |
| Min-max | 0.9262 | 0.8946 | 0.9560 | 0.9692 | 0.8849 | |
| Z-score | 0.9350 | 0.9030 | 0.9668 | 0.9712 | 0.8991 | |
Classification Precision/Recall/F1-score of experiments.
| Experiments | CA types | Precision | Recall | F1 score |
|---|---|---|---|---|
| ECG-based | NSR | 0.8992 | 0.8917 | 0.8954 |
| AF | 0.9246 | 0.9708 | 0.9472 | |
| LBBB | 0.9778 | 0.9910 | 0.9843 | |
| I-AVB | 0.8966 | 0.8540 | 0.8748 | |
| Avg | 0.9246 | 0.9269 | 0.9254 | |
| VCG-based | NSR | 0.9145 | 0.8917 | 0.9030 |
| AF | 0.9628 | 0.9708 | 0.9668 | |
| LBBB | 0.9563 | 0.9865 | 0.9712 | |
| I-AVB | 0.9041 | 0.8942 | 0.8991 | |
| Avg | 0.9344 | 0.9358 | 0.9350 |
FIGURE 8The confusion matrix of CA classification based on 3DRP ECG-based, and VCG-based.
Training information of the ECG-based and the VCG-based 3DRP methods.
| Experiments | Trainable parameters | Training time | Five-fold validation | ||||
|---|---|---|---|---|---|---|---|
| fold 1 | fold 2 | fold 3 | fold 4 | fold 5 | |||
| ECG-based | 27,038,708 | 262 Min | 149 Min 56 Epochs | 34 Min 11 Epochs | 28 Min 11 Epochs | 26 Min 11Epochs | 25 Min 11Epochs |
| VCG-based | 27,038,708 | 93 Min | 33 Min 26 Epochs | 21 Min 19 Epochs | 13 Min 11 Epochs | 13 Min 11Epochs | 13 Min 11Epochs |
Comparison of different reference models for CA Classification.
| Models | Classification of F1 score | Avg F1 score | Avg precision | Avg recall | |||
|---|---|---|---|---|---|---|---|
| NSR | AF | LBBB | I-AVB | ||||
| RestNet50 ( | 0.8889 | 0.9339 | 0.9515 | 0.8791 | 0.9134 | 0.9116 | 0.9119 |
| Inception V3 ( | 0.8683 | 0.9434 | 0.9556 | 0.8683 | 0.9089 | 0.9068 | 0.9068 |
| Inception V4 ( | 0.8714 | 0.9263 | 0.9471 | 0.8355 | 0.8951 | 0.8920 | 0.8924 |
| Proposed method | 0.9030 | 0.9668 | 0.9712 | 0.8991 | 0.9350 | 0.9344 | 0.9358 |
Comparison of the computational cost of the proposed 3D method VS. reference models.
| Methods | Trainable parameters | Training time | Five-fold validation | ||||
|---|---|---|---|---|---|---|---|
| fold 1 | fold 2 | fold 3 | fold 4 | fold 5 | |||
| RestNet50 ( | 26,641,796 | 127 Min | 55 Min 42 Epochs | 27 Min 21 Epochs | 16 Min 12 Epochs | 14 Min 11 Epochs | 15 Min 11 Epochs |
| Inception V3 ( | 21,831,844 | 71 Min | 32 Min 32 Epochs | 10 Min 11 Epochs | 10 Min 11 Epochs | 9 Min 11 Epochs | 10 Min 11 Epochs |
| Inception V4 ( | 52,049,092 | 148 Min | 72 Min 41 Epochs | 17 Min 11 Epochs | 25 Min 16 Epochs | 17 Min 11 Epochs | 17 Min 11 Epochs |
|
| 27,038,708 | 93 Min | 33 Min 26 Epochs | 21 Min 19 Epochs | 13 Min 11 Epochs | 13 Min 11 Epochs | 13 Min 11 Epochs |
Comparison of performance of the proposed 3D method VS. 2D and 1D classification methods.
| Methods | Input signals | Avg F1 score | Classification of subjects’ F1 score | |||
|---|---|---|---|---|---|---|
| NSR | AF | LBBB | I-AVB | |||
| 1D ( | 1D raw ECG | 0.8483 | 0.9812 | 0.9627 | 0.8658 | 0.5833 |
| 2D ( | 2D images | 0.9015 | 0.8917 | 0.9365 | 0.9276 | 0.8503 |
| Proposed method | 3D images | 0.9350 | 0.9030 | 0.9668 | 0.9712 | 0.8991 |
Comparison of the computational costs of the proposed 3D method VS. 2D and 1D classification methods.
| Methods | Trainable parameters | Training time | Five-fold validation | ||||
|---|---|---|---|---|---|---|---|
| fold 1 | fold 2 | fold 3 | fold 4 | fold 5 | |||
| 1D ( | 10,466,148 | 107 Min | 36 Min 20 Epochs | 16 Min 9 Epochs | 16 Min 9 Epochs | 21 Min 12Epochs | 18 Min 10Epochs |
| 2D ( | 29,141,450 | 79 Min | 46 Min 56 Epochs | 9 Min 12 Epochs | 8 Min 11 Epochs | 8 Min 11 Epochs | 8 Min 11Epochs |
|
| 27,038,708 | 93 Min | 33 Min 26 Epochs | 21 Min 19 Epochs | 13 Min 11 Epochs | 13 Min 11Epochs | 13 Min 11Epochs |
Generalization ability of the proposed method for CA classification on extra datasets.
| Database | Mean duration | Number of subjects | |||
|---|---|---|---|---|---|
| NSR | AF | LBBB | I-AVB | ||
| CPSC | 16.2s | 918 | 1000 | 567 | 1422 |
| Georgia | 10.0s | 1000 | 1054 | 438 | 1284 |