| Literature DB >> 35448245 |
Zhipeng Cai1, Tiantian Wang1, Yumin Shen1, Yantao Xing1, Ruqiang Yan1,2, Jianqing Li2, Chengyu Liu1.
Abstract
Premature ventricular contraction (PVC) is one of the common ventricular arrhythmias, which may cause stroke or sudden cardiac death. Automatic long-term electrocardiogram (ECG) analysis algorithms could provide diagnosis suggestion and even early warning for physicians. However, they are mutually exclusive in terms of robustness, generalization and low complexity. In this study, a novel PVC recognition algorithm that combines deep learning-based heartbeat template clusterer and expert system-based heartbeat classifier is proposed. A long short-term memory-based auto-encoder (LSTM-AE) network was used to extract features from ECG heartbeats for K-means clustering. Thus, the templates were constructed and determined based on clustering results. Finally, the PVC heartbeats were recognized based on a combination of multiple rules, including template matching and rhythm characteristics. Three quantitative parameters, sensitivity (Se), positive predictive value (P+) and accuracy (ACC), were used to evaluate the performances of the proposed method on the MIT-BIH Arrhythmia database and the St. Petersburg Institute of Cardiological Technics database. Se on the two test databases was 87.51% and 87.92%, respectively; P+ was 92.47% and 93.18%, respectively; and ACC was 98.63% and 97.89%, respectively. The PVC scores on the third China Physiological Signal Challenge 2020 training set and hidden test set were 36,256 and 46,706, respectively, which could rank first in the open-source codes. The results showed that the combination strategy of expert system and deep learning can provide new insights for robust and generalized PVC identification from long-term single-lead ECG recordings.Entities:
Keywords: K-means clustering algorithm; electrocardiogram; premature ventricular contraction; rule-based decision algorithm
Mesh:
Substances:
Year: 2022 PMID: 35448245 PMCID: PMC9025768 DOI: 10.3390/bios12040185
Source DB: PubMed Journal: Biosensors (Basel) ISSN: 2079-6374
The Detailed Information of Three Database.
| Database | ECG Length | # PVC Beats | # Non_PVC Beats | # Total Beats | Sampling Frequency (Hz) | |
|---|---|---|---|---|---|---|
| Training | MIT-BIH 1 | 30 min | 6990 | 92,851 | 99,841 | 360 |
| Test | INCART-12 | 30 min | 20,008 | 155,652 | 175,660 | 275 |
| CPSC2020 Training | ~24 h | 42,075 | 853,636 | 895,711 | 400 |
1 Four records (102, 104, 107, and 217) containing paced beats in MIT-BIH database were excluded in this study. # means the number of each beats.
Figure 1Flowchart of proposed method.
Figure 2Structure of LSTM-AE in this study.
The example of classification accuracy in MIT-BIH-AR database under different hyperparameter setting (record 100).
| Batch | 64 | 128 | 256 | |
|---|---|---|---|---|
| Feature Numbers | ||||
| 16 | 99.62% | 99.65% | 98.61% | |
| 32 | 99.68% |
| 98.59% | |
| 64 | 99.33% | 99.60% | 99.65% | |
Figure 3The ranked feature vectors of PVC and Non_PVC from record 228, according to the t-test p-value in ascending order.
Figure 4The results of clustering from record 210. (a,b) are all heartbeats superposition of each cluster; (c,d) are the 10 heartbeats extracted from each cluster to build templates; (e,f) are templates of the cluster.
Figure 5Results of the proposed method on the MIT-BIH-AR database and INCART database, respectively. (a) The evaluation indices of the proposed method on MIT-BIH-AR database; (b) the evaluation indices of the proposed method on INCART database.
PVC recognition results on the MIT-BIH-AR database.
| Record | Se (%) | P+ (%) | ACC (%) | Record | Se (%) | P+ (%) | ACC (%) |
|---|---|---|---|---|---|---|---|
| 100 | 100.00 | 100.00 | 100.00 | 202 | 94.74 | 81.82 | 99.77 |
| 101 | - | - | 100.00 1 | 203 | 73.76 | 91.06 | 95.00 |
| 103 | - | - | 100.00 1 | 205 | 92.96 | 100.00 | 99.81 |
| 105 | 90.24 | 68.52 | 99.18 | 207 | 65.07 | 61.54 | 91.50 |
| 106 | 79.81 | 100.00 | 94.82 | 208 | 92.42 | 100.00 | 97.08 |
| 108 | 88.24 | 65.22 | 99.43 | 209 | 100.00 | 100.00 | 100.00 |
| 109 | 76.32 | 100.00 | 99.64 | 210 | 75.77 | 96.71 | 98.03 |
| 111 | 100.00 | 4.35 | 98.96 | 212 | - | - | 100.00 1 |
| 112 | - | - | 100.00 1 | 213 | 98.18 | 99.08 | 99.79 |
| 113 | - | - | 100.00 1 | 214 | 60.78 | 100.00 | 95.57 |
| 114 | 95.35 | 100.00 | 99.89 | 215 | 91.46 | 100.00 | 99.58 |
| 115 | - | - | 100.00 1 | 219 | 79.69 | 100.00 | 99.40 |
| 116 | 91.67 | 100.00 | 99.62 | 220 | - | - | 100.00 1 |
| 117 | - | - | 100.00a | 221 | 97.22 | 100.00 | 99.55 |
| 118 | 93.75 | 40.54 | 98.99 | 222 | - | 0.00 | 88.99 2 |
| 119 | 99.55 | 100.00 | 99.90 | 223 | 63.21 | 100.00 | 93.28 |
| 121 | 100.00 | 100.00 | 100.00 | 228 | 98.62 | 100.00 | 99.76 |
| 122 | - | - | 100.00a | 230 | 100.00 | 100.00 | 100.00 |
| 123 | 100.00 | 100.00 | 100.00 | 231 | 100.00 | 100.00 | 100.00 |
| 124 | 78.72 | 100.00 | 99.38 | 232 | 0.00 | - | 99.89 2 |
| 200 | 94.97 | 99.74 | 98.34 | 233 | 94.10 | 99.74 | 98.34 |
| 201 | 99.49 | 89.95 | 98.83 | 234 | 100.00 | 100.00 | 100.00 |
1 This single record excludes PVC beats, and there is no false detection of PVC beats. Therefore, the TP, FN, and FP of this record are all 0. 2 This single record excludes PVC beats but false detects Non_PVC beats as PVC beats. Therefore, TP and FN of this record are 0, but TP is not 0.
PVC recognition results on the INCART database.
| ID | Se (%) | P+ (%) | ACC (%) | ID | Se (%) | P+ (%) | ACC (%) | ID | Se (%) | P+ (%) | ACC (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| I01 | 100.00 | 86.00 | 97.97 | I26 | 25.00 | 50.00 | 99.73 | I51 | 97.63 | 100.00 | 99.32 |
| I02 | 87.34 | 94.34 | 98.47 | I27 | 100.00 | 100.00 | 100.00 | I52 | 100.00 | 100.00 | 100.00 |
| I03 | 92.00 | 100.00 | 99.59 | I28 | 75.00 | 33.33 | 99.59 | I53 | 96.94 | 100.00 | 98.50 |
| I04 | 22.31 | 93.10 | 96.01 | I29 | 68.33 | 99.63 | 90.45 | I54 | 68.18 | 93.75 | 99.66 |
| I05 | 83.40 | 99.52 | 97.62 | I30 | 80.13 | 99.83 | 93.86 | I55 | 94.12 | 100.00 | 99.95 |
| I06 | 100.00 | 81.82 | 99.92 | I31 | 70.99 | 99.28 | 87.44 | I56 | 100.00 | 100.00 | 100.00 |
| I07 | 100.00 | 5.88 | 99.41 | I32 | 84.21 | 97.96 | 99.38 | I57 | 100.00 | 48.84 | 99.23 |
| I08 | 86.61 | 99.02 | 97.65 | I33 | 100.00 | 16.67 | 99.73 | I58 | 100.00 | 100.00 | 100.00 |
| I09 | 73.17 | 83.33 | 99.43 | I34 | - | 0.00 | 99.03 | I59 | 64.20 | 96.30 | 98.56 |
| I10 | 83.13 | 100.00 | 99.62 | I35 | 77.46 | 100.00 | 97.18 | I60 | - | 0.00 | 98.87 2 |
| I11 | 100.00 | 50.00 | 99.81 | I36 | 86.89 | 100.00 | 98.49 | I61 | - | - | 100.00 1 |
| I12 | 33.33 | 14.29 | 99.43 | I37 | 99.56 | 100.00 | 99.92 | I62 | 32.45 | 100.00 | 76.21 |
| I13 | 100.00 | 100.00 | 100.00 | I38 | 86.61 | 100.00 | 97.29 | I63 | 58.70 | 100.00 | 97.13 |
| I14 | 100.00 | 100.00 | 100.00 | I39 | 94.25 | 100.00 | 98.99 | I64 | 69.57 | 100.00 | 99.63 |
| I15 | 33.33 | 50.00 | 99.89 | I40 | 92.39 | 92.39 | 99.47 | I65 | 93.46 | 100.00 | 99.06 |
| I16 | 100.00 | 50.00 | 99.87 | I41 | 100.00 | 33.33 | 99.88 | I66 | 97.50 | 100.00 | 99.79 |
| I17 | 92.59 | 100.00 | 99.88 | I42 | 99.29 | 99.87 | 99.58 | I67 | 97.93 | 100.00 | 99.63 |
| I18 | 91.80 | 99.70 | 98.98 | I43 | 97.86 | 99.91 | 98.87 | I68 | 95.65 | 99.35 | 99.70 |
| I19 | 84.59 | 100.00 | 93.65 | I44 | 100.00 | 100.00 | 100.00 | I69 | 99.40 | 98.81 | 99.86 |
| I20 | 75.45 | 100.00 | 98.98 | I45 | 100.00 | 100.00 | 100.00 | I70 | - | 0.00 | 92.50 2 |
| I21 | 87.50 | 77.78 | 99.86 | I46 | 98.34 | 99.76 | 99.70 | I71 | - | 0.00 | 86.22 2 |
| I22 | 69.73 | 99.23 | 98.18 | I47 | 98.92 | 96.84 | 99.80 | I72 | 91.19 | 33.85 | 68.17 |
| I23 | 61.54 | 100.00 | 99.77 | I48 | 98.72 | 100.00 | 99.87 | I73 | 94.29 | 100.00 | 99.80 |
| I24 | 16.67 | 50.00 | 99.77 | I49 | 100.00 | 96.43 | 99.95 | I74 | 98.18 | 100.00 | 99.79 |
| I25 | 60.00 | 37.50 | 99.59 | I50 | 50.00 | 50.00 | 99.87 | I75 | 99.02 | 100.00 | 99.71 |
1 This single record excludes PVC beats, and there is no false detection of PVC beats. Therefore, the TP, FN, and FP of this record are all 0. 2 This single record excludes PVC beats but false detects Non_PVC beats as PVC beats. Therefore, TP and FN of this record are 0, but TP is not 0.
Figure A1Evaluation indices of the proposed method in 12-lead INCART database.
Detailed information on three databases.
| Code No. | CPSC1077 1 | CPSC1091 | CPSC1093 | CPSC1082 | CPSC1089 | This Work |
|---|---|---|---|---|---|---|
| Method | DenseNet + Rules | DL-based 2 +Rules | Bidirectional LSTM | WT + DL-based 3 | CNN | LSTM-AE + K-Means + Rules |
| PVC Score of Test | 41,479 | 55,706 | 95,900 | 97,913 | 142,228 | 46,706 |
| PVC Score of Training | - | 16,467 | 6370 | 4482 | 11,086 | 36,256 |
| Running Time (s) | 1600.35 ± 311.32 | 695.55 ± 185.45 | 12,810.90 ± 726.48 | 18,260.57 ± 2100.84 | 368.29 ± 33.27 | 215.93 ± 59.32 |
1 This team did not publish their code, so we could not obtain the evaluation score of their algorithm on the training set. The other codes are available in http://2020.icbeb.org/CSPC2020 (accessed on 17 March 2022). 2 This DL-based method refers to a deep learning architecture containing multi-dilated convolutional blocks and a squeeze-and-excitation network. 3 This DL-based method refers to the combination of one-dimensional convolutional layers and gated recurrent unit layers.
Comparison of PVC recognition between the proposed method and existing methods on MIT-BIH-AR database and INCART database.
| Author | Class and Focus | Method | Database | # Total Beats | # PVC Beats | Se (%) | P+ (%) | ACC (%) |
|---|---|---|---|---|---|---|---|---|
| Talbi et al., 2016 [ | PVC, Non_PVC | KNN + FLP | MIT-BIH-AR | 95,743 | 7147 | 80.88 | - | 94.63 |
| Wang et al., 2017 [ | PVC, Non_PVC | Statistics +SVM | 110,906 | - | 75.00 | - | 93.13 | |
| Jung et al., 2017 [ | PVC, Non_PVC | Wavelet-based SPC | - | - | 87.20 | 84.60 | 97.90 | |
| Mazidi et al., 2019 [ | PVC, Non_PVC | SVM | 82,163 | 7111 | 99.91 | - | 99.78 | |
| Li et al., 2019 [ | PVC, Non_PVC | Wavelet Transform | 100,372 | 6990 | 82.55 | 82.39 | 97.56 | |
| Cai et al., 2020 [ | Normal, PAC, PVC | +CNN | 98,426 | 6734 | 76.54 | 90.47 | 85.56 | |
| Kalidas et al., 2020 [ | PVC, Non_PVC | Rules | 93,432 | 6898 | 96.58 | 97.20 | - | |
| Wang et al., 2021 [ | PVC, Non_PVC | SSAE + Random Forests | 24,922 | 2187 | 95.47 | 98.75 | 98.25 | |
| This study. 2021 | PVC, Non_PVC | OTSU + CNN | 99,841 | 6990 | 87.51 | 92.47 | 98.63 | |
| Li et al., 2013 [ | PVC, Non_PVC | LSTM-AE + K-Means+ | INCART | 175,892 | 20,011 | 93.40 | 66.50 | 94.00 |
| Oster et al., 2015 [ | PVC, Non_PVC | Rules | 175,871 | 20,011 | 95.40 | 99.30 | - | |
| Rahhal et al., 2018 [ | Normal, PVC and Others | Template-matching | - | - | 85.20 | 80.90 | 92.00 | |
| Kalidas et al., 2020 [ | PVC, Non_PVC | SKF with X-factor Mode | 175,674 | 19,990 | 88.08 | 94.70 | - | |
| This study. 2021 | PVC, Non_PVC | SDAEs + DNN | 175,660 | 20,008 | 87.92 | 93.18 | 97.89 |
# means the number of each beat.