| Literature DB >> 28729684 |
Qin Qin1, Jianqing Li2, Li Zhang3, Yinggao Yue1, Chengyu Liu1.
Abstract
Automatic feature extraction and classification are two main tasks in abnormal ECG beat recognition. Feature extraction is an important prerequisite prior to classification since it provides the classifier with input features, and the performance of classifier depends significantly on the quality of these features. This study develops an effective method to extract low-dimensional ECG beat feature vectors. It employs wavelet multi-resolution analysis to extract time-frequency domain features and then applies principle component analysis to reduce the dimension of the feature vector. In classification, 12-element feature vectors characterizing six types of beats are used as inputs for one-versus-one support vector machine, which is conducted in form of 10-fold cross validation with beat-based and record-based training schemes. Tested upon a total of 107049 beats from MIT-BIH arrhythmia database, our method has achieved average sensitivity, specificity and accuracy of 99.09%, 99.82% and 99.70%, respectively, using the beat-based training scheme, and 44.40%, 88.88% and 81.47%, respectively, using the record-based training scheme.Entities:
Mesh:
Year: 2017 PMID: 28729684 PMCID: PMC5519637 DOI: 10.1038/s41598-017-06596-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Data profile for the beat-based 10-fold cross validation scheme.
| Folder | Number of beats in training set | Total training beats | Number of beats in test set | Total test beats | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| A | L | N | P | R | V | A | L | N | P | R | V | |||
| 1 | 2292 | 7265 | 67520 | 6323 | 6530 | 6417 | 96347 | 254 | 807 | 7502 | 702 | 725 | 712 | 10702 |
| 2 | 2292 | 7265 | 67520 | 6323 | 6530 | 6416 | 96346 | 254 | 807 | 7502 | 702 | 725 | 713 | 10703 |
| 3 | 2292 | 7265 | 67520 | 6323 | 6530 | 6416 | 96346 | 254 | 807 | 7502 | 702 | 725 | 713 | 10703 |
| 4 | 2292 | 7265 | 67520 | 6323 | 6530 | 6416 | 96346 | 254 | 807 | 7502 | 702 | 725 | 713 | 10703 |
| 5 | 2291 | 7265 | 67520 | 6323 | 6530 | 6416 | 96345 | 255 | 807 | 7502 | 702 | 725 | 713 | 10704 |
| 6 | 2291 | 7265 | 67520 | 6322 | 6529 | 6416 | 96343 | 255 | 807 | 7502 | 703 | 726 | 713 | 10706 |
| 7 | 2291 | 7265 | 67520 | 6322 | 6529 | 6416 | 96343 | 255 | 807 | 7502 | 703 | 726 | 713 | 10706 |
| 8 | 2291 | 7265 | 67520 | 6322 | 6529 | 6416 | 96343 | 255 | 807 | 7502 | 703 | 726 | 713 | 10706 |
| 9 | 2291 | 7264 | 67519 | 6322 | 6529 | 6416 | 96341 | 255 | 808 | 7503 | 703 | 726 | 713 | 10708 |
| 10 | 2291 | 7264 | 67519 | 6322 | 6529 | 6416 | 96341 | 255 | 808 | 7503 | 703 | 726 | 713 | 10708 |
Data profile for the record-based 10-fold cross validation scheme.
| Folder | Number of beats in training set | Total training beats | Number of beats in test set | Total test beats | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| A | L | N | P | R | V | A | L | N | P | R | V | |||
| 1 | 2414 | 5581 | 70682 | 5483 | 5090 | 6912 | 96162 | 132 | 2491 | 4340 | 1542 | 2165 | 217 | 10887 |
| 2 | 2544 | 5949 | 70989 | 4947 | 5725 | 6461 | 96615 | 2 | 2123 | 4033 | 2078 | 1530 | 668 | 10434 |
| 3 | 2427 | 6615 | 68715 | 5647 | 7170 | 6979 | 97553 | 119 | 1457 | 6307 | 1378 | 85 | 150 | 9496 |
| 4 | 2544 | 6071 | 70167 | 4998 | 5430 | 6760 | 95970 | 2 | 2001 | 4855 | 2027 | 1825 | 369 | 11079 |
| 5 | 2545 | 5581 | 70719 | 4947 | 6002 | 7027 | 96821 | 1 | 2491 | 4303 | 2078 | 1253 | 102 | 10228 |
| 6 | 1128 | 5949 | 70189 | 5483 | 6858 | 6503 | 96110 | 1418 | 2123 | 4833 | 1542 | 397 | 626 | 10939 |
| 7 | 1960 | 6615 | 69882 | 4998 | 5005 | 6809 | 95269 | 586 | 1457 | 5140 | 2027 | 2250 | 320 | 11780 |
| 8 | 2534 | 6071 | 69583 | 5647 | 5725 | 6596 | 96156 | 12 | 2001 | 5439 | 1378 | 1530 | 533 | 10893 |
| 9 | 2231 | 4124 | 72718 | 5483 | 7170 | 6824 | 98550 | 315 | 3948 | 2304 | 1542 | 85 | 305 | 8499 |
| 10 | 2539 | 5949 | 71772 | 4998 | 5430 | 6294 | 96982 | 7 | 2123 | 3250 | 2027 | 1825 | 835 | 10067 |
Record division of the training and test sets for the record-based cross validation scheme.
| Folder | Training set | Test set | Folder | Training set | Test set |
|---|---|---|---|---|---|
| 1 | All other records | 100, 101, 109, 118, 217 | 6 | All other records | 111, 202, 203, 217, 232 |
| 2 | All other records | 105, 106, 107, 111, 124 | 7 | All other records | 102, 118, 207, 209, 210 |
| 3 | All other records | 104, 112, 113, 114, 207 | 8 | All other records | 104, 124, 214, 215, 219 |
| 4 | All other records | 102, 116, 117, 212, 214 | 9 | All other records | 109, 207, 217, 222 |
| 5 | All other records | 107, 109, 122, 123, 231 | 10 | All other records | 102, 111, 212, 233 |
Figure 1Block diagram of the proposed feature extraction algorithm.
Figure 2Beat segmentation of a typical ECG signal.
Figure 3The decomposition process of the 8-level WMRA.
Figure 4Low-dimensional feature vector generated by PCA using wavelet coefficients.
Instructions for the definitions of TP, TN, FP, and FN for N beat.
| A | L | N | P | R | V | |
|---|---|---|---|---|---|---|
| A |
|
|
|
|
|
|
| L |
|
|
|
|
|
|
| N |
|
|
|
|
|
|
| P |
|
|
|
|
|
|
| R |
|
|
|
|
|
|
| V |
|
|
|
|
|
|
Figure 5Classification rates with different numbers of principle components.
Figure 6Classification performance for the beat-based (a) and record-based (b) schemes using 10-fold cross validation.
Classification Results for the 10-fold cross validation.
| Beat type | Beat-based training scheme | Record-based training scheme | |||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
| ||
| Mean | A | 83.35 | 99.91 | 99.52 | 0.76 | 93.98 | 91.63 |
| L | 99.32 | 99.97 | 99.92 | 0.00 | 99.44 | 77.68 | |
| N | 99.67 | 98.14 | 99.21 | 90.79 | 53.19 | 70.18 | |
| P | 99.87 | 100.0 | 99.99 | 11.00 | 99.95 | 85.02 | |
| R | 99.27 | 99.96 | 99.92 | 4.17 | 99.96 | 88.46 | |
| V | 97.45 | 99.79 | 99.63 | 74.32 | 75.83 | 75.82 | |
|
|
|
|
|
|
|
| |
| Standard deviation | A | 1.76 | 0.03 | 0.05 | 1.26 | 5.78 | 5.42 |
| L | 0.21 | 0.02 | 0.02 | 0.00 | 1.07 | 8.77 | |
| N | 0.07 | 0.21 | 0.07 | 10.82 | 18.45 | 10.11 | |
| P | 0.10 | 0.01 | 0.01 | 17.78 | 0.08 | 3.43 | |
| R | 0.31 | 0.02 | 0.03 | 12.42 | 0.03 | 7.18 | |
| V | 0.67 | 0.04 | 0.07 | 10.59 | 15.12 | 14.58 | |
|
|
|
|
|
|
|
| |
Comparison between the related works and the method proposed in this study.
| Literatures and feature extraction methods | Feature selection (dimension) | Beat types | Training/test beats | Classifiers | Independent training/test data | k-fold cross validation |
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| Spectral correlation[ | Yes (88) | 5 | Totally 6259 | SVM | Unknown | 10-fold | 99.20 | 99.70 | 98.60 |
| Wavelet transform, morphological features[ | No (28) | 5 | 10675/93894 | Artificial neural network | No | No | 88.60 | 96.18 | 97.86 |
| Morphological features[ | Yes (6) | 6 | 35848/35848 | Linear discriminant analysis | No | No | 91.19 | 98.65 | 94.03 |
| Morphological features[ | No (13) | 3 | 600/30273 | SVM, neural network | No | No | 98.52 | 99.19 | 97.14 |
| Time domain features[ | No (9) | 6 | 42427/14142 | Decision tree | No | No | 97.50 | 99.80 | 99.51 |
| Morphological features[ | No (16) | 3 | 15509/8081 | SVM, neural network | Yes | No | 92.82 | 93.74 | 92.85 |
| Morphological features[ | No (8) | 5 | 12570/12570 | Regression neural network | No | No | 85.50 | 99.40 | 99.40 |
| Fourier transform, wavelet package[ | Yes (70) | 16 | 3345/2542 |
| No | No | 85.59 | 99.56 | 93.59 |
| Wavelet transform, cosine transform[ | Yes (18) | 4 | 720/360 | SVM | Unknown | No | 98.60 | 95.50 | 96.50 |
| Wavelet transform[ | Yes (24) | 5 | 900/900 | SVM, genetic algorithm | No | No | 98.50 | 99.69 | 98.80 |
| Higher order spectral[ | No (7) | 5 | 330/500 | SVM | Unknown | No | 90.00 | 87.93 | 85.79 |
| Wavelet transform[ | Yes (20) | 4 | 360/360 | SVM | Unknown | No | 98.62 | 99.54 | 98.61 |
| Temporal and spectral features[ | Yes (15) | 6 | 1440/720 | SVM | No | No | 97.60 | 93.80 | 95.20 |
| Temporal and spectral features[ | Yes (13) | 8 | Totally 17857 | SVM | No | 5-fold | 95.00 | 99.00 | 98.60 |
| Higher order statistics, wavelet packet[ | Yes (28) | 5 | 3345/2542 |
| Yes | No | 89.80 | 97.80 | — |
| Hilbert-Huang transform[ | Yes (18) | 6 | 10700/10700 | SVM | No | No | 98.64 | 99.77 | 99.51 |
| Wavelet transform[ | Yes (18) | 5 | Totally 101352 | SVM | Yes | 44-fold | — | — | 86.40 |
| 16 | 24100/86009 | No | No | 99.32 | — | 99.01 | |||
| Approximate entropy, wavelet packet[ | Yes (9) | 5 | 145/145 | SVM, PNN | Unknown | No | 98.70 | 99.70 | 98.60 |
| Non-linear and center-clipping transform[ | No (5) | 5 | 13640/13640 | Wavelet neural network | No | No | 98.78 | 99.70 | 98.78 |
| Eigenvector method[ | Yes (12) | 4 | 360/360 | Recurrent neural network | Unknown | No | 98.89 | 99.25 | 98.06 |
| Higher order statistics[ | No (24) | 5 | 4000/14299 | RBF neural network | No | No | 92.93 | 98.52 | 95.18 |
| Geometrical features[ | No (18) | 7 | 4035/3150 | SVM, | No | No | 97.52 | 99.65 | 98.06 |
| Wavelet transform, morphological features[ | Yes (8) | 3 | 50928/49636 | Linear discriminant analysis | Yes | No | 80.00 | — | 94.00 |
| Wavelet transform, linear prediction model[ | No (12) | 3 | 50554/49273 | Linear discriminant analysis | Unknown | No | 86.50 | — | 86.50 |
| Cross correlation[ | No (30) | 3 | 41961/51285 | Artificial neural network | Unknown | No | 97.49 | — | 95.24 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Figure 7Comparison between our method and traditional wavelet transform-based methods.