| Literature DB >> 32019220 |
Robert Czabanski1, Krzysztof Horoba2, Janusz Wrobel2, Adam Matonia2, Radek Martinek3, Tomasz Kupka2, Michal Jezewski1, Radana Kahankova3, Janusz Jezewski2, Jacek M Leski1.
Abstract
Atrial fibrillation (AF) is a serious heart arrhythmia leading to a significant increase of the risk for occurrence of ischemic stroke. Clinically, the AF episode is recognized in an electrocardiogram. However, detection of asymptomatic AF, which requires a long-term monitoring, is more efficient when based on irregularity of beat-to-beat intervals estimated by the heart rate (HR) features. Automated classification of heartbeats into AF and non-AF by means of the Lagrangian Support Vector Machine has been proposed. The classifier input vector consisted of sixteen features, including four coefficients very sensitive to beat-to-beat heart changes, taken from the fetal heart rate analysis in perinatal medicine. Effectiveness of the proposed classifier has been verified on the MIT-BIH Atrial Fibrillation Database. Designing of the LSVM classifier using very large number of feature vectors requires extreme computational efforts. Therefore, an original approach has been proposed to determine a training set of the smallest possible size that still would guarantee a high quality of AF detection. It enables to obtain satisfactory results using only 1.39% of all heartbeats as the training data. Post-processing stage based on aggregation of classified heartbeats into AF episodes has been applied to provide more reliable information on patient risk. Results obtained during the testing phase showed the sensitivity of 98.94%, positive predictive value of 98.39%, and classification accuracy of 98.86%.Entities:
Keywords: AF detection; HRV features; atrial fibrillation (AF); heart rate variability (HRV); support vector machine (SVM)
Mesh:
Year: 2020 PMID: 32019220 PMCID: PMC7038413 DOI: 10.3390/s20030765
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1ECG signals (04043 and 04045) taken from the MIT-BIH AF database, with recognized segments of atrial fibrillation and the non-AF ones.
Figure 2Two HR signals expressed in beats per minute (bpm) with clinically recognized AF segments of different characteristics of HR changes in relation to normal sinus rhythm (non-AF). The AF segments are marked using the experts’ annotations provided for particular records in the MIT-BIH AF database.
Figure 3Distribution of successive RR intervals in the polar coordinate system, illustrating the definitions of the de Haan’s index describing the short-term HR variability.
The size of the analyzed classes described by the number of AF (NAF), non-AF (NnAF) episodes, as well as by the percentage of AF episodes (NAF) to the total number of the heartbeats (NSIG) in a given signal.
|
| |||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
| 332 | 11057 | 525 | 14634 | 813 | 3293 | 30873 | 5810 | 39681 | 138 | 33759 | 934 | 53115 |
|
| 39880 | 41687 | 43459 | 47260 | 39100 | 39546 | 16979 | 55929 | 13944 | 36634 | 16101 | 44579 | 2019 |
| 0.83% | 20.96% | 1.19% | 23.64% | 2.04% | 7.69% | 64.52% | 9.41% | 74.00% | 0.38% | 67.71% | 2.05% | 96.34% | |
|
| |||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
| 445 | 27505 | 39277 | 60245 | 40035 | 6758 | 33118 | 14194 | 11478 | 45083 | 2310 | 44252 | 519664 |
|
| 34371 | 27663 | 0 | 0 | 16538 | 29820 | 10217 | 45078 | 34016 | 13752 | 37519 | 15279 | 701370 |
| 1.28% | 49.86% | 100.0% | 100.0% | 70.77% | 18.48% | 76.42% | 23.95% | 25.23% | 76.63% | 5.80% | 74.33% | 42.56% | |
The results of MIT-BIH AF database evaluation using the LSVM classifier trained with the balanced data separately extracted from each of the signal. The second column provides the percentage of the size of the training data (NTRN) to the number of all heartbeats (NALL) in the MIT-BIH AF database. Columns 3–8 shows the results for testing data only (not used during the learning phase), the last column presents the F-Score values (FSALL) calculated using all heartbeats from the MIT-BIH AF database. The best results are in boldface.
| Signal | Performance Measure [%] | |||||||
|---|---|---|---|---|---|---|---|---|
| CA | Se | Sp | PPV | NPV | FS | FSALL | ||
| 00735 | 0.03 | 93.60 ± 0.53 * | 97.54 ± 0.52 | 90.68 ± 1.10 | 88.60 ± 1.15 | 98.03 ± 0.39 | 92.85 ± 0.54 | 92.85 ± 0.54 |
| 03665 | 0.91 | 81.42 ± 0.42 | 60.55 ± 1.04 | 96.84 ± 0.08 | 93.41 ± 0.13 | 76.87 ± 0.46 | 73.47 ± 0.77 | 73.81 ± 0.76 |
| 04015 | 0.04 | 76.17 ± 1.63 | 48.14 ± 4.20 | 96.93 ± 0.46 | 92.09 ± 0.82 | 71.66 ± 1.62 | 63.12 ± 3.55 | 63.15 ± 3.55 |
| 04043 | 1.20 | 91.47 ± 1.93 | 95.61 ± 1.84 | 88.41 ± 2.95 | 85.99 ± 2.99 | 96.48 ± 1.44 | 90.52 ± 2.01 | 90.63 ± 1.98 |
| 04048 | 0.07 | 89.56 ± 2.69 | 87.37 ± 7.56 | 91.18 ± 1.59 | 88.06 ± 1.46 | 91.01 ± 4.75 | 87.53 ± 3.88 | 87.54 ± 3.88 |
|
| 0.27 | 77.81 ± 1.22 | 51.32 ± 3.02 |
|
| 73.02 ± 1.20 | 66.25 ± 2.56 | 66.39 ± 2.54 |
| 04746 | 1.39 | 91.55 ± 0.15 | 98.80 ± 0.08 | 86.20 ± 0.27 | 84.08 ± 0.26 | 98.99 ± 0.07 | 90.85 ± 0.14 | 90.99 ± 0.14 |
| 04908 | 0.48 | 92.70 ± 0.72 | 96.64 ± 0.39 | 89.79 ± 1.32 | 87.53 ± 1.42 | 97.31 ± 0.29 | 91.85 ± 0.74 | 91.90 ± 0.74 |
| 04936 | 1.14 | 87.53 ± 0.51 | 91.68 ± 0.36 | 84.47 ± 0.94 | 81.34 ± 0.91 | 93.22 ± 0.26 | 86.20 ± 0.48 | 86.36 ± 0.48 |
|
| 0.01 | 90.29 ± 0.53 |
| 83.75 ± 1.07 | 81.89 ± 0.94 |
| 89.68 ± 0.49 | 89.68 ± 0.49 |
| 05121 | 1.32 | 87.38 ± 0.66 | 96.21 ± 0.41 | 80.87 ± 1.16 | 78.79 ± 1.02 | 96.66 ± 0.35 | 86.63 ± 0.62 | 86.78 ± 0.61 |
| 05261 | 0.08 | 83.01 ± 2.89 | 64.54 ± 7.01 | 96.70 ± 0.23 | 93.50 ± 0.49 | 78.78 ± 3.28 | 76.17 ± 5.10 | 76.19 ± 5.09 |
| 06426 | 0.17 | 91.84 ± 0.60 | 89.91 ± 0.89 | 93.26 ± 1.05 | 90.83 ± 1.27 | 92.59 ± 0.58 | 90.36 ± 0.66 | 90.38 ± 0.66 |
| 06453 | 0.04 | 92.30 ± 0.83 | 95.70 ± 2.44 | 89.78 ± 1.57 | 87.45 ± 1.55 | 96.63 ± 1.74 | 91.36 ± 1.01 | 91.36 ± 1.01 |
| 06995 | 2.25 | 91.07 ± 0.61 | 86.59 ± 1.44 | 94.37 ± 0.30 | 91.89 ± 0.41 | 90.54 ± 0.92 | 89.15 ± 0.82 | 89.40 ± 0.80 |
| 07879 | 1.35 | 91.16 ± 0.27 | 93.12 ± 0.51 | 89.72 ± 0.29 | 86.98 ± 0.32 | 94.65 ± 0.38 | 89.95 ± 0.31 | 90.10 ± 0.31 |
| 07910 | 0.55 | 88.31 ± 0.87 | 89.04 ± 1.23 | 87.77 ± 1.10 | 84.35 ± 1.23 | 91.55 ± 0.89 | 86.63 ± 0.98 | 86.71 ± 0.97 |
| 08215 | 0.84 | 89.19 ± 0.46 | 96.44 ± 1.00 | 83.84 ± 0.84 | 81.53 ± 0.72 | 96.96 ± 0.81 | 88.35 ± 0.50 | 88.46 ± 0.49 |
| 08219 | 1.16 | 85.99 ± 1.11 | 72.70 ± 2.54 | 95.80 ± 0.55 | 92.75 ± 0.90 | 82.64 ± 1.35 | 81.49 ± 1.70 | 81.76 ± 1.67 |
| 08378 | 0.94 | 91.24 ± 0.27 | 97.58 ± 0.29 | 86.56 ± 0.47 | 84.29 ± 0.45 | 97.98 ± 0.23 | 90.45 ± 0.28 | 90.55 ± 0.27 |
|
| 1.13 |
| 96.11 ± 0.47 | 92.82 ± 0.28 | 90.81 ± 0.30 | 97.00 ± 0.35 |
|
|
| 08434 | 0.19 | 92.81 ± 0.39 | 97.37 ± 0.51 | 89.44 ± 0.64 | 87.23 ± 0.66 | 97.87 ± 0.40 | 92.02 ± 0.42 | 92.04 ± 0.42 |
| 08455 | 1.25 | 92.99 ± 0.21 | 97.73 ± 0.19 | 89.49 ± 0.39 | 87.28 ± 0.40 | 98.16 ± 0.15 | 92.21 ± 0.22 | 92.32 ± 0.21 |
* mean ± standard deviation.
The results of MIT-BIH AF database evaluation using the LSVM classifier trained with the balanced dataset (NTRN = 17 000), extracted from: the signal of highest FS (TD1), mixed signals of the highest Se and FS (TD2), mixed signals that were characterized by the highest Se, Sp and FS (TD3), and all the signals (TD4). The best results obtained for testing data only and for all data are in boldface.
| Training | Performance Measure [%] | |||||
|---|---|---|---|---|---|---|
| Se | Sp | PPV | NPV | FS | CA | |
| Testing data only | ||||||
| TD1 | 96.11 ± 0.47 * | 92.82 ± 0.28 | 90.81 ± 0.30 | 97.00 ± 0.35 | 93.39 ± 0.22 | 94.22 ± 0.19 |
| TD2 | 95.63 ± 0.76 | 92.43 ± 0.29 | 90.31 ± 0.31 | 96.64 ± 0.56 | 92.89 ± 0.36 | 93.79 ± 0.30 |
| TD3 | 82.53 ± 0.94 | 95.19 ± 0.17 | 92.68 ± 0.22 | 88.08 ± 0.56 | 87.31 ± 0.53 | 89.82 ± 0.37 |
|
|
|
|
|
|
|
|
| All the database | ||||||
| TD1 | 96.17 ± 0.46 | 92.89 ± 0.27 | 90.93 ± 0.30 | 97.03 ± 0.34 | 93.47 ± 0.22 | 94.28 ± 0.19 |
| TD2 | 95.70 ± 0.74 | 92.52 ± 0.28 | 90.46 ± 0.31 | 96.68 ± 0.55 | 93.00 ± 0.36 | 93.87 ± 0.29 |
| TD3 | 82.81 ± 0.92 | 95.25 ± 0.16 | 92.82 ± 0.21 | 88.21 ± 0.55 | 87.53 ± 0.52 | 89.96 ± 0.37 |
|
|
|
|
|
|
|
|
* mean ± standard deviation.
The results of MIT-BIH AF database evaluation when FHR variability features were excluded from recognition of the AF episodes. The LSVM classifier was trained with the balanced dataset (NTRN = 17 000), extracted from: the signal of highest FS (TD1), mixed signals of the highest Se and FS (TD2), mixed signals that were characterized by the highest Se, Sp and FS (TD3), and all the signals (TD4). The best results obtained for testing data only and for all data are in boldface.
| Training | Performance Measure [%] | |||||
|---|---|---|---|---|---|---|
| Se | Sp | PPV | NPV | FS | CA | |
| Testing data only | ||||||
| TD1 | 91.04 ± 1.75 * | 93.61 ± 0.34 | 91.34 ± 0.41 | 93.42 ± 1.20 | 91.18 ± 0.91 | 92.52 ± 0.71 |
| TD2 | 91.90 ± 1.30 | 92.64 ± 0.40 | 90.21 ± 0.48 | 93.95 ± 0.91 | 91.04 ± 0.72 | 92.32 ± 0.58 |
| TD3 | 82.08 ± 1.13 | 95.17 ± 0.20 | 92.62 ± 0.25 | 87.81 ± 0.67 | 87.03 ± 0.64 | 89.61 ± 0.45 |
|
|
|
|
|
|
|
|
| All the database | ||||||
| TD1 | 91.16 ± 1.73 | 93.68 ± 0.34 | 91.44 ± 0.40 | 93.48 ± 1.19 | 91.29 ± 0.90 | 92.61 ± 0.70 |
| TD2 | 92.03 ± 1.28 | 92.73 ± 0.39 | 90.37 ± 0.47 | 94.02 ± 0.90 | 91.12 ± 0.71 | 92.43 ± 0.57 |
| TD3 | 82.37 ± 1.12 | 95.23 ± 0.19 | 92.76 ± 0.25 | 87.95 ± 0.66 | 87.25 ± 0.63 | 89.76 ± 0.45 |
|
|
|
|
|
|
|
|
* mean ± standard deviation.
Figure 4An example of matching the reference and detected AF episodes before and after aggregation. Accidental changes of AF status eliminated in the aggregation stage are marked.
Figure 5The values of Se and PPV obtained for 361 combinations of the applied validation criteria: window width w = {10, 20,…,190} and percentage threshold p = {5, 10,…, 95}.
Figure 6The distances of points defined by (Se(w, p), PPV(w, p)) to the point of the highest values (100%, 100%), obtained for the 361 combinations of the following validation criteria: window width w = {10, 20,…,190} and percentage threshold p = {5, 10,…, 95}. The brightest rectangle indicates the maximum FS (the highest algorithm performance) determined for a given combination (w, p).
The AF detection performance obtained without and with aggregation process, which was optimized for the window width of 110 beats and percentage threshold equal to 55%, calculated for all 1,221,033 reference annotated heartbeats.
| Performance Measures [%] | No Aggregation | With Aggregation |
|---|---|---|
| Se | 98.10 | 98.94 |
| Sp | 97.50 | 98.80 |
| PPV | 96.67 | 98.39 |
| NPV | 98,57 | 99,21 |
| CA | 97.75 | 98.86 |
| FS | 97.38 | 98.66 |
An overview of published results of existing AF detection methods using the MIT-BIH Atrial Fibrillation Database.
| Method | Features | Window | Key Techniques | Results | |||
|---|---|---|---|---|---|---|---|
| Se | Sp | PPV | CA | ||||
| Tateno et al. 2001 [ | RR difference | 100 beats | Histogram, Kolmogorov-Smirnov test, ROC. | 94.4 | 97.2 | 96.1 | – |
| Huang et al. 2011 [ | RR difference | 23 beats | Histogram, SD analysis, Kolmogorov-Smirnov test. | 96.1 | 98.1 | – | – |
| Petrucci et al. 2005 [ | RR difference, RR prematurity | 60 s | Geometric measures of histogram, Score system. | 92 | – | 92 | – |
| Lian et al. 2011 [ | RR interval, RR difference | 128 beats | Mapping RR intervals versus RR differences, thresholds. | 95.8 | 96.4 | – | – |
| Petrenas et al. 2015 [ | RR interval | 8 beats | Thresholds | 97.1 | 98.3 | – | – |
| Logan et al. 2005 [ | RR interval | 600 beats | Variance of normalized RR interval, simple majority voting. | 96 | 89 | – | – |
| Islam et al. 2016 [ | RR interval | 70 beats | Normalization of RR intervals by an affine transformation. | 96.39 | 96.38 | 95.19 | 96.38 |
| Babaeizadeh. et al. 2009 [ | RR interval P-wave measurements | - | Stationary first-order Markov process, decision tree. | 94 | 99 | 98 | – |
| Zhou et al. 2014 [ | RR interval | 127 beats | Mapping the RR sequence into symbolic one, Shannon entropy, ROC. | 96.89 | 98.25 | 97.62 | 97.67 |
| Zhou et al. 2015 [ | RR interval | 127 beats | Online version of [ | 97.37 | 98.44 | 97.89 | 97.99 |
| Cui et al. 2017 [ | RR interval | 150 beats | Mapping the RR sequence into symbolic one, dissimilarity index. | 97.04 | 97.96 | – | 97.78 |
| Dash et al. 2009 [ | RR difference | 128 beats | Turning points, RMSSDD, Shannon entropy, ROC. | 94.4 | 95.1 | – | – |
| Lake et al. 2011 [ | RR difference | 12 beats | Coefficient of sample entropy (CoSEn), ROC. | 91 | 94 | – | – |
| Kennedy et al. 2016 [ | RR difference | 30 beats | Random forest, k-nearest neighbor. | 97.6 | 98.3 | 92.1 | – |
| Andersen et al. 2017 [ | RR interval | 300 beats | Sample entropy, Shannon entropy, CoSEn, SVM. | 96.81 | 96.20 | – | 96.45 |
| Kumar et al. 2018 [ | ECG features | 1000 samples | Wavelet transform, Random forest. | 95.8 | 97.8 | – | 96.8 |
| Nuryani et al. 2015 [ | RR difference | SVM with radial basis function. | 95.81 | 98.44 | – | 97.50 | |
| Colloca et al. 2013 [ | RR difference | 30 | Entropy, SVM with radial basis function | 99.20 | - | 59.33 | 86.60 |
| Faust et al. 2018 [ | - | 100 | Deep Recurrent Neural Network (RNN) with Long Short-Term Memory (LSTM). | 98.51 | 98.32 | – | 98.67 |
| Andersen et al. 2019 [ | - | 31 | Deep learning combining with the convolutional- and Recurrent-Neural Networks. | 98.98 | 96.95 | 95.76 | 97.80 |
| Wrobel et al. 2018 [ | HR irregularity features | 21 | Linear classifier | 95.42 | 96.12 | 94.97 | 95.62 |
| Proposed method 2019 | HR irregularity features | 21 | LSVM | 98.94 | 98.39 | 98.86 | 98.66 |