| Literature DB >> 34261486 |
Bambang Tutuko1, Siti Nurmaini2, Alexander Edo Tondas3, Muhammad Naufal Rachmatullah1, Annisa Darmawahyuni1, Ria Esafri1, Firdaus Firdaus1, Ade Iriani Sapitri1.
Abstract
BACKGROUND: Generalization model capacity of deep learning (DL) approach for atrial fibrillation (AF) detection remains lacking. It can be seen from previous researches, the DL model formation used only a single frequency sampling of the specific device. Besides, each electrocardiogram (ECG) acquisition dataset produces a different length and sampling frequency to ensure sufficient precision of the R-R intervals to determine the heart rate variability (HRV). An accurate HRV is the gold standard for predicting the AF condition; therefore, a current challenge is to determine whether a DL approach can be used to analyze raw ECG data in a broad range of devices. This paper demonstrates powerful results for end-to-end implementation of AF detection based on a convolutional neural network (AFibNet). The method used a single learning system without considering the variety of signal lengths and frequency samplings. For implementation, the AFibNet is processed with a computational cloud-based DL approach. This study utilized a one-dimension convolutional neural networks (1D-CNNs) model for 11,842 subjects. It was trained and validated with 8232 records based on three datasets and tested with 3610 records based on eight datasets. The predicted results, when compared with the diagnosis results indicated by human practitioners, showed a 99.80% accuracy, sensitivity, and specificity. RESULT: Meanwhile, when tested using unseen data, the AF detection reaches 98.94% accuracy, 98.97% sensitivity, and 98.97% specificity at a sample period of 0.02 seconds using the DL Cloud System. To improve the confidence of the AFibNet model, it also validated with 18 arrhythmias condition defined as Non-AF-class. Thus, the data is increased from 11,842 to 26,349 instances for three-class, i.e., Normal sinus (N), AF and Non-AF. The result found 96.36% accuracy, 93.65% sensitivity, and 96.92% specificity.Entities:
Keywords: 1D-convolutional neural network; Atrial fibrillation; Cloud deep learning
Mesh:
Year: 2021 PMID: 34261486 PMCID: PMC8281594 DOI: 10.1186/s12911-021-01571-1
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Flow diagram of deep learning model for AF diagnosis system
ECG Record description for two-class case (N and AF)
| Dataset | Frequency sampling (Hz) | Class | Records | Training data | Validation data | Testing/unseen data |
|---|---|---|---|---|---|---|
| PhysioNet/CinC challenge 2017 | 300 | N | 5154 | – | ||
| AF | 771 | – | ||||
| China physiological signal challenge 2018 | 500 | N | 918 | 7409 | 823 | – |
| AF | 1098 | – | ||||
| MIT–BIH atrial fibrillation | 250 | AF | 291 | – | ||
| MIT–BIH arrhythmia | 360 | AF | 6 | – | – | – |
| ECG long term | 128 | AF | 38 | – | – | – |
| Paroxysmal AF | 128 | AF | 48 | – | – | – |
| AF termination challenge | 128 | AF | 10 | – | – | – |
| Fantasia | 250 | N | 24 | – | – | – |
| N | 1646 | – | – | – | ||
| ECG recording from Chapman University and Shaoxing People’s Hospital | 500 | AF | 1780 | – | – | 3610 |
| Indonesian Hospital (ECG 1) | 500 | N | 42 | – | – | – |
| AF | 3 | – | – | – | ||
| Indonesian Hospital (ECG 2) | 400 | AF | 13 | – | – | – |
| Total | 7409 | 823 | 3610 |
ECG record description for three-class case (N, AF, and Non-AF)
| Dataset | Frequency sampling (Hz) | Conditions | Class | Records | Training data | Validation data |
|---|---|---|---|---|---|---|
| PhysioNet/CinC challenge 2017 | 300 | N | N | 5154 | ||
| AF | AF | 771 | ||||
| Others | Non-AF | 2557 | ||||
| Noisy | 46 | |||||
| Normal | N | 918 | ||||
| China physiological signal challenge 2018 | 500 | AF (Atrial Fibrillation) | AF | 1098 | ||
| I-AVB (First-degree atrioventricular block) | 704 | |||||
| LBBB (Left bundle branch block | 207 | |||||
| RBBB (Right bundle branch block | Non-AF | 1695 | ||||
| PAC (Premature atrial contraction) | 574 | |||||
| PVC (Premature ventricular contraction) | 653 | |||||
| STD (ST-segment elevated) | 826 | |||||
| STE (ST-segment elevated | 202 | 23,714 | 2635 | |||
| MIT-BIH trial fibrillation | 250 | AF (Atrial Fibrillation) | AF | 291 | ||
| ECG recording from Chapman University and Shaoxing People’s Hospital | 500 | SR (Sinus rhythm) | N | 1826 | ||
| AF (Atrial fibrillation) | AF | 1780 | ||||
| SB (Sinus bradycardia) | 3889 | |||||
| ST (Sinus tachycardia) | 1568 | |||||
| AFL (atrial flutter) | 445 | |||||
| SI (Sinus irregularity) | 399 | |||||
| SVT (Supraventricular tachycardia | Non-AF | 587 | ||||
| AT (Atrial tachycardia) | 121 | |||||
| AVNRT (Atrioventricular node reentrant) | 16 | |||||
| AVRT (Atrioventricular reentrant tachycardia) | 8 | |||||
| SAAWR (Sinus atrium to atrial wandering rhythm) | 7 | |||||
| Indonesia Hospital (ECG 1) | 500 | Non-AF (other rhythms) | Non-AF | 7 |
Fig. 2Sample of ECG raw data for (a) N, (b) AF, and (c) Non-AF rhythms from 11 total datasets with several devices, length of recording and frequency sampling
Fig. 3Proposed the AFibNet methodology
Feature learning interpretation
| Layer | Input nodes | Filter number | Kernel size/pool size | Output nodes | Feature interpretation |
|---|---|---|---|---|---|
| Input | 2700, 1 | – | – | ECG amplitude for one episode | |
| Convolution 1 | 2700, 1 | 64 | 3 | 2698 | 64 feature map |
| Convolution 2 | 2698 | 64 | 3 | 2696 | 64 feature map |
| Max pooling 1 | 2696 | – | 2 | 1348 | Feature reduction (1348 nodes for one episode) |
| Convolution 3 | 1348 | 128 | 3 | 1346 | 128 feature map |
| Convolution 4 | 1346 | 128 | 3 | 1344 | 128 feature map |
| Max pooling 2 | 1344 | – | 2 | 672 | Feature reduction (672 nodes for one episode) |
| Convolution 5 | 672 | 256 | 3 | 670 | 256 feature map |
| Convolution 6 | 670 | 256 | 3 | 668 | 256 feature map |
| Convolution 7 | 668 | 256 | 3 | 666 | 256 feature map |
| Max pooling 3 | 666 | – | 2 | 333 | Feature reduction (672 nodes for one episode) |
| Convolution 8 | 333 | 512 | 3 | 331 | 512 feature map |
| Convolution 9 | 331 | 512 | 3 | 329 | 512 feature map |
| Convolution 10 | 329 | 512 | 3 | 327 | 512 feature map |
| Max pooling 4 | 327 | – | 2 | 163 | Feature reduction (163 nodes for one episode) |
| Convolution 11 | 163 | 512 | 3 | 161 | 512 feature map |
| Convolution 12 | 161 | 512 | 3 | 159 | 512 feature map |
| Convolution 13 | 159 | 512 | 3 | 157 | 512 feature map |
| Max pooling 5 | 157 | - | 2 | 78 | Feature reduction (78 nodes for one episode) |
| Flatten | 39,936 | – | – | – | Dot product between 78 nodes and 512 feature map |
| Dense | – | – | – | 1000 | Weight params |
| Dense | – | – | – | 1000 | Weight params |
| Output | – | – | – | 1 | Class |
Data segementation with a tenfold scheme for a combination of three datasets (MIT-BIH Atrial Fibrillation, the 2017 PhysioNet/CinC Challenge, the China Physiological Signal Challenge 2018 databases)
| Fold | Training data | Validation data | Total | ||
|---|---|---|---|---|---|
| N | AF | N | AF | ||
| 1 | 16,485 | 32,149 | 1790 | 3614 | 54,038 |
| 2 | 16,391 | 32,243 | 1884 | 3520 | 54,038 |
| 3 | 16,424 | 32,210 | 1851 | 3553 | 54,038 |
| 4 | 16,402 | 32,232 | 1873 | 3531 | 54,038 |
| 5 | 16,469 | 32,165 | 1806 | 3598 | 54,038 |
| 6 | 16,520 | 32,114 | 1755 | 3649 | 54,038 |
| 7 | 16,476 | 32,158 | 1799 | 3605 | 54,038 |
| 8 | 16,453 | 32,181 | 1822 | 3582 | 54,038 |
| 9 | 16,416 | 32,219 | 1859 | 3544 | 54,038 |
| 10 | 16,439 | 32,196 | 1836 | 3567 | 54,038 |
All performance of the AFibNet with several datasets
| Dataset | Class | Number of subjects | Performance (%) | ||
|---|---|---|---|---|---|
| Accuracy | Sensitivity | Specificity | |||
| The 2017 PhysioNet/CinC challenge | N | ||||
| China physiological signal challenge 2018 | AF | 8232 | 99.8 | 99.8 | 99.8 |
| MIT-BIH atrial fibrillation | |||||
| ECG long term AF | AF | 38 | 100 | 100 | – |
| Paroxysmal AF | AF | 48 | 100 | 100 | – |
| MIT-BIH Arrhythmia | AF | 6 | 100 | 100 | – |
| AF termination challenge | AF | 10 | 100 | 100 | – |
| Fantasia | N | 24 | 100 | 100 | – |
| Indonesian Hospital (ECG 1) | N | 42 | 100 | 100 | 100 |
| AF | 3 | ||||
| Indonesian Hospital (ECG 2) | AF | 13 | 100 | 100 | – |
| ECG recording from Chapman University and Shaoxing People’s Hospital | N | 1646 | 98.86 | 98.88 | |
| F | 1780 | ||||
| All unseen data testing | N | 1712 | 98.94 | 98.97 | 98.97 |
| AF | 1898 | ||||
Training and validation dataset: The sample of data used to fit the and provide an unbiased evaluation of a model fit on the training dataset while tuning model hyperparameters. Unseen data: The unseen data can include data having an attribute not seen by the data set
Two-class classification performance of the AFibNet with intra-patient mechanism
| Fold | Performances (%) | ||||
|---|---|---|---|---|---|
| Accuracy | Sensitivity | Specifisity | Precision | F1-Score | |
| 1 | 76.29 | 57.02 | 79.10 | 53.19 | 54.12 |
| 2 | 96.37 | 94.90 | 97.28 | 91.83 | 93.24 |
| 3 | 98.02 | 96.64 | 98.46 | 95.66 | 96.14 |
| 4 | 99.08 | 98.41 | 99.27 | 97.97 | 98.19 |
| 5 | 99.29 | 98.88 | 99.46 | 98.24 | 98.55 |
| 6 | 98.88 | 98.13 | 99.14 | 97.36 | 97.73 |
| 7 | 99.34 | 98.89 | 99.49 | 98.41 | 98.64 |
| 8 | 98.22 | 97.10 | 98.46 | 96.59 | 96.83 |
| 9 | 99.18 | 98.48 | 99.34 | 98.08 | 98.28 |
| 10 | 98.91 | 98.01 | 99.13 | 97.39 | 97.69 |
| Average | 96.36 | 93.65 | 96.92 | 92.47 | 92.94 |
AFibNet performance for each class
| Performance (%) | N | AF | Non-AF |
|---|---|---|---|
| Accuracy | 99.89 | 99.13 | 99.01 |
| Sensitivity | 100 | 97.92 | 98.77 |
| Specifisity | 99.84 | 99.33 | 99.32 |
The number of parameters produce based on 1D-CNNs architecture to show the computational complexity
| Layer name | Output shape | Parameters |
|---|---|---|
| Convolution 1 | (None, 2698, 64) | 256 |
| Convolution 2 | (None, 2696, 64) | 12,352 |
| Maxpooling 1 | (None, 1348, 64) | 0 |
| Convolution 3 | (None, 1346, 128) | 24,704 |
| Convolution 4 | (None, 1344, 128) | 49,280 |
| Maxpooling 2 | (None, 672, 128) | 0 |
| Convolution 5 | (None, 670, 256) | 98,560 |
| Convolution 6 | (None, 668, 256) | 196,864 |
| Convolution 7 | (None, 666, 256) | 196,864 |
| Maxpooling 3 | (None, 333, 256) | 0 |
| Convolution 8 | (None, 331, 512) | 393,728 |
| Convolution 9 | (None, 329, 512) | 786,944 |
| Convolution 10 | (None, 327, 512) | 786,944 |
| Maxpooling 4 | (None, 163, 512) | 0 |
| Convolution 11 | (None, 161, 512) | 786,944 |
| Convolution 12 | (None, 159, 512) | 786,944 |
| Convolution 13 | (None, 157, 512) | 786,944 |
| Maxpooling 5 | (None, 78, 512) | 0 |
| Flatten | (None, 39936) | 0 |
| Dense | (None, 1000) | 39,936,000 |
| Dense | (None, 1000) | 1,001,000 |
| Class | (None, 1) | 1001 |
| Total of parameters | 45, 846, 329 |
The sample of CPU and GPU process as a cloud server
| Specification | CPU | GPU | Testing (s) |
|---|---|---|---|
| 1 | CPU1: 4 Core, 8 thread, @2.8 GHz | – | 0.30 |
| Memory: 16 GG, Disk: 1000 Gb | |||
| 2 | CPU1: 4 Core, 8 thread, @2.8 GHz | GPU1: GTX | 0.18 |
| Memory: 16 Gb, Disk: 1000 Gb | 1050 Ti, 4Gb | ||
| 3 | CPU2: 8 Core, 16 thread, @3.6 GHz | – | 0.14 |
| Memory: 32 Gb, Disk: 1000 Gb | |||
| 4 | CPU2: 8 Core, 16 thread, @3.6 GHz | GPU2: RTX | 0.02 |
| Memory: 32 Gb, Disk: 1000 Gb | 2080 Ti, 11Gb |
Fig. 4Processing time of 1D-CNNs in four server specification
Fig. 5Processing time of load model in cloud system
Fig. 6Throughput
AFibNet performance with tenfold cross validation for two-class
| Fold | Classifier performances (%) | ||||
|---|---|---|---|---|---|
| Accuracy | Sensitivity | Specificity | F1-score | Precision | |
| 1 | 98.22 | 98.24 | 98.24 | 97.98 | 97.74 |
| 2 | 99.94 | 99.94 | 99.94 | 99.93 | 99.93 |
| 3 | 99.98 | 99.98 | 99.98 | 99.97 | 99.97 |
| 4 | 100 | 100 | 100 | 100 | 100 |
| 5 | 99.96 | 99.97 | 99.97 | 99.95 | 99.94 |
| 6 | 99.98 | 99.98 | 99.98 | 99.97 | 99.97 |
| 7 | 99.94 | 99.94 | 99.94 | 99.93 | 99.93 |
| 8 | 100 | 100 | 100 | 100 | 100 |
| 9 | 100 | 100 | 100 | 100 | 100 |
| 10 | 99.98 | 99.98 | 99.98 | 99.97 | 99.97 |
| Average | 99.8 | 99.8 | 99.8 | 99.77 | 99.74 |
Benchmarking with other DL for AF detection
| Authors | Method | Total subject | Acc. (%) | Sens. (%) | Spec. | ROC-AUC Score (%) |
|---|---|---|---|---|---|---|
| Faust et al. [ | RNNs-LSTM | 102 | 98.51 | – | – | – |
| Hong et al. [ | CNNs-RNNs | ± 20,000 | – | – | – | 98.57 |
| Zhang et al. [ | CNNs | 177,941 | 91.88 | 94.23 | – | – |
| Yildirim et al. [ | DNNs | 3605 | 97.91 | 96.52 | 98.31 | – |
| Proposed model | 1D-CNNs | 8416 | 100 | 100 | 100 | – |
| 11,842 | 98.94 | 98.97 | 98.97 | – | ||
| 26,349 | 96.36 | 93.65 | 96.92 | - |
Acc Accuracy, Sens Sensitivity, Spec Specificity