| Literature DB >> 31199302 |
Soonil Kwon1, Joonki Hong2, Eue-Keun Choi1, Euijae Lee1, David Earl Hostallero2, Wan Ju Kang2, Byunghwan Lee3, Eui-Rim Jeong4, Bon-Kwon Koo1, Seil Oh1, Yung Yi2.
Abstract
BACKGROUND: Wearable devices have evolved as screening tools for atrial fibrillation (AF). A photoplethysmographic (PPG) AF detection algorithm was developed and applied to a convenient smartphone-based device with good accuracy. However, patients with paroxysmal AF frequently exhibit premature atrial complexes (PACs), which result in poor unmanned AF detection, mainly because of rule-based or handcrafted machine learning techniques that are limited in terms of diagnostic accuracy and reliability.Entities:
Keywords: atrial fibrillation; deep learning; diagnosis; photoplethysmography; pulse oximetry
Mesh:
Year: 2019 PMID: 31199302 PMCID: PMC6592499 DOI: 10.2196/12770
Source DB: PubMed Journal: JMIR Mhealth Uhealth ISSN: 2291-5222 Impact factor: 4.773
Figure 1Flowchart illustrating the deep learning process. For each subject, 15-min PPG data during pre- and post- direct-current cardioversion periods were obtained. Each 15-min sample was preprocessed by removing bias, applying bypass filters, and normalization. Then, each sample was subdivided into 30-second samples with 20-second overlaps for data augmentation. The 30-second samples were trained and tested by 1-dimensional convolutional neural network (1D-CNN) and recurrent neural network (RNN) methods. Each sample was labeled as atrial fibrillation (AF) or sinus rhythm (SR). The number in parenthesis shows an example of the corresponding confidence level. In this example, the confidence level for diagnosing AF was 0.9 in 1D-CNN and RNN models, whereas 0.1 in SR for both models. 1D-CNN: 1-dimensional convolutional neural network; AF: atrial fibrillation; PPG: photoplethysmography; RNN: recurrent neural network; SR: sinus rhythm.
The clinical characteristics of the study population (N=75).
| Variables | Values | |
| Age (years), mean (SD) | 63 (7.8) | |
| Male, n (%) | 68 (91) | |
| Body mass index (kg/m2), mean (SD) | 25.2 (2.9) | |
| Body surface area (m2), mean (SD) | 1.83 (0.16) | |
| Persistenta | 57 (76) | |
| Long-standing persistentb | 18 (24) | |
| CHA2 DS2-VASc scorec | 1 (1,2) | |
| Congestive heart failure | 5 (7) | |
| Hypertension | 38 (51) | |
| Diabetes mellitus | 10 (13) | |
| Stroke or transient ischemic attack | 8 (11) | |
| Myocardial infarction | 1 (1) | |
| Valvular heart disease | 5 (7) | |
| Dyslipidemia | 19 (25) | |
| Chronic renal failure | 3 (4) | |
| Chronic obstructive pulmonary disease | 0 | |
| Hyperthyroidism | 6 (8) | |
| Previous AF ablation history, n (%) | 3 (5) | |
| Propafenone | 19 (25) | |
| Flecainide | 9 (12) | |
| Pilsicainide | 4 (5) | |
| Sotalol | 1 (1) | |
| Amiodarone | 40 (53) | |
| Beta blocker | 18 (24) | |
| Calcium channel blockerd | 12 (16) | |
| Digoxin | 1 (1) | |
| Aspirin | 2 (3) | |
| Warfarin | 18 (24) | |
| Nonvitamin K oral anticoagulant | 55 (73) | |
| Angiotensin converting enzyme inhibitor | 0 | |
| Angiotensin II receptor blocker | 16 (21) | |
| Diuretics | 7 (9) | |
| Statin | 13 (17) | |
aAF history more than 1 month and less than 1 year.
bAF history more than 1 year.
cThe value is expressed as both median and interquartile range.
dNondihydropyridine class.
Figure 2Typical examples of 15-second-long photoplethysmography and corresponding synchronized electrocardiogram samples for (A) stereotypic normal sinus rhythm and (B) atrial fibrillation with suggested confidence level using the 1-dimensional convolutional neural network algorithm. AF: atrial fibrillation; CL: confidence level; ECG: electrocardiogram; PPG: photoplethysmography; SR: sinus rhythm.
The diagnostic performance of various algorithms for classifying photoplethysmography samples of atrial fibrillation and sinus rhythm after electrically cardioverted patients.
| Algorithms | Accuracy | Mean sensitivity (%) | Mean specificity (%) | Mean positive predictive value (%) | Mean negative predictive value (%) | AUCa | 95% CI | True mean confidence level (%) | False CL |
| 1-Dimensional convolutional neural network | 97.58 | 99.32 | 95.85 | 95.98 | 99.30 | 0.998 | (0.995-1.000) | 98.56 | 78.75 |
| Recurrent neural network | 97.15 | 98.27 | 96.04 | 96.12 | 98.24 | 0.996 | (0.993-0.998) | 98.37 | 82.57 |
| Support vector machine, root-mean square of the successive differences of RR intervals + ShEb | 86.82 | 89.13 | 84.50 | 85.16 | 88.63 | 0.868 | (0.854-0.881) | —c | — |
| SVM, autocorrelationd | 91.43 | 93.26 | 89.60 | 89.94 | 93.02 | 0.977 | (0.972-0.982) | — | — |
| SVM, ensemblee | 90.72 | 88.57 | 92.87 | 92.53 | 89.07 | 0.976 | (0.970-0.981) | — | — |
aAUC: mean area under the receiver operating characteristic curves. The standard errors by binomial exact test were all <0.01 except SVM with ensemble (0.01).
bSVM using RMSSD and ShE as a feature.
cNot applicable.
dSVM using autocorrelation method.
eSVM using RMSSD, ShE and autocorrelation.
Figure 3The receiver operating characteristic (ROC) curves of 2 deep learning classifiers (1-dimensional convolutional neural network, 1D-CNN and recurrent neural network, RNN) compared with other previous high-end atrial fibrillation (AF) detectors. (A) A Comparison of several ROC curves by different AF-detection algorithms. (B) The area under the curve and corresponding 95% CI by different algorithms. Both 1D-CNN and RNN methods showed significantly better diagnostic performance than previous detectors. 1D-CNN: 1-dimensional convolutional neural network; RMSSD: root-mean square of the successive differences of RR intervals; RNN: recurrent neural network; ShE: Shannon entropy; SVM: support vector machine.
Figure 4Comparison of performances of deep learning classifiers and previous state-of-the-art atrial fibrillation detectors by premature atrial complexes (PACs) burden. The performance of classifying photoplethysmography samples during post- direct-current cardioversion period as sinus rhythm by each algorithm was measured by specificity. (A) Scenario A was obtained by the 5-fold cross-validation with random assignment of patients. In this case, each algorithm faced new patient’s data during testing. (B) Scenario B was obtained by the 5-fold cross-validation with random assignment of samples. This approach assumed that the training distribution could emulate the test distribution. Regardless of the method, both 1-dimensional convolutional neural network and recurrent neural network maintained higher specificity over burden of PACs. Both DL classifiers showed higher specificity in Scenario B than Scenario A. 1D-CNN: 1-dimensional convolutional neural network; PAC: premature atrial complex; PPG: photoplethysmography; RNN: recurrent neural network root-mean square of successive difference of RR intervals.
Figure 5The characteristics of confidence level (CL) calculated by deep learning (DL) classifiers. Data were obtained by repeating the 5-fold cross-validation test over 10 times. (A) Comparison of true and false CLs of 1-dimensional convolutional neural network (1D-CNN) and recurrent neural network (RNN) methods by Box-and-Whiskers plot. True CLs indicate the cases where the diagnosis of a DL classifier was correct. Conversely, false CLs indicate cases where a DL classifier was incorrect. In the both 1D-CNN and RNN methods, the distributions of true or false CLs were significantly different (P<.001) for both 1D-CNN and RNN methods. If cut-off level of CL is to be 95% (dashed line), the diagnostic accuracy was 99.6% in 1D-CNN and 99.2% in RNN. Therefore, a diagnosis with a CL ≥95% can be regarded as certain. (B) The association between the probability of misdiagnosis and sample proportions and the respective CLs. Because 91% of the tested samples showed CL ≥95%, most diagnoses made by DL classifiers were valid. The probability of false diagnoses decreases from 50% to 0% as the CL increases from 50% to 100%. Comparing 1D-CNN and RNN, there was no significant difference in CLs (P=.98). *P<.001, calculated by Student t test. **P=.98, calculated by Student t test. 1D-CNN: 1-dimensional convolutional neural network; RNN: recurrent neural network.