| Literature DB >> 35265872 |
Ruhi Mahajan1, Rishikesan Kamaleswaran2, Oguz Akbilgic3.
Abstract
Background: Atrial fibrillation (AF) is one of the most common cardiovascular problems, and its asymptomatic tendency makes AF detection challenging. Machine and deep learning methods are commonly used in AF detection. Objective: The purpose of this study was to evaluate the information provided by convolutional neural network (CNN) and random forest (RF) machine learning models for AF classification.Entities:
Keywords: Arrhythmia detection; Convolutional neural networks; Electrocardiography; Feature extraction; Random forest classifier
Year: 2020 PMID: 35265872 PMCID: PMC8890095 DOI: 10.1016/j.cvdhj.2020.04.001
Source DB: PubMed Journal: Cardiovasc Digit Health J ISSN: 2666-6936
Figure 1Representative electrocardiographic recordings of 10-second duration for different classes of cardiac rhythms. AF = atrial fibrillation.
Figure 2One-dimensional, 12-layer convolutional neural network architecture designed to distinguish 4 classes of cardiac rhythms using single-lead electrocardiography. AF = atrial fibrillation; BN = batch normalization; CL = convolution layer; FC = fully connected layer.
Figure 3Box plots of 4 representative features extracted for classification of cardiac rhythms. AF = atrial fibrillation; HF = high frequency; LF = low frequency; PSD = power spectral density.
Details of 166 features extracted with feature engineering
| Type | Description | No. of features |
|---|---|---|
| Group 1: Time domain | Descriptive measures of PR interval, duration of QT interval, RR intervals, first- and second-order RR intervals | 56 |
| Quantile based on RR intervals | ||
| KS test based on RR intervals | ||
| Heart rate variability measures | ||
| Group 2: Frequency domain | PSD of wavelet coefficients | 54 |
| VLF power | ||
| LF power | ||
| HF power | ||
| PSD ratios | ||
| Group 3: Nonlinear | Descriptive measures of sample entropy computed on ECG and wavelet coefficients | 20 |
| Group 4: Linear | PSPR features from ECG recording sampled at 8 Hz | 36 |
| Descriptive measures of wavelet coefficients |
ECG = electrocardiography; HF = high frequency; KS = Kolmogorov-Smirnov; LF = low frequency; PSD = power spectral density; PSPR = probabilistic symbolic pattern recognition; VLF = very low frequency.
Performance of various classifiers while using different categories of features for classification
| Classifier | Average F1 score | |||
|---|---|---|---|---|
| Group 1: Time domain | Group 2: Frequency domain | Group 4: Nonlinear | Group 4: Linear | |
| SVM | 0.71 | 0.38 | 0.40 | 0.38 |
| LDA | 0.19 | 0.30 | 0.41 | 0.26 |
| KNN | 0.71 | 0.26 | 0.38 | 0.22 |
| QDA | 0.20 | 0.32 | 0.42 | 0.19 |
| Decision trees | 0.66 | 0.39 | 0.40 | 0.30 |
| Random forest | 0.73 | 0.46 | 0.46 | 0.36 |
KNN = k-nearest neighbor; LDA = linear discriminant analysis; QDA = quadratic discriminant analysis; SVM = support vector machine.
Confusion matrix obtained from the cross-validated training dataset using random forest model
| Normal | 4740 | 18 | 288 | 30 |
| AF | 56 | 530 | 151 | 21 |
| Other | 706 | 73 | 1604 | 32 |
| Noise | 85 | 10 | 41 | 143 |
| Normal | AF | Other | Noise |
AF = atrial fibrillation.
Figure 4F1 scores of 3 main cardiac rhythms obtained using feature engineering–based random forest (RF) (on the hidden test dataset), convolutional neural network (CNN), and an ensemble of support vector machine (SVM) and CNN classifiers (on the validation dataset). AF = atrial fibrillation.
Figure 5Heat map of the correlation matrix showing normalized Pearson correlation between a set of 166 manually extracted and convolutional neural network (CNN)–extracted features. Red box highlights CNN features that are more correlated with time–domain features.