| Literature DB >> 33665429 |
Ennio Idrobo-Ávila1, Humberto Loaiza-Correa1, Rubiel Vargas-Cañas2, Flavio Muñoz-Bolaños3, Leon van Noorden4.
Abstract
The electrocardiogram is traditionally used to diagnose a large number of heart pathologies. Research to improve the readability and classification of cardiac signals includes studies geared toward sonification of the electrocardiographic signal and others involving features related to music processing, such as Mel-frequency cepstral coefficients. In terms of music processing features, this study seeks to use music information retrieval (MIR) features as electrocardiographic signal descriptors. The study compares the discriminatory capability of the introduced features in relation to standard groups such as heart rate variability, wavelet transform, descriptive statistics, Mel coefficients and fractal analysis, evaluated using classification algorithms; the signals analyzed were extracted from public databases. The group of features extracted from wavelet transform and the MIR group showed a high level of discrimination; the best representation of the ECG signals in the study was achieved in most cases by the MIR features. Moreover, a correlation coefficient higher than 0.8 was found between a number of MIR and other feature groups, indicating a likely relationship between the electrocardiographic signals and MIR features. These results suggest the feasibility of representing the analyzed signals by music information retrieval descriptors, giving the potential to consider these electrocardiographic signals as analogues to musical signals.Entities:
Keywords: ECG signal classification; Heart rate variability; Music; Neural networks; PhysioNet physiological signals database
Year: 2021 PMID: 33665429 PMCID: PMC7905363 DOI: 10.1016/j.heliyon.2021.e06257
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Figure 1Block diagram of the experimental procedure.
Figure 2Mother wavelet Daubechies 6. For more information on mother wavelets see [33].
Figure 3Database augmentation stage.
List of MIR features extracted from ECG signals (26 features) MIT-BIH database.
| Features | Description |
|---|---|
| mean pitch, standard deviation pitch, zero-crossing, low energy rate, tempo, minimum tempo, maximum tempo, mean tempo, standard deviation of tempo, pulse clarity, event density, minimum novelty, maximum novelty, mean novelty, standard deviation of novelty, key, mode, spectral spread, spectral distribution centroid, spectral roll-off, spectral skewness, spectral kurtosis, spectral flatness, spectral regularity, spectral entropy, root mean square | Although, many features were included, a greater influence of tempo-related features was expected due to the rhythmic nature of the heart. The main MIR characteristics considered here include root mean square, tempo, zero-cross, spectral flatness, and spectral spread. Root mean square is related to sound intensity [ |
List of features extracted from ECG signals, C-database.
| Group | Features | Total |
|---|---|---|
| MFCC | mfcc1, mfcc2, mfcc3, mfcc4, mfcc5, mfcc6, mfcc7, mfcc8, mfcc9, mfcc10, mfcc11, mfcc12, mfcc13, maximum mfcc, minimum mfcc, mean mfcc, mfcc variance, mfcc skewness, mfcc kurtosis | 19 |
| Descriptive statistics | maximum, minimum, mean, variance, skewness, kurtosis, median, mode, energy, entropy | 10 |
| Fractal analysis | Higuchi fractal dimension, Katz fractal dimension, Hurst exponent, HRV detrended fluctuation analysis alpha1, HRV detrended fluctuation analysis alpha2 | 5 |
| HRV | mean R-R interval, root mean square of the successive differences, median of Euclidean distance, interquartile range of Euclidean distance, mean of the heart rate, probability of intervals greater than 50ms, triangular index from the interval histogram, performing triangular interpolation, correlation dimension, approximate entropy, standard deviation1 of the Poincaré plot, standard deviation2 of the Poincaré plot, ratio of standard deviation1 and standard deviation2 of the Poincaré plot, very low-frequency components, low-frequency components, high-frequency components, ratio of low and high-frequency components, power of low-frequency components, power of high-frequency components, total power, HRV detrended fluctuation analysis alpha1, HRV detrended fluctuation analysis alpha2 | 22 |
| Wavelet coefficients | maximum(cfs0-7), minimum(cfs0-7), mean(cfs0-7), variance(cfs0-7), median(cfs0-7), mode(cfs0-7), energy(cfs0-7), entropy(cfs0-7), Higuchi fractal dimension (cfs0-7) | 72 |
| MIR | As described in | 26 |
| 154 | ||
Configuration parameters of classical artificial intelligence algorithms implemented.
| Classification algorithms | Configuration MIT-BIH database | Configuration |
|---|---|---|
| AdaBoost | Base estimator: Tree, Number of estimators: 50, Learning rate: 1, Classification algorithm: SAMME.R | Base estimator: Tree, Number of estimators: 50, Learning rate: 1, Classification algorithm: SAMME.R |
| CN2 rule inducer | Unordered rule, Evaluation measure: entropy, Beam width: 5, Regression loss function: linear | Unordered rule, Evaluation measure: entropy, Beam width: 5, Regression loss function: linear |
| Neural network | Multi-layer perceptron with backpropagation. Neurons in hidden layers: 150, Activation: ReLu, Regularization: alpha = 0.002, Solver: Adam | Multi-layer perceptron with backpropagation. Neurons in hidden layers: 5, Activation: ReLu, Regularization: alpha = 3, Solver: L-BFGS-B |
| Random forest | Number of trees: 20 | Number of trees: 10 |
| Decision trees | Induce binary tree, Minimum number of instances in leaves: 7, Limit the maximal tree depth to: 100 | Induce binary tree, Minimum number of instances in leaves: 2, Limit the maximal tree depth to: 100 |
| K-nearest neighbors | Number of neighbors: 5, Metric: Manhattan, Weight: Distance | Number of neighbors: 3, Metric: Manhattan, Weight: Distance |
Figure 4Example of ECG baseline correction.
Figure 5Example of wavelet-based shrinkage filtering.
Ranking of MIR features using information gain ratio (IGR).
| Ranking | MIR feature | IGR | Ranking | MIR feature | IGR |
|---|---|---|---|---|---|
| 1 | mean tempo | 0.1181 | 14 | standard deviation of tempo | 0.0432 |
| 2 | tempo | 0.1137 | 15 | spectral spread | 0.0367 |
| 3 | minimum tempo | 0.0995 | 16 | low energy rate | 0.0324 |
| 4 | pulse clarity | 0.0773 | 17 | spectral regularity | 0.0296 |
| 5 | root mean square | 0.0731 | 18 | spectral flatness | 0.0283 |
| 6 | spectral entropy | 0.0708 | 19 | standard deviation pitch | 0.0240 |
| 7 | spectral skewness | 0.0641 | 20 | zero-crossing | 0.0233 |
| 8 | spectral roll-off | 0.0597 | 21 | key | 0.0165 |
| 9 | spectral centroid | 0.0579 | 22 | mode | 0.0060 |
| 10 | spectral kurtosis | 0.0569 | 23 | standard deviation of novelty | 0.0040 |
| 11 | event density | 0.0526 | 24 | maximum novelty | 0.0028 |
| 12 | maximum tempo | 0.0483 | 25 | mean of novelty | 0.0015 |
| 13 | mean pitch | 0.0453 | 26 | minimum novelty | 0.0009 |
Figure 6Area under the ROC curve (AUC), MIT-BIH database: classification with MIR features.
Figure 7Accuracy, MIT-BIH database: classification with MIR features.
Figure 8Area under the ROC curve (AUC), C-database: classification analysis of features and classifier algorithms.
Figure 9Accuracy, C-database: classification analysis of features and classifier algorithms.
Figure 10Neural network classification with ranked MIR features (C-database).
Correlation coefficients (Corr) between selected MIR features and other groups of features (ECG features).
| MIR Features | ECG Features | Corr | |
|---|---|---|---|
| tempo | - | mean heart rate | +0.979 |
| tempo | - | mean R-R interval | -0.958 |
| root mean square | - | energy (cfs0) | +0.951 |
| root mean square | - | variance | +0.951 |
| root mean square | - | variance (cfs0) | +0.950 |
| root mean square | - | energy | +0.950 |
| root mean square | - | maximum (cfs0) | +0.946 |
| root mean square | - | maximum | +0.935 |
| zero-crossing | - | Higuchi fractal dimension | +0.842 |
| root mean square | - | mean | +0.834 |
| spectral spread | - | Higuchi fractal dimension | +0.829 |
| root mean square | - | mean (cfs0) | +0.829 |
| spectral flatness | - | Higuchi fractal dimension | +0.819 |
Confusion matrix: classification with MIR features and neural network.
| Predicted heart condition | ||||||
|---|---|---|---|---|---|---|
| N | T | IMI | AMI | Σ | ||
| Actual heart condition | N | 189 (90%) | 28 (10%) | 42 (15%) | 65 (20%) | 324 |
| T | 0 (0%) | 248 (84%) | 40 (14%) | 0 (0%) | 288 | |
| IMI | 0 (0%) | 9 (3%) | 180 (62%) | 63 (20%) | 252 | |
| AMI | 22 (10%) | 9 (3%) | 27 (9%) | 194 (60%) | 252 | |
| Σ | 211 | 294 | 289 | 322 | 1116 | |
Figure 11Neural network classification with ranked MIR features (C-database without AMI class).