| Literature DB >> 28831239 |
Vladimir S Kublanov1, Anton Yu Dolganov1, David Belo2, Hugo Gamboa2.
Abstract
The paper presents results of machine learning approach accuracy applied analysis of cardiac activity. The study evaluates the diagnostics possibilities of the arterial hypertension by means of the short-term heart rate variability signals. Two groups were studied: 30 relatively healthy volunteers and 40 patients suffering from the arterial hypertension of II-III degree. The following machine learning approaches were studied: linear and quadratic discriminant analysis, k-nearest neighbors, support vector machine with radial basis, decision trees, and naive Bayes classifier. Moreover, in the study, different methods of feature extraction are analyzed: statistical, spectral, wavelet, and multifractal. All in all, 53 features were investigated. Investigation results show that discriminant analysis achieves the highest classification accuracy. The suggested approach of noncorrelated feature set search achieved higher results than data set based on the principal components.Entities:
Year: 2017 PMID: 28831239 PMCID: PMC5555018 DOI: 10.1155/2017/5985479
Source DB: PubMed Journal: Appl Bionics Biomech ISSN: 1176-2322 Impact factor: 1.781
Figure 1Diagram of the study.
Figure 2Example of F[LFwt/HFwt(t)] with the decision threshold.
Figure 3The features of multifractal analysis.
List of studied features.
| Feature | Description | Equation |
|---|---|---|
|
| Mean value of the R-R | ( |
| HR | Heart rate | ( |
| SDNN | Standard deviation of the R-R | ( |
| CV | Coefficient of the variation | ( |
| RMSSD | Square root of mean of squares of differences between successive R-R | ( |
| NN50 | Variation higher than 50 ms in R-R signal | — |
|
| Mode of the R-R signal | — |
| VR | Variation range of the R-R signal | — |
| AM0 | Amplitude of the mode | — |
| SI | Stress index | ( |
| IAB | Index of autonomic balance | ( |
| ARI | Autonomic rhythm index | ( |
| IARP | Index of adequate regulation processes | ( |
| HF(Fr) | High frequency Fourier spectral power | — |
| LF(Fr) | Low frequency Fourier spectral power | — |
| VLF(Fr) | Very low frequency Fourier spectral power | — |
| TP(Fr) | Total power of the Fourier spectrum | — |
| LF/HF(Fr) | Autonomic balance exponent of the Fourier spectrum | — |
| HFmax(Fr) | Maximum power of the HF | — |
| HF | Normalized power of the HF, LF, and VLF Fourier spectrum | ( |
| IC | Index of centralization | ( |
| IAS | Index of the subcortical nervous center's activation | ( |
| RF | Respiration frequency | — |
| HF(wt) | High frequency wavelet spectral power | — |
| LF(wt) | Low frequency wavelet spectral power | — |
| VLF(wt) | Very low frequency wavelet spectral power | — |
| HF | Normalized power of the HF, LF, and VLF wavelet spectrum | — |
| SDHF(wt), SDLF(wt), and SDVLF(wt) | Standard deviations of the HF( | — |
| TP(wt) | Total power of the wavelet spectrum | — |
| LF/HF(wt) | Autonomic balance exponent of the wavelet spectrum | — |
| (LF/HF)max | Maximal value of dysfunctions | — |
| (LF/HF)int | Intensity of dysfunctions | — |
| Nd | Number of dysfunctions | — |
|
| Hurst exponent | ( |
|
| Smallest fluctuations of the LF and VLF spectral band | — |
|
| Greatest fluctuations of the LF and VLF spectral band | — |
| WLF, WVLF | Spectrum width of the LF and VLF spectral band | — |
|
| Correlation degree of the LF and VLF spectral band | — |
|
| Spectrum height of the LF and VLF spectral band | — |
|
| 1/2-width measure of the LF and VLF spectral band | — |
Figure 4Flowchart of the noncorrelated combination selection.
Noncorrelated combination selection data.
|
| Total | Selected | Calculation |
|---|---|---|---|
| 2 | 1378 | 586 | 0.027 |
| 3 | 23,426 | 1669 | 0.477 |
| 4 | 292,825 | 1339 | 11.559 |
| 5 | 2,869,685 | 295 | 228.267 |
Figure 5Flowchart of classifier efficacy evaluation algorithm.
Calculation times of classifier efficacy evaluation, sec.
| Features in combinations | LDA | QDA | NN3 | NN4 | NN5 | RBF SVM | DT | Naive Bayes |
|---|---|---|---|---|---|---|---|---|
| 2 | 165 | 113 | 281 | 281 | 281 | 139 | 89 | 130 |
| 3 | 482 | 346 | 917 | 860 | 806 | 403 | 269 | 375 |
| 4 | 397 | 288 | 640 | 643 | 642 | 325 | 222 | 300 |
| 5 | 88 | 64 | 141 | 140 | 140 | 71 | 50 | 66 |
Figure 6Classifier score for 2-feature combinations.
Figure 7Classifier score for 3-feature combinations.
Figure 8Classifier score for 4-feature combinations.
Figure 9Classifier score for 5-feature combinations.
Figure 10Maximal scores achieved by each learning machine approach.
Best classification scores.
| Score, % | Features | |||
|---|---|---|---|---|
| Linear discriminant analysis | ||||
| 91.33 ± 1.75 | HR | VLF | LF/HF(Fr) | VLF(wt) |
| 90.30 ± 1.37 | HR | VLF | VLF(wt) | (LF/HF)int |
| 90.04 ± 1.85 | HR | LF/HF(Fr) | VLF(wt) | VLFn(wt) |
| 90.44 ± 1.60 | HR | VLF | LF/HF(Fr) | SDVLF |
| 90.11 ± 1.80 | HR | LF/HF(Fr) | SDVLF | VLF |
| 90.16 ± 1.61 | HR | SDVLF | VLF | (LF/HF)int |
|
| ||||
| Quadratic discriminant analysis | ||||
| 90.31 ± 1.71 | HR | VLF | LF/HF(Fr) | VLF(wt) |
|
| ||||
| 3-nearest neighbors | ||||
| 87.14 ± 2.12 | LF/HF(Fr) | SDVLF | VLF | W1/2VLF |
|
| ||||
| 4-nearest neighbors | ||||
| 85.56 ± 2.40 | SDVLF | VLF | LF/HF(wt) | W1/2VLF |
|
| ||||
| 5-nearest neighbors | ||||
| 86.63 ± 1.30 | HR | HF(Fr) | LF | W1/2VLF |
|
| ||||
| Support vector machine, radial base function | ||||
| 86.73 ± 2.24 | IAS | RF |
| WVLF |
|
| ||||
| Decision trees, max depth 5 | ||||
| 87.10 ± 3.40 | IARP | LF/HF(Fr) | IAS | WLF |
|
| ||||
| Decision trees, no max depth | ||||
| 87.34 ± 3.08 | IARP | LF/HF(Fr) | IAS | WLF |
|
| ||||
| Naïve Bayes classifier | ||||
| 88.17 ± 1.07 | VLF(Fr) | VLF | LF/HF(Fr) | W1/2LF |
Features occurrences for classification score higher than 85%.
| Features | Occurrences, % | Features | Occurrences, % |
|---|---|---|---|
| VLF | 50.89 | Nd | 4.73 |
| VLF(Fr) | 50.89 |
| 4.73 |
| VLF | 47.93 | WLF | 4.73 |
|
| 34.91 | IARP | 3.55 |
| LF/HF(Fr) | 34.32 | IC | 2.96 |
| HR | 33.73 | HF(Fr) | 2.96 |
| SDVLF | 30.18 |
| 2.96 |
| (LF/HF)max | 24.26 | LF | 2.37 |
| LF/HF(wt) | 18.34 | SI | 2.37 |
| (LF/HF)int | 18.34 | LF | 1.78 |
|
| 17.75 |
| 1.78 |
|
| 13.61 | ARI | 1.78 |
| WVLF | 13.02 | HF | 1.78 |
| IAS | 10.65 | SDHF | 1.18 |
| VLF(wt) | 7.69 | IAB | 0.59 |
| RF | 6.51 | NN50 | 0.59 |
|
| 5.92 |
| 0.59 |
|
| 5.33 |
| 0.59 |
Feature occurrences for classification score higher than 90%.
| Features | Occurrences, % |
|---|---|
| HR | 100.00 |
| LF/HF(Fr) | 62.50 |
| VLF(wt) | 62.50 |
| VLF | 50.00 |
| VLF | 50.00 |
| SDVLF | 37.50 |
| (LF/HF)int | 37.50 |
Dataset analysis by PCA.
| Principal component | Explained variance, % | Cumulative variance, % |
|---|---|---|
| 1 | 34.88 | 34.88 |
| 2 | 17.65 | 52.54 |
| 3 | 13.03 | 65.57 |
| 4 | 8.87 | 74.45 |
| 5 | 5.38 | 79.83 |
| 6 | 4.10 | 83.93 |
| 7 | 3.55 | 87.47 |
| 8 | 2.42 | 89.89 |
| 9 | 1.78 | 91.67 |
| 10 | 1.36 | 93.03 |
| 11 | 1.01 | 94.04 |
| 12 | 0.95 | 94.98 |
| 13 | 0.84 | 95.82 |
| 14 | 0.74 | 96.56 |
| 15 | 0.61 | 97.17 |
Figure 11Scores of the PCA achieved by each learning machine approach.