| Literature DB >> 25793605 |
Paolo Melillo1, Raffaele Izzo2, Ada Orrico1, Paolo Scala3, Marcella Attanasio1, Marco Mirra2, Nicola De Luca2, Leandro Pecchia4.
Abstract
BACKGROUND: There is consensus that Heart Rate Variability is associated with the risk of vascular events. However, Heart Rate Variability predictive value for vascular events is not completely clear. The aim of this study is to develop novel predictive models based on data-mining algorithms to provide an automatic risk stratification tool for hypertensive patients.Entities:
Mesh:
Year: 2015 PMID: 25793605 PMCID: PMC4368686 DOI: 10.1371/journal.pone.0118504
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Patient baseline characteristics.
| Clinical Features | Low-risk subjects | High-risk subjects | p-value |
|---|---|---|---|
| Age (years) | 71.4±7 | 74.1±6.5 | 0.136 |
| Sex (female) | 41 (33.6%) | 8 (47.1%) | 0.277 |
| Family history of hypertension | 41 (33.6%) | 7 (41.2%) | 0.622 |
| Family history of stroke | 10 (8.2%) | 3 (17.6%) | 0.236 |
| Smoking | 35 (28.7%) | 5 (29.4%) | 0.983 |
| Diabetes | 18 (14.8%) | 3 (17.6%) | 0.834 |
| Diastolic Blood Pressure (mmHg) | 76.3±9.1 | 73.5±8.4 | 0.204 |
| Systolic Blood Pressure (mmHg) | 136.6±19.5 | 141.7±23.5 | 0.326 |
| Total Cholesterol (mg/dl) | 175.7±35.1 | 182.9±42.7 | 0.460 |
| Low Density Lipoprotein (mg/dl) | 101±30.1 | 102±34.3 | 0.907 |
| High Density Lipoprotein (mg/dl) | 52.4±13.1 | 53.3±15.3 | 0.813 |
| Body Mass Index (kg/m2) | 27.6±3.9 | 27.9±4.9 | 0.793 |
| Body Surface Area (m2) | 1.9±0.2 | 1.9±0.2 | 0.442 |
| Alpha-blockers | 17 (13.9%) | 3 (17.6%) | 0.782 |
| Beta-blockers | 50 (41%) | 6 (35.3%) | 0.487 |
| ACE inhibitor | 37 (30.3%) | 8 (47.1%) | 0.247 |
| Dihydropyridine | 27 (22.1%) | 7 (41.2%) | 0.131 |
| Intima Media Thickness (mm) | 2.3±0.7 | 2.4±1.1 | 0.685 |
| Left Ventricular Mass index (g/m2) | 130.1±26.1 | 140.2±25.1 | 0.135 |
| Ejection Fraction (%) | 59.3±10.9 | 57.8±13 | 0.591 |
Data are expressed as mean and standard deviation for continuous variables (e.g. age) and as count and percentage of patients per each group for categorical variables (e.g. gender).
Fig 1Feature importance computed by using Random Forest algorithm.
CD: Correlation dimension. SampEn: Sample entropy. LFpeak: peak frequency of LF band. SD2: long-term variability in Poincaré Plot. LF: absolute power in low frequency band (0.04–0.15 Hz). SDNN: standard deviation of all RR intervals. HF: absolute power in high frequency band (0.15–0.4 Hz). VLF%: relative power in very low frequency band (0–0.04 Hz). LF%: relative power in low frequency band (0.04–0.15 Hz). HRVTi: HRV triangular index. HF%: relative power in high frequency band (0.15–0.4 Hz). SD1: short-term variability in Poincaré Plot. TP: total power. DET: determinism. LF/HF: the ratio between LF and HF. VLFpeak: peak frequency of VLF band. TINN: triangular interpolation of RR interval histogram. NN50: number of differences between adjacent RR intervals that are longer than 50 ms. REC: recurrence rate. Lmean: mean length of lines in recurrence plot. AppEn: Approximate Entropy. HFpeak: peak frequency of HF band. Alpha1: short-term fluctuations in Detrended Fluctuation Analysis. RMSSD: square root of the mean of the sum of the squares of differences between adjacent RR intervals. HFnu: power in high frequency band (0.15–0.4 Hz), expressed in normalized unit. LFnu: power in low frequency band (0.04–0.15 Hz), expressed in normalized unit. AVNN: Average of all RR intervals. ShanEn: Shannon Entropy. DIV: Divergence. VLF: absolute power in very low frequency band (0–0.04 Hz). Alpha2: long-term fluctuations in Detrended Fluctuation Analysis. Lmax: maximal length of lines in recurrence plot. pNN50: percentage of differences between adjacent RR intervals that are longer than 50 ms.
Performance measurement (10-fold-crossvalidation estimation) of the proposed algorithms based on HRV features.
| Classifier | Parameters | Feature selection (# features) | AUC | ACC | SEN | SPE |
|---|---|---|---|---|---|---|
| AB | NI: 220; CF 0.5; MI: 20 | None (33) | 94.5% | 91.8% | 93.2% | 90.4% |
| AB | NI: 20; CF: 0.3; MI: 10 | CFS (8) | 92.2% | 85.6% | 86.3% | 84.9% |
|
|
|
|
|
|
|
|
| C4.5 | CF: 0.3; MI: 5 | None (33) | 80.3% | 76.7% | 78.1% | 75.3% |
| C4.5 | CF: 0.3; MI: 5 | CFS (8) | 82.8% | 80.8% | 87.7% | 74.0% |
|
|
|
|
|
|
|
|
| MLP | LR 0.3; M 0.6; NE 200 | None (33) | 86.7% | 82.9% | 80.8% | 84.9% |
|
|
|
|
|
|
|
|
| MLP | LR 0.3; M 0.2; NE 1800 | Χ2-FS (10) | 86.1% | 78.8% | 82.2% | 75.3% |
| NF | - | None (33) | 72.4% | 65.8% | 76.7% | 54.8% |
|
| - |
|
|
|
|
|
| NF | - | Χ2-FS (10) | 77.8% | 71.9% | 82.2% | 61.6% |
|
|
|
|
|
|
|
|
| RF | NT 20 NF 5 | CFS (8) | 92.3% | 87.7% | 90.4% | 84.9% |
| RF | NT 400 NF 4 | Χ2-FS (10) | 93.2% | 89.0% | 93.2% | 84.9% |
|
|
|
|
|
|
|
|
| SVM | G: 2.3 | CFS (8) | 89.1% | 81.5% | 84.9% | 78.1% |
| SVM | G: 1.6 | Χ2-FS (10) | 89.2% | 80.8% | 86.3% | 75.3% |
CFS: correlation-based feature selection algorithm (a subset of 8 HRV features)
Χ2-FS: chi-squared feature selection algorithm (a subset of 10 HRV features)
NI: number of iteration
ML: minimum number of instances per leaf.
CF: confidence factor for pruning
LR: learning rate
M: momentum
NE: number of epoch
NT: number of trees
NF: number of randomly chosen features
G: gamma
AUC: area under the curve
CI: confidence interval
ACC: accuracy
SEN: sensitivity
SPE: specificity
In bold: the best performances of each classifier.
Performance measurements estimated on the test set (hold-out estimation) of the best classifiers based on HRV features.
| Class. | Parameters | Feature selection (# features) | AUC | ACC (95% CI) | SEN | SPE |
|---|---|---|---|---|---|---|
| AB | NI: 120; CF: 0.45; MI: 10 | Χ2-FS(10) | 81.9% | 83.9%(76.9–86.6) | 71.4% | 85.7% |
| C4.5 | CF: 0.1; MI: 5 | Χ2-FS (10) | 69.8% | 75.0% (67.7–79.1) | 57.1% | 77.6% |
| MLP | LR: 0.6; M: 0.4; NE: 200 | CFS (8) | 64.7% | 76.8% (69.5–80.6) | 42.9% | 81.6% |
| NF | - | CFS (8) | 74.9% | 69.6% (62.4–74.4) | 57.1% | 71.4% |
| RF | NT: 300 NF: 5 | None (33) | 88.8% | 85.7% (78.7–88.1) | 71.4% | 87.8% |
| SVM | G: 1.4 | None (33) | 90.1% | 83.9% (76.9–86.6) | 71.4% | 85.7% |
Class.: Classifier
AB: Adaboost
MLP: Multilayer Perceptron
NB: Naïve Bayes classifier
RF: Random Forest
SVM: Support Vector Machine
NI: number of iteration
ML: minimum number of instances per leaf.
CF: confidence factor for pruning
LR: learning rate
M: momentum
NE: number of epoch
NT: number of trees
NF: number of randomly chosen features
G: gamma
Χ2-FS: chi squared feature selection algorithm (a subset of 10 HRV features)
CFS: correlation-based feature selection algorithm (a subset of 8 HRV features)
AUC: area under the curve
ACC: accuracy
CI: confidence interval
SEN: sensitivity
SPE: specificity.
Performance measurements of classification based on echographic parameters.
| Parameter | AUC | ACC (95% CI) | SEN | SPE |
|---|---|---|---|---|
| LVMi | 63.5% | 69.5% (69.9–73.0) | 41.2% | 73.9% |
| IMT MAX | 49.1% | 61.9% (57.3–65.8) | 40.0% | 64.9% |
LVMi.: Left ventricular mass index
IMT MAX: maximum of intima media thickness
AUC: area under the curve
ACC: accuracy
CI: confidence interval
SEN: sensitivity
SPE: specificity.
Fig 2Receiver-operator characteristic curves for predicting vascular events by HRV-based classifiers and echographic parameters.
The HRV-based classifiers are able to predict vascular events with higher sensitivity and specificity rate than echographic parameters. Sensitivity is determined from the proportion of patient developing a vascular event identified as high risk; specificity is determined from the proportion of patient free of vascular events identified as low risk. Solid lines represent classifier based on HRV features, dash-dot lines represent classifications based on echographic parameters. AB: Adaboost. MLP: Multilayer Perceptron. NB: Naïve Bayes classifier. RF: Random Forest. SVM: Support Vector Machine. LVMi.: Left ventricular mass index. IMT MAX: maximum of intima media thickness.
Fig 3Decision tree for prediction of vascular events.
The decision tree shows the set of rules adopted for classify high and low risk subjects: if HRVTi is higher than 13.6, the subject is classified as low risk, otherwise if SampEn lower than 0.997 or LF% lower than 18.1%, the subject is classified as high risk. The remaining subjects (with higher SampEn and LF%), are classified based on LF and CF: as high risk, if LF is higher than 0.001 s2 and CD is lower 3.43, otherwise as low risk. HRVTi: HRV Triangular Index. SampEn: Sample Entropy. LF: Low Frequency. LF%: Low Frequency expressed as percentage of Total Power. CD: correlation dimension.