| Literature DB >> 33286015 |
Xueya Yan1, Lulu Zhang1, Jinlian Li2, Ding Du1, Fengzhen Hou1.
Abstract
Surges in sympathetic activity should be a major contributor to the frequent occurrence of cardiovascular events towards the end of nocturnal sleep. We aimed to investigate whether the analysis of hypnopompic heart rate variability (HRV) could assist in the prediction of cardiovascular disease (CVD). 2217 baseline CVD-free subjects were identified and divided into CVD group and non-CVD group, according to the presence of CVD during a follow-up visit. HRV measures derived from time domain analysis, frequency domain analysis and nonlinear analysis were employed to characterize cardiac functioning. Machine learning models for both long-term and short-term CVD prediction were then constructed, based on hypnopompic HRV metrics and other typical CVD risk factors. CVD was associated with significant alterations in hypnopompic HRV. An accuracy of 81.4% was achieved in short-term prediction of CVD, demonstrating a 10.7% increase compared with long-term prediction. There was a decline of more than 6% in the predictive performance of short-term CVD outcomes without HRV metrics. The complexity of hypnopompic HRV, measured by entropy-based indices, contributed considerably to the prediction and achieved greater importance in the proposed models than conventional HRV measures. Our findings suggest that Hypnopompic HRV assists the prediction of CVD outcomes, especially the occurrence of CVD event within two years.Entities:
Keywords: XGBoost; cardiovascular disease; heart rate variability; machine learning; sleep
Year: 2020 PMID: 33286015 PMCID: PMC7516674 DOI: 10.3390/e22020241
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Heart rate variability (HRV) metrics used in this study.
| Usage | Metric | Units | Description |
|---|---|---|---|
| on each 5-min HRV segment | TP | ms2 | Total power in frequency range (0.003–0.4 Hz) [ |
| LF | ms2 | Power in low frequency range (0.04–0.15 Hz) [ | |
| HF | ms2 | Power in high frequency range (0.15–0.4 Hz) [ | |
| HFnorm | n.u. | HF power in normalized units (HF/(LF + HF) × 100) [ | |
| on entire 1-h HRV data | SDNN | ms | Standard deviation of all RR intervals [ |
| RMSSD | ms | The square root of the mean of the sum of squares of differences between adjacent RR intervals [ | |
| MSE | Multiscale sample entropy [ | ||
| MPE | Modified permutation entropy of RR intervals [ |
Baseline clinical characteristics of participants involved in this study.
| CVD | non-CVD |
| |
|---|---|---|---|
| Number of participants | 1219 | 998 | |
| Age (years) | 63[58,69] | 60[50,73] | <0.001 * |
| Gender (Male/Female,%) | 47.3/52.7 | 39.2/60.8 | <0.001 * |
| BMI (kg/m2) | 28.2[25.4,31.3] | 27.1[24.4,30.4] | <0.001 * |
| Height (cm) | 167[160,175] | 165[158.8,174] | <0.001 * |
| Waist/hip ratio | 95.1[90.1,99.2] | 89.9[81.5,96.2] | <0.001 * |
| Smoking status (Never/Current/Former,%) | 49.9/7.3/42.8 | 54.6/7.1/38.3 | 0.023 * |
| Lifetime cigarette smoke (packs/year) | 0[0,19] | 0[0,12] | 0.014 * |
| Diabetes (Yes/No,%) | 7.2/92.8 | 3.3/96.7 | <0.001 * |
| Hypertension (Yes/No,%) | 41.5/58.5 | 33.6/66.4 | <0.001 * |
| AHI (events/hour) | 9.9[4.2,19.1] | 8.3[3.3,16.9] | 0.025 * |
| RDI (events/hour) | 30.3[19.2,45] | 26.9[17.1,40.1] | <0.001 * |
Note: Values are reported as number and percent, or as median [lower quartile, upper quartile]. BMI = body mass index; AHI = apnea hypopnea index; RDI = respiratory disturbance index. * represents a significant difference, p < 0.05 ( test, -test or non-parametric test).
HRV metrics of participants involved in this study.
| CVD | non-CVD |
| |
|---|---|---|---|
| TP(ms2) | 2299.4[1458.6,3410.9] | 2324.5[1412.7,3802.3] | 0.186 |
| LF(ms2) | 496.4[296,807.1] | 528.1282.7,929.9] | 0.004 * |
| HF(ms2) | 251.3[120.2,596.9] | 308.3[132,707.8] | 0.004 * |
| HFnorm(n.u.) | 35.9[24.2,50.8] | 38.2[25.4,52.1] | 0.188 |
| SDNN(ms) | 63.5[52.4,76.5] | 64.8[51.7,79.1] | 0.18 |
| RMSSD(ms) | 36.3[25.1,60.3] | 39.4[26.3,62.1] | 0.137 |
| MSE1 | 1.41[1.14,1.71] | 1.49[1.21,1.78] | 0.005 * |
| MSE2 | 1.47[1.25,1.69] | 1.51[1.3,1.73] | 0.037 * |
| MSE3 | 1.46[1.28,1.64] | 1.49[1.28,1.66] | 0.062 |
| MSE4 | 1.49[1.32,1.64] | 1.48[1.3,1.66] | 0.941 |
| MSE5 | 1.53[1.38,1.68] | 1.54[1.36,1.69] | 0.979 |
| MSE6 | 1.57[1.4,1.73] | 1.57[1.39,1.7] | 0.465 |
| MSE7 | 1.57[1.41,1.73] | 1.58[1.4,1.72] | 0.543 |
| MSE8 | 1.58[1.42,1.74] | 1.57[1.4,1.71] | 0.068 |
| MSE9 | 1.58[1.4,1.74] | 1.56[1.4,1.71] | 0.06 |
| MSE10 | 1.57[1.41,1.73] | 1.55[1.39,1.7] | 0.018 * |
| MPE | 5.69[5.49,5.84] | 5.61[5.41,5.82] | <0.001 * |
Note: Values are reported as median [lower quartile, upper quartile]. * represents a significant difference, p < 0.05 ( test, -test or non-parametric test).
Figure 1The results of distribution similarity tests between the under-sampled and the original non-CVD group. Logistic or classified features, including gender, smoking status, hypertension and diabetes, were excluded. In K-S test, value less than 0.05 represents for a significant difference of distribution, while a value of JSD closing to one corresponds to a significant difference of distribution. K-S test = Kolmogorov-Smirnov test; JSD = Jensen-Shannon divergence.
The performance of the long-term prediction model and short-term prediction model.
| ACC (%) | TPR (%) | TNR (%) | PPV (%) | F1 (%) | MCC | ||
|---|---|---|---|---|---|---|---|
| long-term | 1-fold | 69.7 | 80.2 | 56.8 | 69.4 | 74.4 | 0.38 |
| 2-fold | 74.4 | 84.8 | 61.8 | 73.0 | 78.5 | 0.48 | |
| 3-fold | 72.6 | 87.7 | 54.3 | 70.1 | 77.9 | 0.45 | |
| 4-fold | 73.8 | 80.7 | 65.3 | 74.0 | 77.2 | 0.47 | |
| 5-fold | 77.1 | 87.9 | 63.9 | 74.8 | 80.8 | 0.54 | |
| average | 73.5 | 84.2 | 60.4 | 72.3 | 77.8 | 0.46 | |
| short-term | 1-fold | 82.1 | 78.6 | 85.7 | 84.6 | 81.5 | 0.64 |
| 2-fold | 75.0 | 64.3 | 85.7 | 81.8 | 72.0 | 0.51 | |
| 3-fold | 85.7 | 100.0 | 71.4 | 71.4 | 87.5 | 0.75 | |
| 4-fold | 85.7 | 78.6 | 92.9 | 91.7 | 84.6 | 0.72 | |
| 5-fold | 78.6 | 85.7 | 71.4 | 75.0 | 80.0 | 0.58 | |
| average | 81.4 | 81.4 | 81.4 | 82.2 | 81.1 | 0.64 |
Note: ACC = Accuracy; TPR = Sensitivity or Recall; TNR = Specificity; PPV = Precision; F1 = F1-Score; MCC = Matthew correlation coefficient.
The performance of two prediction models consisting of different feature vectors.
| Prediction Model | Components of Feature Vector | ACC (%) | TPR (%) | TNR (%) | PPV (%) | F1 (%) | MCC |
|---|---|---|---|---|---|---|---|
| long-term | clinical characteristics and HRV metrics | 73.5 | 84.2 | 60.4 | 72.3 | 77.8 | 0.46 |
| only clinical characteristics | 72.9 | 82.4 | 61.3 | 72.3 | 77.0 | 0.45 | |
| short-term | clinical characteristics and HRV metrics | 81.4 | 81.4 | 81.4 | 82.2 | 81.1 | 0.64 |
| only clinical characteristics | 76.4 | 85.7 | 67.1 | 72.8 | 78.4 | 0.55 |
Note: All models were based on 5-fold cross validation, but only reported the average value of each metrics in this table. ACC = Accuracy; TPR = Sensitivity or Recall; TNR = Specificity; PPV = Precision; F1 = F1-Score; MCC = Matthew correlation coefficient.
Figure 2Feature importance in (a) long-term model and (b) short-term model. The horizontal axis shows the relative feature importance (i.e., the ratio of used times of each feature to the total used times of all features).