| Literature DB >> 27835634 |
Wenhui Chen1,2,3, Lianrong Zheng1,2,3, Kunyang Li1,2,3, Qian Wang1,2,3, Guanzheng Liu1,2,3, Qing Jiang1,2,3.
Abstract
Risk assessment of congestive heart failure (CHF) is essential for detection, especially helping patients make informed decisions about medications, devices, transplantation, and end-of-life care. The majority of studies have focused on disease detection between CHF patients and normal subjects using short-/long-term heart rate variability (HRV) measures but not much on quantification. We downloaded 116 nominal 24-hour RR interval records from the MIT/BIH database, including 72 normal people and 44 CHF patients. These records were analyzed under a 4-level risk assessment model: no risk (normal people, N), mild risk (patients with New York Heart Association (NYHA) class I-II, P1), moderate risk (patients with NYHA III, P2), and severe risk (patients with NYHA III-IV, P3). A novel multistage classification approach is proposed for risk assessment and rating CHF using the non-equilibrium decision-tree-based support vector machine classifier. We propose dynamic indices of HRV to capture the dynamics of 5-minute short term HRV measurements for quantifying autonomic activity changes of CHF. We extracted 54 classical measures and 126 dynamic indices and selected from these using backward elimination to detect and quantify CHF patients. Experimental results show that the multistage risk assessment model can realize CHF detection and quantification analysis with total accuracy of 96.61%. The multistage model provides a powerful predictor between predicted and actual ratings, and it could serve as a clinically meaningful outcome providing an early assessment and a prognostic marker for CHF patients.Entities:
Mesh:
Year: 2016 PMID: 27835634 PMCID: PMC5105944 DOI: 10.1371/journal.pone.0165304
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Flowchart of entire work.
N: normal people; P: CHF patients, in which 1 is of NYHA I-II, 2 is of NYHA III, 3 is of NYHA III-IV; S1: basic measures of 24-h RR interval data, which reflect long-term data variation); S2: basic measures of the second 5-min segment, which representing a stable measurement condition of short-term data; S3: mid-value of basic measures of 5-min segments, which showing an intermediate state of short-term data; D1: mean value of basic measures of 5-min segments, for robustness improvement; D2: standard deviation of each basic measure of 5-min segments; D3: root mean square of each basic measure of 5-min segments; D4: coefficient variation of each basic measure of 5-min segments; D5: percentage of abnormal value (value intervening M±S) of each basic measure of 5-min segments; D6: sample entropy of each basic measure of 5-min segments; D7: fuzzy entropy of each basic measure of 5-min segments.; DT-SVM: decision tree based support vector machine.
Fig 2Multistage classification algorithm based on DT-SVM for risk assessment.
Upper diagram: tree-structured classifier. Lower diagram: wrappers for feature selection. N: normal samples; P: CHF patients, in which 1 is of NYHA I-II, 2 is of NYHA III, 3 is of NYHA III-IV; DSF: disease screening function; RAF: risk assessment function, in which I is for discriminating the higher risk from the lower risk, II is for distinction of moderate risk and mild risk; BE: backward elimination; SD: significance difference.
Classification performance of classical SVM in 4-level risk assessment.
| Method | Input Feature | Accuracy (%) |
|---|---|---|
| Classical SVM | SI | 76.27 |
| DI | 67.80 | |
| SI+DI | 76.27 |
SI: static indices; DI: dynamic indices;
* represents that significance value of features were under 0.1.
Performance of different feature combinations for disease detection and quantification.
| Groups | Accuracy | Destination | Method | ||
|---|---|---|---|---|---|
| SI | DI | SI | |||
| N vs. P | 86.44 | 89.83 | C-SVM | ||
| P1 vs. P2&P3 | 73.91 | ||||
N: normal samples; P: CHF patients, in which 1 is of NYHA I-II, 2 is of NYHA III, 3 is of NYHA III-IV; SI: static indices; DI: dynamic indices;
* represents that significance level of features were under 0.1; C-SVM: classical SVM.
Result of node selection for level 1 among all samples.
| Node | number of | number of | number of | number of | number of |
|---|---|---|---|---|---|
| N vs. rest | 75 | ||||
| P1 vs. rest | 0 | 17 | 39 | 51 | 129 |
| P2 vs. rest | 15 | 41 | 73 | 86 | 94 |
| P3 vs. rest | 28 | 52 | 68 | 83 | 97 |
| N&P1 vs. rest | 48 | 65 | 88 | 102 | 78 |
| N&P2 vs. rest | 29 | 51 | 71 | 86 | 94 |
| N&P3 vs. rest | 33 | 61 | 85 | 97 | 83 |
N: normal samples; P: CHF patients, in which 1 is of NYHA I-II, 2 is of NYHA III, 3 is of NYHA III-IV.
Result of node selection for level 2 among CHF patients.
| Node | number of | number of | number of | number of | number of |
|---|---|---|---|---|---|
| P1 vs. rest | 0 | 3 | 19 | 39 | 141 |
| P2 vs. rest | 0 | 0 | 7 | 22 | 157 |
| P3 vs. rest | 131 |
P: CHF patients, in which 1 is of NYHA I-II, 2 is of NYHA III, 3 is of NYHA III-IV.
Fig 3Multistage risk assessment model of CHF.
DSF: disease screening function to detect normal from patients; RAF: risk assessment function, in which I is for discriminating the higher risk from the lower risk, II is for distinction of moderate risk and mild risk; N: normal samples; P: CHF patients, in which 1 is of NYHA I-II, 2 is of NYHA III, 3 is of NYHA III-IV.
Selected optimal feature subsets for each level with backward elimination.
| Node | Input feature numbers | Optimal feature subsets | Effectiveness (%) |
|---|---|---|---|
| Node 1 | 111 | S1T1,S1T5,S1F4,S2T1,S3T1,S3T3,S2F3,S3E8 | 7.21 |
| Node 2 | 49 | D5F4,D5E1 | 4.08 |
| Node 3 | 12 | D2T4,D3F3,D4T1,D5E5 | 33.33 |
*: Meaning of features were defined in HRV Measurement; Effectiveness is ratio of number of selected features to number of input features at each node.
Fig 4Confusion matrices.
N: normal samples; P: CHF patients, in which 1 is of NYHA I-II, 2 is of NYHA III, 3 is of NYHA III-IV.
Classification performance.
| Node | TP | TN | FP | FN | ACC (%) | SEN (%) | SPE (%) | PRE (%) | AUC (%) | Total ACC (%) |
|---|---|---|---|---|---|---|---|---|---|---|
| Node1 | 36 | 22 | 0 | 1 | 98.31 | 97.3 | 100 | 100 | 98.65 | |
| Node2 | 8 | 13 | 0 | 1 | 95.45 | 88.89 | 100 | 100 | 94.45 | |
| Node3 | 6 | 7 | 0 | 0 | 100 | 100 | 100 | 100 | 100 |
TP: true positive, TN: true negative, FP: false positive, FN: false negative;
ACC = (TP + TN)/(TP + TN + FP + FN), SEN = TP/(TP + FN), SPE = TN/(TN + FN), PRE = TP/(TP + FP), AUC = 1/2(SEN + SPE).
Highlight.
| Reference | Classes | Samples*Time | Feature | Feature Selection | Classifier | Accuracy | Highlight |
|---|---|---|---|---|---|---|---|
| Yu et al. | 83*68min | SI | GA | SVM | 96.38% | CHF detection based on bi-spectral HRV analysis and genetic algorithm | |
| Isler et al. | ( | 83*5min | SI | GA | KNN | 96.39% | CHF detection by combining classical HRV with wavelet entropy measures |
| Melillo et al. | 44*24h | SI | ESM | CART | 85.40% | 2-level CHF quantification in patients with CHF via long-term HRV and CART algorithm | |
| Our work | BE |
N: normal samples; P: CHF patients, in which 1 is of NYHA I-II, 2 is of NYHA III, 3 is of NYHA III-IV; SI: static indices; DI: dynamic indices; GA: genetic algorithm; ESM: exhaustive search method; BE: backward elimination; SVM: support vector machine; KNN: k-nearest neighbor; DT-SVM: decision tree based support vector machine.