| Literature DB >> 33865317 |
Nan Liu1,2,3, Marcel Lucas Chee4, Zhi Xiong Koh5, Su Li Leow6, Andrew Fu Wah Ho6,5, Dagang Guo7, Marcus Eng Hock Ong6,8,5.
Abstract
BACKGROUND: Chest pain is among the most common presenting complaints in the emergency department (ED). Swift and accurate risk stratification of chest pain patients in the ED may improve patient outcomes and reduce unnecessary costs. Traditional logistic regression with stepwise variable selection has been used to build risk prediction models for ED chest pain patients. In this study, we aimed to investigate if machine learning dimensionality reduction methods can improve performance in deriving risk stratification models.Entities:
Keywords: Chest pain; Dimensionality reduction; Emergency department; Heart rate n-variability (HRnV); Heart rate variability (HRV); Machine learning
Mesh:
Year: 2021 PMID: 33865317 PMCID: PMC8052947 DOI: 10.1186/s12874-021-01265-2
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
List of traditional heart rate variability (HRV) and novel heart rate n-variability (HRnV) parameters used in this study. HRnV is a new representation of beat-to-beat variation in ECGs and parameter “n” controls the formation of new RR-intervals that are used for parameter calculation. Details of HRnV definition can be found in [28]
| HRV | HR | HR | HR | HR | HR |
|---|---|---|---|---|---|
| Mean NN | HR2V Mean NN | HR2V1 Mean NN | HR3V Mean NN | HR3V1 Mean NN | HR3V2 Mean NN |
| SDNN | HR2V SDNN | HR2V1 SDNN | HR3V SDNN | HR3V1 SDNN | HR3V2 SDNN |
| RMSSD | HR2V RMSSD | HR2V1 RMSSD | HR3V RMSSD | HR3V1 RMSSD | HR3V2 RMSSD |
| Skewness | HR2V Skewness | HR2V1 Skewness | HR3V Skewness | HR3V1 Skewness | HR3V2 Skewness |
| Kurtosis | HR2V Kurtosis | HR2V1 Kurtosis | HR3V Kurtosis | HR3V1 Kurtosis | HR3V2 Kurtosis |
| Triangular index | HR2V Triangular index | HR2V1 Triangular index | HR3V Triangular index | HR3V1 Triangular index | HR3V2 Triangular index |
| NN50 | HR2V NN50 | HR2V1 NN50 | HR3V NN50 | HR3V1 NN50 | HR3V2 NN50 |
| pNN50 | HR2V pNN50 | HR2V1 pNN50 | HR3V pNN50 | HR3V1 pNN50 | HR3V2 pNN50 |
| – | HR2V NN50 | HR2V1 NN50 | HR3V NN50 | HR3V1 NN50 | HR3V2 NN50 |
| – | HR2V pNN50 | HR2V1 pNN50 | HR3V pNN50 | HR3V1 pNN50 | HR3V2 pNN50 |
| Total powera | HR2V Total power | HR2V1 Total power | HR3V Total power | HR3V1 Total power | HR3V2 Total power |
| VLF power | HR2V VLF power | HR2V1 VLF power | HR3V VLF power | HR3V1 VLF power | HR3V2 VLF power |
| LF power | HR2V LF power | HR2V1 LF power | HR3V LF power | HR3V1 LF power | HR3V2 LF power |
| HF power | HR2V HF power | HR2V1 HF power | HR3V HF power | HR3V1 HF power | HR3V2 HF power |
| LF power norm | HR2V LF power norm | HR2V1 LF power norm | HR3V LF power norm | HR3V1 LF power norm | HR3V2 LF power norm |
| HF power norm | HR2V HF power norm | HR2V1 HF power norm | HR3V HF power norm | HR3V1 HF power norm | HR3V2 HF power norm |
| LF/HF | HR2V LF/HF | HR2V1 LF/HF | HR3V LF/HF | HR3V1 LF/HF | HR3V2 LF/HF |
| Poincaré SD1 | HR2V Poincaré SD1 | HR2V1 Poincaré SD1 | HR3V Poincaré SD1 | HR3V1 Poincaré SD1 | HR3V2 Poincaré SD1 |
| Poincaré SD2 | HR2V Poincaré SD2 | HR2V1 Poincaré SD2 | HR3V Poincaré SD2 | HR3V1 Poincaré SD2 | HR3V2 Poincaré SD2 |
| Poincaré SD1/SD2 ratio | HR2V Poincaré SD1/SD2 | HR2V1 Poincaré SD1/SD2 | HR3V Poincaré SD1/SD2 | HR3V1 Poincaré SD1/SD2 | HR3V2 Poincaré SD1/SD2 |
| SampEn | HR2V SampEn | HR2V1 SampEn | HR3V SampEn | HR3V1 SampEn | HR3V2 SampEn |
| ApEn | HR2V ApEn | HR2V1 ApEn | HR3V ApEn | HR3V1 ApEn | HR3V2 ApEn |
| DFA, α1 | HR2V DFA, α1 | HR2V1 DFA, α1 | HR3V DFA, α1 | HR3V1 DFA, α1 | HR3V2 DFA, α1 |
| DFA, α2 | HR2V DFA, α2 | HR2V1 DFA, α2 | HR3V DFA, α2 | HR3V1 DFA, α2 | HR3V2 DFA, α2 |
Mean NN average of R-R intervals, SDNN standard deviation of R-R intervals, RMSSD square root of the mean squared differences between R-R intervals, NN50 the number of times that the absolute difference between 2 successive R-R intervals exceeds 50 ms pNN50, NN50 divided by the total number of R-R intervals, NN50n the number of times that the absolute difference between 2 successive RRI/RRI sequences exceeds 50 × n ms, pNN50n NN50n divided by the total number of RRI/RRI sequences, VLF very low frequency, LF low frequency, HF high frequency, SD standard deviation, SampEn sample entropy, ApEn approximate entropy, DFA detrended fluctuation analysis
aIn frequency domain analysis, the power of spectral components is the area below the relevant frequencies presented in absolute units (square milliseconds)
Summary of machine learning dimensionality reduction methods used in this study
| Methods | Descriptions |
|---|---|
| Principal component analysis (PCA) [ | PCA decomposes data into a set of successive orthogonal components that explain a maximum amount of the variance |
| Kernel PCA (KPCA) [ | KPCA extends PCA by using kernel functions to achieve non-linear dimensionality reduction |
| Latent semantic analysis (LSA) [ | LSA is similar to PCA but differs in that the data matrix does not need to be centered |
| Gaussian random projection (GRP) [ | GRP projects the original input features onto a randomly generated matrix where components are drawn from a Gaussian distribution |
| Sparse random projection (SRP) [ | SRP projects the original input features onto a sparse random matrix, which is an alternative to dense Gaussian random projection matrix |
| Multidimensional scaling (MDS) [ | MDS is a technique used for analyzing similarity or dissimilarity data, seeking a low-dimensional representation of the data in which the distances respect well the distances in the original high-dimensional space |
| Isomap [ | Isomap is a manifold learning algorithm, seeking a lower-dimensional embedding that maintains geodesic distances between all points |
| Locally linear embedding (LLE) [ | LLE projects the original input features to a lower-dimensional space by preserving distances within local neighborhoods |
Baseline characteristics of patient cohorts
| Total ( | MACE ( | Non-MACE ( | ||
|---|---|---|---|---|
| 60 (51–68) | 61 (54–68) | 59 (50–68) | 0.035 | |
| 542 (68.2) | 188 (76.1) | 354 (64.6) | 0.002 | |
| 0.623 | ||||
| Chinese | 492 (61.9) | 159 (64.4) | 333 (60.8) | 0.374 |
| Indian | 129 (16.2) | 34 (13.8) | 95 (17.3) | 0.246 |
| Malay | 150 (18.9) | 46 (18.6) | 104 (19.0) | 0.984 |
| Other | 24 (3.0) | 8 (3.2) | 16 (2.9) | 0.984 |
| Temperature (°C) | 36.4 (36.0–36.7) | 36.3 (36.0–36.7) | 36.4 (36.0–36.7) | 0.793 |
| Heart rate (beats/min) | 76 (67–89) | 80 (69–92.5) | 75 (66–87) | 0.03 |
| Respiratory rate (breaths/min) | 18 (17–18) | 18 (17–18) | 18 (17–18) | 0.716 |
| Systolic blood pressure (mmHg) | 138 (123.0–159.0) | 142 (123.5–165.5) | 137 (122.0–156.2) | 0.037 |
| Diastolic blood pressure (mmHg) | 76.0 (68.0–86.0) | 78.0 (70.0–89.0) | 75.0 (67.0–84.0) | 0.001 |
| SpO2 (%) | 99.0 (97.0–100.0) | 99.0 (97.0–100.0) | 99.0 (97.0–100.0) | 0.842 |
| Pain score | 2.0 (0.0–5.0) | 2.0 (0.0–5.0) | 2.0 (0.0–5.0) | 0.008 |
| Glasgow Coma Scale (GCS) score | 15.0 (15.0–15.0) | 15.0 (15.0–15.0) | 15.0 (15.0–15.0) | 0.121 |
| Ischaemic heart disease | 343 (43.1) | 115 (46.6) | 228 (41.6) | 0.22 |
| Diabetes | 278 (35.0) | 106 (42.9) | 172 (31.4) | 0.002 |
| Hypertension | 509 (64.0) | 161 (65.2) | 348 (63.5) | 0.707 |
| Hypercholesterolaemia | 476 (59.9) | 151 (61.1) | 325 (59.3) | 0.683 |
| Stroke | 58 (7.3) | 15 (6.1) | 43 (7.8) | 0.458 |
| Cancer | 29 (3.6) | 7 (2.8) | 22 (4.0) | 0.537 |
| Respiratory disease | 31 (3.9) | 5 (2.0) | 26 (4.7) | 0.102 |
| Chronic kidney disease | 87 (10.9) | 26 (10.5) | 61 (11.1) | 0.896 |
| Congestive heart failure | 38 (4.8) | 9 (3.6) | 29 (5.3) | 0.407 |
| History of PCI | 199 (25.0) | 68 (27.5) | 131 (23.9) | 0.316 |
| History of CABG | 71 (8.9) | 26 (10.5) | 45 (8.2) | 0.355 |
| History of AMI | 133 (16.7) | 48 (19.4) | 85 (15.5) | 0.205 |
| Active smoker | 197 (24.8) | 73 (29.6) | 124 (22.6) | 0.045 |
| ST elevation | 65 (8.2) | 48 (19.4) | 17 (3.1) | < 0.001 |
| ST depression | 92 (11.6) | 69 (27.9) | 23 (4.2) | < 0.001 |
| T wave inversion | 209 (26.3) | 86 (34.8) | 123 (22.4) | < 0.001 |
| Pathological Q wave | 86 (10.8) | 51 (20.6) | 35 (6.4) | < 0.001 |
| QTc prolongation | 174 (21.9) | 73 (29.6) | 101 (18.4) | 0.001 |
| Left axis deviation | 64 (8.1) | 16 (6.5) | 48 (8.8) | 0.34 |
| Right axis deviation | 16 (2.0) | 6 (2.4) | 10 (1.8) | 0.773 |
| Left bundle branch block | 8 (1.0) | 3 (1.2) | 5 (0.9) | 0.991 |
| Right bundle branch block | 56 (7.0) | 14 (5.7) | 42 (7.7) | 0.385 |
| Interventricular conduction delay | 30 (3.8) | 13 (5.3) | 17 (3.1) | 0.201 |
| Left atrial abnormality | 12 (1.5) | 4 (1.6) | 8 (1.5) | 0.886 |
| Left ventricular hypertrophy | 103 (13.0) | 38 (15.4) | 65 (11.9) | 0.21 |
| Right ventricular hypertrophy | 6 (0.8) | 1 (0.4) | 5 (0.9) | 0.747 |
| Troponin (ng/L) | 0 (0–39.5) | 40 (10–170) | 0 (0–15.2) | < 0.001 |
| Creatine kinase-MB | 2.4 (1.8–3.2) | 2.7 (2.1–6.0) | 2.4 (1.7–2.7) | < 0.001 |
| HEART | 5.0 (4.0 to 7.0) | 7.0 (6.0 to 8.0) | 4.0 (3.0 to 6.0) | < 0.001 |
| TIMI | 2.0 (1.0 to 4.0) | 3.0 (2.0 to 4.0) | 2.0 (1.0 to 3.0) | < 0.001 |
| GRACE | 104.0 (83.5 to 128.0) | 119.0 (97.0 to 139.0) | 98.0 (78.0 to 125.0) | < 0.001 |
IQR interquartile range, MACE major adverse cardiac events, PCI percutaneous coronary intervention, CABG coronary artery bypass graft, AMI acute myocardial infarction, HEART History, ECG, Age, Risk factors and Troponin, TIMI Thrombolysis in Myocardial Infarction, GRACE Global Registry of Acute Coronary Events
Fig. 1Variable preselection using p-value in univariable analysis for dimensionality reduction: a prediction area under the curve versus the p-value, and (b) the number of preselected variables versus the p-value
Fig. 2Prediction performance based on the eight dimensionality reduction algorithms versus the number of feature dimensions after reduction
Fig. 3ROC curves (based on the optimal number of dimensions) generated by the stepwise model, eight dimensionality reduction models, and three clinical scores
Comparison of performance of the HRnV models (based on 5-fold cross-validation), HEART, TIMI, and GRACE scores in predicting 30-day major adverse cardiac events (MACE). The cut-off values were defined as the points nearest to the upper-left corner on the ROC curves
| AUC (95% CI) | Cut-off | Sensitivity % (95% CI) | Specificity % (95% CI) | PPV % (95% CI) | NPV % (95% CI) | |
|---|---|---|---|---|---|---|
| 0.887 (0.859–0.916) | 0.3140 | 79.4 (74.3–84.4) | 78.8 (75.4–82.3) | 62.8 (57.5–68.2) | 89.4 (86.7–92.2) | |
| 0.899 (0.872–0.926) | 0.2881 | 85.4 (81.0–89.8) | 78.5 (75.0–81.9) | 64.1 (59.0–69.3) | 92.3 (89.9–94.7) | |
| 0.896 (0.869–0.923) | 0.3489 | 81.8 (77.0–86.6) | 82.1 (78.9–85.3) | 67.3 (62.0–72.6) | 90.9 (88.4–93.4) | |
| 0.899 (0.872–0.926) | 0.2884 | 85.4 (81.0–89.8) | 78.6 (75.2–82.1) | 64.3 (59.1–69.5) | 92.3 (89.9–94.7) | |
| 0.896 (0.868–0.923) | 0.2965 | 85.0 (80.6–89.5) | 78.5 (75.0–81.9) | 64.0 (58.8–69.2) | 92.1 (89.6–94.5) | |
| 0.898 (0.871–0.925) | 0.2940 | 84.6 (80.1–89.1) | 79.6 (76.2–82.9) | 65.1 (59.9–70.3) | 92.0 (89.5–94.4) | |
| 0.901 (0.874–0.928) | 0.3095 | 83.4 (78.8–88.0) | 81.6 (78.3–84.8) | 67.1 (61.8–72.4) | 91.6 (89.1–94.1) | |
| 0.888 (0.860–0.917) | 0.3468 | 78.5 (73.4–83.7) | 82.7 (79.5–85.8) | 67.1 (61.7–72.5) | 89.5 (86.9–92.2) | |
| 0.898 (0.870–0.925) | 0.3140 | 85.0 (80.6–89.5) | 79.4 (76.0–82.8) | 65.0 (59.8–70.2) | 92.2 (89.7–94.6) | |
| 0.841 (0.808–0.874) | 5 | 78.9 (73.9–84.0) | 72.8 (69.1–76.5) | 56.7 (51.4–61.9) | 88.5 (85.5–91.4) | |
| 0.681 (0.639–0.723) | 2 | 63.6 (57.6–69.6) | 58.4 (54.3–62.5) | 40.8 (35.9–45.7) | 78.0 (74.0–82.1) | |
| 0.665 (0.623–0.707) | 107 | 64.0 (58.0–70.0) | 60.8 (56.7–64.9) | 42.4 (37.3–47.4) | 78.9 (75.0–82.8) |
AUC area under the curve, CI confidence interval, PPV positive predictive value, NPV negative predictive value, HEART History, ECG, Age, Risk factors and Troponin, TIMI Thrombolysis in Myocardial Infarction, GRACE Global Registry of Acute Coronary Events
Fig. 4Calibration curves (based on the optimal number of dimensions) generated by the stepwise model, eight dimensionality reduction models, and three clinical scores
Fig. 5ROC curves (based on the optimal number of dimensions) generated by the stepwise model, eight dimensionality reduction models, and three clinical scores, where the prediction models were built without using cardiac troponin.s