| Literature DB >> 34585531 |
Qi Wang1,2, Bin Li1, Kangyu Chen2, Fei Yu2, Hao Su2, Kai Hu2, Zhiquan Liu2, Guohong Wu2, Ji Yan2, Guohai Su1.
Abstract
AIMS: Predicting the risk of malignant arrhythmias (MA) in hospitalized patients with heart failure (HF) is challenging. Machine learning (ML) can handle a large volume of complex data more effectively than traditional statistical methods. This study explored the feasibility of ML methods for predicting the risk of MA in hospitalized HF patients. METHODS ANDEntities:
Keywords: Heart failure; Machine learning; Tachycardia, Ventricular; Ventricular fibrillation
Mesh:
Year: 2021 PMID: 34585531 PMCID: PMC8712774 DOI: 10.1002/ehf2.13627
Source DB: PubMed Journal: ESC Heart Fail ISSN: 2055-5822
Figure 1Flow chart of the study protocol. CART, classification and regression tree; ECG, electrocardiogram; Echo, Echocardiography; HF, heart failure; MARS, multivariate adaptive regression splines; RF, random forest; ROC, receiver operating characteristic; XGBoost, eXtreme gradient boosting.
Main baseline characteristics of patients in the training and validation sets
| Variable | Training set ( | Validation set ( |
|
|---|---|---|---|
| Age (years) | 70 (61,77) | 69 (61,77) | 0.237 |
| Female | 776 (39.5) | 327 (39.4) | 0.955 |
| BMI (kg/m2) | 23 (21, 26) | 24 (21, 27) | 0.039 |
| Heart rate (beats/min) | 78 (68, 90) | 79 (70, 91) | 0.370 |
| SBP (mmHg) | 128 (114, 143) | 129 (115, 144) | 0.190 |
| Hospital stay (days) | 9 (7, 12) | 9 (7, 12) | 0.345 |
| NYHA class II | 349 (17.8) | 159 (19.2) | 0.385 |
| Coronary heart disease | 898 (45.7) | 374 (45.1) | 0.748 |
| Hypertension | 1008 (51.3) | 430 (51.8) | 0.815 |
| Atrial flutter or fibrillation | 700 (35.6) | 296 (35.7) | 0.992 |
| Diabetes mellitus | 443 (22.6) | 176 (21.2) | 0.432 |
| COPD | 100 (5.1) | 39 (4.7) | 0.663 |
| QRSd ≧ 120 ms | 754(38.4) | 302(36.4) | 0.318 |
| LBBB morphology | 283 (14.4) | 114 (13.7) | 0.641 |
| LVEF (%) | 46 (33, 61) | 44 (33, 60) | 0.188 |
| NT‐proBNP (pg/mL) | 2443 (1411, 4657) | 2521 (1450, 4930) | 0.537 |
| Scr (μmol/L) | 85 (69, 109) | 84 (69, 108) | 0.710 |
| CRT/ICD | 92 (4.7) | 46 (5.5) | 0.339 |
| PCI/CABG | 261 (13.3) | 116 (14.0) | 0.627 |
| ARNI/ACEI/ARB | 1287 (65.5) | 547 (65.9) | 0.849 |
| Beta blocker | 1268 (64.6) | 523 (63.0) | 0.435 |
| Antisterone | 1751 (89.2) | 746 (89.9) | 0.570 |
| Loop diuretic | 1829 (93.1) | 766 (92.3) | 0.432 |
| Cardiotonic drug | 1226 (62.4) | 530 (63.9) | 0.474 |
| Nitrate | 1058 (53.9) | 457 (55.1) | 0.564 |
| Oral antiarrhythmic drug | 452 (23.0) | 200 (24.1) | 0.537 |
| Chinese medicine | 144 (7.3) | 55 (6.6) | 0.508 |
ACEI, angiotensin converting enzyme inhibitor; ARB, angiotensin receptor antagonist; ARNI, angiotensin receptor neprilysin inhibitor; BMI, body mass index; CABG, coronary artery bypass graft; COPD, chronic obstructive pulmonary diseases; CRT, cardiac resynchronization therapy; ICD, implantable cardioverter defibrillator; LBBB, left bundle branch block; LVEF, left ventricular ejection fraction; NT‐proBNP, amino‐terminal pro‐brain natriuretic peptide; NYHA functional class, New York Heart Association functional class; PCI, percutaneous coronary intervention; QRSd, QRS duration; SBP, systolic blood pressure; Scr, serum creatinine.
Important features and performance of different prediction models
| Model | Important features | Training set AUC (95% CI) | Validation set AUC (95% CI) | Training set Brier score (95% CI) | Validation set Brier score (95% CI) |
|---|---|---|---|---|---|
| Lasso‐logistic model 1 | LBBB, oral antiarrhythmic drug, antithrombotic drug, Mg, LVEF, cardiac metabolic drug, | 0.905 (0.866–0.943) | 0.867 (0.819–0.915) | 0.027 (0.023–0.035) | 0.042 (0.032–0.056) |
| Lasso‐logistic model 2 | LBBB, oral antiarrhythmic drug, Mg, LDH | 0.881 (0.844–0.918) | 0.828 (0.764–0.892) | 0.030 (0.026–0.039) | 0.041 (0.031–0.054) |
| MARS | LBBB, Mg, oral antiarrhythmic drug, AST, antithrombotic drug, TRPG, SPAP, | 0.926 (0.896–0.955) | 0.852 (0.793–0.910) | 0.025 (0.020–0.032) | 0.036 (0.027–0.049) |
| CART | LBBB, myoglobin, oral antiarrhythmic drug, FT4, LAD | 0.773 (0.713–0.832) | 0.743 (0.661–0.824) | 0.026 (0.020–0.033) | 0.042 (0.032–0.057) |
| Random forest (top15) | Mg, LBBB, neutrophil, RBG, CK, globulin, TP, MCV, CKMB, TSH, LVEDD, WBC, LDH, MCH, CO2 | 0.779 (0.720–0.837) | 0.804 (0.726–0.881) | 0.034 (0.029–0.044) | 0.040 (0.030–0.054) |
| XGBoost (top15) | LBBB, Mg, oral antiarrhythmic drug, CK, TSH, FT4, AST, | 0.998 (0.997–1.000) | 0.864 (0.810–0.918) | 0.005 (0.003–0.009) | 0.037 (0.027–0.051) |
ALT, alanine aminotransferase; AST, aspartate aminotransferase; BMI, body mass index; CK, creatine kinase; CKMB, creatine kinase MB form; FS, fractional shortening; FT4, free tetraiodothyronine; LAD, left atrial diameter; LBBB, left bundle branch block; LDH, lactate dehydrogenase; LDL‐C, low density lipoprotein cholesterol; LVEDD, left ventricular end diastolic diameter; LVEF, left ventricular ejection fraction; MCH, mean corpuscular haemoglobin; MCV, mean corpuscular volume; RBG, random blood glucose; SPAP, systolic pulmonary artery pressure; TP, total protein; TRPG, tricuspid regurgitant pressure gradient; TSH, thyroid stimulating hormone; WBC, leukocyte count.
Figure 2Classification and regression tree (CART) analysis for malignant arrhythmia. Below ‘Yes’ and ‘No’ in the boxes are the proportions of patients in each group with and without malignant arrhythmia. The ratios at the bottom of the boxes represent the number of patients in each group as a percentage of the study population. The colour of the boxes varies with the incidence of malignant arrhythmias. LBBB, left bundle branch block; LAD, left atrial diameter.
Figure 3Receiver operating characteristic curves and AUCs of different prediction models. (A) Training set and (B) validation set. AUC, area under the receiver operating characteristic curve; CART, classification and regression tree; MARS, multivariate adaptive regression splines; XGBoost, eXtreme gradient boosting.
Figure 4Calibration plots of predicted probabilities and actual proportions for different prediction models. (A) Training set and (B) validation set. CART, classification and regression tree; MARS, multivariate adaptive regression splines; XGBoost, eXtreme gradient boosting.
Figure 5Decision analysis curves of the Lasso‐logistic models. (A) Training set and (B) validation set.