| Literature DB >> 34390311 |
Takeshi Tohyama1,2, Tomomi Ide2, Masataka Ikeda2, Hidetaka Kaku2, Nobuyuki Enzan2, Shouji Matsushima2, Kouta Funakoshi1, Junji Kishimoto1, Koji Todaka1, Hiroyuki Tsutsui2.
Abstract
AIMS: Individual risk stratification is a fundamental strategy in managing patients with heart failure (HF). Artificial intelligence, particularly machine learning (ML), can develop superior models for predicting the prognosis of HF patients, and administrative claim data (ACD) are suitable for ML analysis because ACD is a structured database. The objective of this study was to analyse ACD using an ML algorithm, predict the 1 year mortality of patients with HF, and finally develop an easy-to-use prediction model with high accuracy using the top predictors identified by the ML algorithm. METHODS ANDEntities:
Keywords: Artificial intelligence; Heart failure; Machine learning; Risk prediction
Mesh:
Year: 2021 PMID: 34390311 PMCID: PMC8497366 DOI: 10.1002/ehf2.13556
Source DB: PubMed Journal: ESC Heart Fail ISSN: 2055-5822
Figure 1Data extraction from the Japanese Registry of Acute Decompensated Heart Failure (JROADHF) study.
Baseline patient characteristics
| Characteristics |
|
|---|---|
| Age, years | 80 [70–86] |
| Female, | 4606 (45.3) |
| Body mass index, kg/m2 | 22.4 [19.9–25.3] |
| Aetiologies, | |
| Ischaemic | 3657 (35.9) |
| Hypertensive disease | 2578 (25.3) |
| Valvular disease | 3587 (35.3) |
| Cardiomyopathy | 1571 (15.4) |
| Comorbidities, | |
| Hypertension | 7411 (72.8) |
| Diabetes mellitus | 3637 (35.7) |
| Dyslipidaemia | 3329 (32.7) |
| COPD | 663 (6.5) |
| Chronic kidney disease | 3980 (39.1) |
| NYHA class on admission, | |
| I or II | 1566 (15.6) |
| III or IV | 8461 (84.4) |
| NYHA class at discharge, | |
| I or II | 8800 (91.0) |
| III or IV | 868 (9.0) |
| Blood pressure, mmHg | |
| Systolic | 137 [118–160] |
| Diastolic | 78 [65–92] |
| Pulse rate, b.p.m. | 88 [73–107] |
| LVEF, % | 46 [32–61] |
| LVEF < 40%, | 3444 (38.0) |
| Serum sodium, mmol/L | 139 [137–141] |
| Serum potassium, mmol/L | 4.3 [4.0–4.7] |
| Estimated GFR, mL/min/1.73 m2 | 44.1 [29.4–59.5] |
| Haemoglobin, g/dL | 11.5 [10.1–13.2] |
| Barthel index | 100 [65–100] |
| Medication, | |
| Diuretics | 8408 (82.8) |
| RAS inhibitor | 6140 (60.5) |
| Beta‐blockers | 6099 (60.1) |
| MRA | 4512 (44.4) |
COPD, chronic obstructive pulmonary disease; GFR, glomerular filtration rate; LVEF, left ventricular ejection fraction; MRA, mineralocorticoid receptor antagonist; NYHA, New York Heart Association; RAS, renin–angiotensin–aldosterone system.
Values are medians [interquartile ranges] or n (%).
Figure 2(A) Comparison of the receiver operating characteristic curves of the six machine learning models when applied to the test set. All six models have similar predictive performance. (B) Calibration plot of six models. The data points are subdivided into deciles of predicted probability. Each mean observed mortality is plotted against the mean model probability. Perfect model calibration corresponds with the y = x line.
Predictive performances of the machine learning models and conventional risk models
| Model | Accuracy | Sensitivity | Specificity | c‐statistic | Brier score |
|---|---|---|---|---|---|
| Logistic lasso regression | 73.3 | 66.8 | 74.7 | 0.776 [0.750–0.802] | 0.121 |
| Support vector machine | 69.6 | 69.7 | 69.6 | 0.764 [0.738–0.791] | 0.123 |
| Random forest | 72.4 | 66.5 | 73.6 | 0.767 [0.740–0.794] | 0.125 |
| Gradient boosting tree | 75.0 | 61.0 | 77.9 | 0.765 [0.738–0.792] | 0.123 |
| Voting classifier (ACD‐VC) | 70.7 | 71.1 | 70.6 | 0.777 [0.751–0.803] | 0.121 |
| Neural network | 73.8 | 66.8 | 75.2 | 0.776 [0.750–0.802] | 0.121 |
| SMART‐HF | 71.2 | 67.9 | 71.8 | 0.765 [0.739–0.791] | 0.124 |
| SHFM | 80.5 | 26.6 | 91.7 | 0.713 [0.684–0.742] | 0.139 |
| MAGGIC | 70.4 | 58.7 | 72.9 | 0.726 [0.698–0.753] | 0.130 |
| SHFM‐VC | 73.5 | 64.5 | 75.3 | 0.760 [0.734–0.785] | 0.125 |
| MAGGIC‐VC | 64.8 | 75.1 | 62.6 | 0.754 [0.728–0.781] | 0.126 |
ACD‐VC, voting classifier based on administrative claim data; MAGGIC, meta‐analysis global group in chronic heart failure risk score; MAGGIC‐VC, voting classifier based on the variables in MAGGIC; NYHA, New York Heart Association; SHFM, Seattle Heart Failure Model; SHFM‐VC, voting classifier based on the variables in SHFM; SMART‐HF, simple model by artificial intelligence for heart failure risk stratification.
Values are means [95% confidence intervals].
Figure 3Top 20 significant predictors of the developed machine learning model for 1 year mortality in heart failure patients calculated by permutation feature importance. ARB, angiotensin II receptor blocker; DOAC, direct oral anticoagulant; RHC, right heart catheterization.
Figure 4(A) Comparison of ACD‐based models (ACD‐VC and SMART‐HF) with the conventional risk models of SHFM and MAGGIC when applied to the test set. (B) Calibration plot of the four models. Data points are subdivided into deciles of predicted probability. Each mean observed mortality is plotted against the mean model probability. ACD, administrative claim data; ACD‐VC, ACD‐based voting classifier; MAGGIC, Meta‐Analysis Global Group in Chronic Heart Failure; SHFM, Seattle Heart Failure Model; SMART‐HF, Simple Model by ARTificial intelligence for HF risk stratification.