| Literature DB >> 31499517 |
Shinya Suzuki1, Takeshi Yamashita1, Tsuyoshi Sakama2, Takuto Arita1, Naoharu Yagi1, Takayuki Otsuka1, Hiroaki Semba1, Hiroto Kano1, Shunsuke Matsuno1, Yuko Kato1, Tokuhisa Uejima1, Yuji Oikawa1, Minoru Matsuhama3, Junji Yajima1.
Abstract
AIMS: Non-linear models by machine learning may identify different risk factors with different weighting in comparison to conventional linear models. METHODS ANDEntities:
Mesh:
Year: 2019 PMID: 31499517 PMCID: PMC6733605 DOI: 10.1371/journal.pone.0221911
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Parameters assessed in the prediction models.
| Category | Number of parameters | Parameters |
|---|---|---|
| Patient information | 9 | age, sex, body height, body weight, body mass index, systolic blood pressure, diastolic blood pressure, smoking habit, drinking habit |
| Comorbidity | 43 | hypertension, dyslipidemia, diabetes mellitus, uric acid, chronic kidney disease, anemia, heart failure, stable angina pectoris, vasospastic angina pectoris, acute coronary syndrome, old myocardial infarction, silent myocardial ischemia, ischemic cardiomyopathy, atherosclerosis obliterans, history of percutaneous coronary intervention, history of coronary artery bypass graft, mitral stenosis, mitral regurgitation, aortic stenosis, aortic regurgitation, tricuspid regurgitation, history of heart valve replacement, dilated cardiomyopathy, hypertrophic cardiomyopathy, dilated-phase hypertrophic cardiomyopathy, hypertensive heart disease, congenital heart disease, aortic dissection, aortic aneurism, sick sinus syndrome, atrioventricular block (II or more degrees), atrial fibrillation, atrial tachycardia/atrial flutter, ventricular fibrillation/sustained ventricular tachycardia, non-sustained ventricular tachycardia, history of catheter ablation, permanent pacemaker/implanted cardioverter defibrillator/cardiac resynchronization therapy implantation, history of symptomatic ischemic stroke or transient ischemic attack, history of intracranial hemorrhage, hyperthyroidism, chronic obstructive pulmonary disease, chronic hemodialysis |
| UCG parameters | 14 | interventricular septum thickness, posterior wall thickness, left ventricular end-diastolic diameter, left ventricular diameter at end systole, left ventricular ejection fraction, left atrial dimension, mitral regurgitation, aortic regurgitation, tricuspid regurgitation, right ventricular systolic pressure, E, A, E/A, deceleration time |
| Laboratory data | 26 | total protein, albumin, blood urea nitrogen, creatinine, estimated glomerular filtration rate, uric acid, sodium, potassium, chlorine, triglyceride, total cholesterol, aspartate aminotransferase, alanine transaminase, lactate dehydrogenase, creatine kinase, blood sugar, brain natriuretic peptide, white blood cell count, red blood cell count, hemoglobin, hematocrit, red cell distribution width, platelet count, mean platelet volume, plateletcrit, platelet distribution width |
| Medications | 26 | hypertensive drugs, beta blockers, calcium blockers, angiotensin converting enzyme inhibitors, angiotensin-II receptor blockers, alfa blockers, sodium glucose transporter-2 inhibitors, insulin, statin, eicosapentenoic acid drugs, diuretics, class I anti-arrhythmic drugs, carvedilol, bisoprolol, atenolol, class III anti-arrhythmic drugs, class IV anti-arrhythmic drugs, digitalis, antiplatelet, warfarin, direct oral anticoagulants, anti-thyroid drugs, thyroid drugs, non-steroidal anti-inflammatory drugs, benzodiazepines, non-benzodiazepine |
UCG: ultrasound cardiogram
Patient characteristics.
| Total, | |
|---|---|
| Age, years | 61 ± 14 |
| Male | 10,352 (65) |
| Hypertension | 8,110 (51) |
| Dyslipidemia | 6,250 (39) |
| Diabetes | 3,154 (20) |
| Heart failure | 2,831 (18) |
| Ischemic heart disease | 4,133 (26) |
| Valvular heart disease | 2,197 (14) |
| Cardiomyopathy | 1,352 (9) |
| Atrial fibrillation | 2,805 (18) |
Data are presented as n (%) of patients or mean ± standard deviation.
Incidence rates of patient outcomes.
| Total, | Incidence rate within 2 years |
|---|---|
| All-cause mortality | 217 (1) |
| Cardiovascular events | 786 (5) |
| Heart failure events | 417 (3) |
| Acute coronary syndrome | 247 (2) |
| Ischemic stroke events | 95 (0.6) |
| Intracranial hemorrhage | 59 (0.4) |
Data are presented as n (%).
Top five parameters for patient outcome.
| Machine learning | Logistic regression model | |||
|---|---|---|---|---|
| A. All-cause death | ||||
| Model | AUC | Model | AUC | |
| Support vector machine | 0.900 | ---- | 0.881 | |
| Parameters | IRF (%) | Parameters | PI (%) | |
| 1 | Albumin | 100 | Albumin | 100 |
| 2 | Hemoglobin | 78 | Age | 36 |
| 3 | Aortic aneurism | 44 | Total protein | 29 |
| 4 | Body mass index | 43 | Dyslipidemia | 28 |
| 5 | Hemodialysis | 43 | Carvedilol use | 25 |
| B. Cardiovascular events | ||||
| Model | AUC | Model | AUC | |
| Random forest | 0.848 | ---- | 0.831 | |
| Parameters | IRF (%) | Parameters | PI (%) | |
| 1 | Heart failure | 100 | History of acute coronary syndrome | 100 |
| 2 | History of acute coronary syndrome | 76 | Heart failure | 57 |
| 3 | Left ventricular ejection fraction | 72 | Left ventricular ejection fraction | 30 |
| 4 | Estimated glomerular filtration rate | 57 | Tricuspid regurgitation (degree) | 17 |
| 5 | Mitral regurgitation (degree) | 42 | Statin use | 17 |
| C. Heart failure events | ||||
| Model | AUC | Model | AUC | |
| Random forest | 0.912 | ---- | 0.907 | |
| Parameters | IRF (%) | Parameters | PI (%) | |
| 1 | Left ventricular ejection fraction | 100 | Left ventricular dimension at end-systole | 100 |
| 2 | Heart failure | 93 | Diuretics use | 77 |
| 3 | Age | 57 | Heart failure | 71 |
| 4 | Left ventricular dimension at end-diastole | 57 | Direct oral anticoagulant | 61 |
| 5 | Left atrial dimension | 49 | Left atrial dimension | 58 |
| D. Acute coronary syndrome events | ||||
| Model | AUC | Model | AUC | |
| Elastic-Net | 0.879 | ---- | 0.884 | |
| Parameters | IRF (%) | Parameters | PI (%) | |
| 1 | History of acute coronary syndrome | 100 | History of acute coronary syndrome | 100 |
| 2 | Antiplatelet use | 26 | Antiplatelet use | 23 |
| 3 | Diuretics use | 18 | Stable angina | 20 |
| 4 | Heart failure | 17 | Old myocardial infarction | 9 |
| 5 | Angiotensin receptor-II blocker | 10 | Creatinine | 8 |
| E. Ischemic stroke events | ||||
| Model | AUC | Model | AUC | |
| Support vector machine | 0.758 | ---- | 0.757 | |
| Parameters | IRF (%) | Parameters | PI (%) | |
| 1 | History of ischemic stroke or TIA | 100 | History of ischemic stroke or TIA | 100 |
| 2 | Deceleration time | 98 | Systolic blood pressure | 52 |
| 3 | History of intracranial hemorrhage | 76 | Blood glucose | 51 |
| 4 | Diastolic blood pressure | 48 | Aortic aneurism | 30 |
| 5 | Left ventricular ejection fraction | 34 | Tricuspid regurgitation | 28 |
| F. Intracranial hemorrhage events | ||||
| Model | AUC | Model | AUC | |
| Elastic-Net | 0.753 | ---- | 0.726 | |
| Parameters | IRF (%) | Parameters | PI (%) | |
| 1 | History of intracranial hemorrhage | 100 | History of intracranial hemorrhage | 100 |
| 2 | Warfarin use | 65 | Tricuspid regurgitation (degree) | 79 |
| 3 | Interventricular septal thickness | 37 | Interventricular septal thickness | 50 |
| 4 | Dilated-phase hypertrophic cardiomyopathy | 29 | Estimated glomerular filtration rate | 33 |
| 5 | Sick sinus syndrome | 29 | History of coronary artery bypass graft | 31 |
Abbreviations: AUC; area under the curve, IRF; impact of risk factors, PI; permutation importance, TIA; transient ischemic attack.
Fig 1Impacts of risk factors selected in the prediction models with machine learning (ML) and logistic regression (LR) for six patient outcomes.
(A) All-cause mortality: The areas under the curve (AUC) for all-cause mortality by prediction models with ML and LR were 0.900 and 0.881, respectively, and the risk factor with the strongest impact was albumin (impact 100, reference) for both models, followed by hemoglobin (78) and age (36) in ML and LR models, respectively. (B) Cardiovascular events: AUC 0.848 and 0.831, respectively, the risk factor with the strongest impact was heart failure and acute coronary syndrome (ACS) (both, 100, reference), followed by ACS (76) and heart failure (57), respectively. (C) Heart failure events: AUC 0.912 and 0.907, respectively, the risk factor with the strongest impact was left ventricular ejection fraction and left ventricular dimension at end-systole (both, 100, reference), respectively, followed by heart failure (93) and diuretics (77), respectively. (D) ACS: AUC 0.879 and 0.884, respectively, the risk factor with the strongest impact was ACS (100, reference) for both models, followed by anti-platelet for both models (26 and 23 in ML and LR, respectively). (E) Ischemic stroke events: AUC 0.758 and 0.757, respectively, the risk factor with the strongest impact was a history of ischemic stroke or transient ischemic attack (100, reference) for both models, followed by deceleration time (98) and systolic blood pressure (52), respectively. (F) Intracranial hemorrhage: AUC 0.753 and 0.726, respectively, the risk factor with the strongest impact was a history of intracranial hemorrhage (100, reference) for both models, followed by warfarin (65) and tricuspid regurgitation (79) in models with ML and LR, respectively.
Fig 2Relationships between parameters and incidence probability for six patient outcomes.
The associations between the top five parameters by machine learning models and incidence probability of six patient outcomes are shown. The incidence probability was determined as partial dependence in each model. (A) All-cause mortality, (B) Cardiovascular events, (C) Heart failure events, (D) Acute coronary syndrome events, (E) Ischemic stroke events, and (F) Intracranial hemorrhage events.