| Literature DB >> 35449883 |
Cheuk To Chung1, George Bazoukis2,3, Sharen Lee1, Ying Liu4, Tong Liu5, Konstantinos P Letsas6, Antonis A Armoundas7,8, Gary Tse1,4,5,9.
Abstract
Ventricular arrhythmias (VAs) and sudden cardiac death (SCD) are significant adverse events that affect the morbidity and mortality of both the general population and patients with predisposing cardiovascular risk factors. Currently, conventional disease-specific scores are used for risk stratification purposes. However, these risk scores have several limitations, including variations among validation cohorts, the inclusion of a limited number of predictors while omitting important variables, as well as hidden relationships between predictors. Machine learning (ML) techniques are based on algorithms that describe intervariable relationships. Recent studies have implemented ML techniques to construct models for the prediction of fatal VAs. However, the application of ML study findings is limited by the absence of established frameworks for its implementation, in addition to clinicians' unfamiliarity with ML techniques. This review, therefore, aims to provide an accessible and easy-to-understand summary of the existing evidence about the use of ML techniques in the prediction of VAs. Our findings suggest that ML algorithms improve arrhythmic prediction performance in different clinical settings. However, it should be emphasized that prospective studies comparing ML algorithms to conventional risk models are needed while a regulatory framework is required prior to their implementation in clinical practice.Entities:
Keywords: Artificial intelligence; Machine learning; Prediction models; Risk stratification; Ventricular arrhythmias; Ventricular fibrillation; Ventricular tachycardia
Year: 2022 PMID: 35449883 PMCID: PMC9020640 DOI: 10.1186/s42444-022-00062-2
Source DB: PubMed Journal: Int J Arrhythmia ISSN: 2466-0981
Fig. 1First decision tree of the random forest model predicting risk of atrial fibrillation [15]
Fig. 2Seven-layered optimal architecture of the CNN model predicting risk of atrial fibrillation [16]
Fig. 3LSTM recurrent network architecture detecting arrhythmia on imbalanced ECG datasets [17]
Summary of the baseline characteristics and the outcomes reported in ML studies provide data for risk stratification purposes
| First author | Year of publication | Enrollment period | Number of patients | Age (years) | Males (%) | Type of clinical setting | Machine learning technique | Performance | Type of predicted arrhythmia | Features used for the prediction model |
|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||
| Alis [ | 2020 | 2014–2019 | 64 | 48.13 ± 13.06 | 65.6 | HCM | SVM, naïve Bayes, RF, k-NN | Accuracy SVM: 82.8% Naïve Bayes: 83.3% RF: 82.8% k-NN: 93.8% | Ventricular tachyarrhythmia | mean LGE% |
| Bhattacharya [ | 2019 | Mean duration: 2.86 years | 711 | No VA: 54 ± 15 Yes VA: 49 ± 16 | 61.0 | HCM | ensemble of logistic regression and native Bayes classifiers | Sens: 73%, spe: 76%, C-index = 0.83 | Sustained VT, VF | 22 features identified as predictive of VAr by the HCM-VAr-Risk Model |
| Lyon [ | 2018 | N/A | 123 | 47 ± 15 | 66.0 | HCM | Density-based clustering algorithm | N/A | VA | Data from 12 lead ECG holter |
| Smole [ | 2021 | N/A | 2302 | 46 ± 19 | 62.9 | HCM | RF, XGBoost, SVM, NN | Accuracy RF: 72% SVM (linear): 69% Bosted trees: 75% NN: 74% F1 score RF: 0.68 SVM (linear): 0.63 Boosted trees: 0.72 NN: 0,68 | VT | demographic characteristics, genetic data, clinical investigations, medications and disease-related events |
|
| ||||||||||
| Kotu [ | 2015 | N/A | 54 | N/A | N/A | Post-myocardial infarction patients | K-NN, SVM, decision tree, RF | SVM classifier provided average accuracy of 92.6% and AUC of 0.921 | VA | Quantitative discriminative features extracted from LGE-CMR image |
| Rogers [ | 2021 | N/A | 42 | 64.7 ± 13.0 | 97.6 | Ischemic CMP | SVM, CNN | Accuracy of the SVM: 83.2% Accuracy of convolutional NN: 56.7% | VT/VF | Ventricular monophasic action potentials |
| Okada [ | 2021 | 2003–2015 | 122 | 60 ± 11 | 87.0 | Ischemic CMP | SVM | Accuracy: 81% Correct classification: 86% NPP: 91% | VA | LGE-CMR data |
| Au-yeung [ | 2018 | NA | 788 | 60 | 77.3 | HF | RF, SVM | Both RF and SVM methods achieve a mean AUC of 0.81 for 5-min prediction and mean AUC of 0.87–0.88 for 10-second prediction | Ventricular tachyarrhythmia | VA onset prediction with heart rate variability data from ICD |
| Meng [ | 2019 | 2017–2019 | Retrospective: 500 Prospective: 1000 | N/A | N/A | HFrEF | Information gain ranking, decision trees, logistic regression, SVM, RF, ANN | NA (study protocol) | VT/VF | Demographic, clinical, biological, electrophysiological, social and psychological variables |
| Wu [ | 2020 | 2003–2015 | 382 | 57 ± 13 | 72.0 | HFrEF | RF | AUC: 0.88 | VA | clinical heart failure course, baseline CMR imaging metrics, levels of the interleukin-6 |
| Rocon [ | 2020 | 2011–2017 | 108 | 38.3 ± 15.5 | 48.1 | Non-compaction CMP | SVD impute, Parameter Selection Algorithm, sequential forward selection, distance-weighted k-NN | Accuracy: 75.5% Sens: 77% Spe: 75% | Major adverse cardiovascular events | LVEF (by CMRI), RV end systolic volume (by CMRI), RV systolic dysfunction (by echo) and RV lower diameter (by CMRI) |
|
| ||||||||||
| Marzec [ | 2018 | Minimum one year of observation | 235 | N/A | N/A | Patients with implantable electronic devices | Naïve, RF, decision tree analysis, k-NN, SVM | Accuracy Naïve: 74.5% RF: 76.6% Decision tree: 70.2% K-NN: 72.3% SVM: 74.5% F1 score Naïve: NA RF: 0.27 Decision tree: 0.13 K-NN: 0.00 SVM: NA | VT | Data about physical daily activities |
| Shakibfar [ | 2019 | N/A | 19,935 | N/A | N/A | ICD patients | RF | Accuracy: 96% AUC: 0.80 | VA (Electrical Storm) | ICD variables |
|
| ||||||||||
| Lee [ | 2021 | 1997–2017 | 516 | 50 ± 16 | 92.0 | Brugada syndrome | Nonnegative matrix factorisation, RF | Random survival forest Precision: 83.4% Recall: 85.3% F1 score: 84.3% Nonnegative matrix factorization Precision: 87.1% Recall: 88.8% F1 score: 87.9% | VT/VF, Brugada syndrome | Clinical and electrocardiographic data |
|
| ||||||||||
| Dilller [ | 2020 | 10 year observation period | 372 | 16.0 | 54.8 | Tetralogy of Fallot | Deep learning algorithms | N/A | VA prognosis, cardiac death, cardiac arrest | Prediction of adverse outcome using CMR data |
| Sun [ | 2021 | 2009 – 2019 | 269 | N/A | 35.7 | Following ASD closure in pediatric patients | RF, SVM, K-nearest neighbor, AdaBoost, decision tree | Accuracy k-NN: 78.9% Decision tree: 82.6% AdaBoost: 84.95% SVM: 89.3% RF: 94.7% AUC k-NN: 0.82 Decision tree: 0.80 AdaBoost: 0.75 SVM: 0.87 RF: 0.90 | Postoperative arrhythmias | Demographic characteristics, cardiac imaging data and blood exams |
|
| ||||||||||
| Sessa [ | 2020 | January 2015–December 2016 | 18,018 | 74.8 | 39.9 | Geriatric patients | Conditional inference tree | NA | VA | Specific medications |
| Taye [ | 2019 | N/A | N/A | N/A | N/A | Public databases | ANN classifiers | Accuracy: 98.6% Sens: 98.4% Spe: 99% | Ventricular tachyarrhythmia | QRS complex shape |
| Okada [ | 2019 | 2000–2017 | 76 | 53 ± 10 | 59.0 | Cardiac sarcoidosis | Supervised RF | Correct classification: 87% C-statistic: 0.91 | VA, atrioventricular block | Regional scar burden |
| Taye [ | 2020 | N/A | 78 | 20.7–75.3 | 80.8 | Spontaneous ventricular tachyar- rhythmia database | CNN, k-NN, ANN, SVM | Accuracy CNN: 84.6% ANN: 73.5% SVM: 67.9% k-NN: 65.9% AUC CNN: 0.78 ANN: 0.65 SVM: 0.63 k-NN: 0.62 | Ventricular tachyarrhythmia | Heart rate variability signals |
| Wu [ | 2020 | 2012–2016 | 508 | 30.83 ± 6.17 | 75.0 | Young hypertensive patients | Recursive feature elimination, extreme gradient boosting | C statistic: 0.757 | Composite endpoint including sustained VT/VF | Demographics, medical history, vital signs, echocardiography, polysomnography, blood exams |
| Bergau [ | 2018 | N/A | N/A | N/A | N/A | N/A | SVM | Sens: 85% Spe: 90% | Torsades de pointes | Gene expression differences |
| Chen [ | 2021 | N/A | 17 | N/A | N/A | N/A | SVM, RF, XGboost | RF model with an average precision of 99.99% and recall of 88.98% | VA | Heartbeat interval time series |
| Yap [ | 2004 | N/A | N/A | N/A | N/A | N/A | SVM, probabilistic NN, k-NN, decision tree | Accuracy SVM: 97.4% Probabilistic NN: 71.8% k-NN: 89.7% Decision tree: 38.5% | Torsade de pointes | Set of agents |
| Lee [ | 2016 | 2013–2015 | N/A | N/A | N/A | N/A | ANN | Sens: 88.2% Spe: 82.4% PPV: 83.3% NPV: 87.5% | VT one hour prior to its occurrence | Heart rate variability and respiratory rate variability |
HCM, hypertrophic cardiomyopathy; CMR, cardiac magnetic resonance; LGE, late gadolinium enhancement; VT, ventricular tachycardia; VA, ventricular arrhythmias; VF, ventricular fibrillation; SVM, support vector machine; SCD, sudden cardiac death; ANN, artificial neural network; HFrEF, heart failure with reduced ejection fraction; CMP, cardiomyopathy; ICD, implantable cardioverter-defibrillator; ASD, atrial septal defect; Sens, sensitivity; Spe, specificity; RF, random forest; CNN, convolutional neural network