| AI and Risk Stratification Modeling |
| [10], 2017—1a | Location: United KingdomAim: Predicting the first CVD event over 10-years and comparing that with the American College of Cardiology guidelines.Variables: Routine clinical data from family practicesStrengths: Prospective; large sample sizeLimitations: Unbalanced datasetFindings: Highest achieving algorithm was NN: AUC 0.76, predicted 4998/7404 cases (sensitivity 67.5%, PPV 18.4%) and 53,458/75,585 non-cases (specificity 70.7%, NPV 95.7%), correctly predicting 7.6% more patients than the established algorithm | 378,256 | RF, LR, GBM, NN |
| [12], 2017—1a | Location: United StatesAim: Predict six cardiovascular outcomes in comparison to standard risk scores.Variables: 735 variables from imaging and non-invasive tests, questionnaires, and biomarker panelsStrengths: Prospective; included participants from the MESA (Multi-Ethnic Study of Atherosclerosis) [22]; 12-year follow-up; four ethnicitiesLimitations: Potential cause for biases due to imputation procedureFindings: Age was the most important predictor for all-cause mortality. Fasting glucose levels and carotid ultrasonography measures were important predictors of stroke. CAC was the most important predictor of coronary heart disease and all atherosclerotic cardiovascular disease combined outcomes. Left ventricular structure and function and cardiac troponin-T were among the top predictors for incident heart failure. Creatinine, age, and ankle-brachial index were among the top predictors of AF. TNF-α and IL-2 soluble receptors and NT-proBNP levels were important across all outcomes.Notable facts: ML in conjunction with deep phenotyping improves prediction accuracy in cardiovascular event prediction in an initially asymptomatic population. | 6814 | RF |
| [11], 2019—1a, 1b | Location: United StatesAim: Predicting of long-term risk of MI and cardiac death in asymptomatic subjects by integrating clinical parameters with CAC, and automated EAT quantification.Variables: Clinical co-variates, lipid panel, risk factors, CAC, aortic calcium, and automated EAT measuresStrengths: Prospective; subjects from EISNER trial [23]; 14.5 years follow-upLimitations: Unbalanced dataFindings: AUC 0.82; Subjects with a higher ML score had high hazard of suffering events (HR: 10.38, p < 0.001); the relationships persisted in multivariable analysis including ASCVD-risk and CAC measures (HR: 2.94, p = 0.005). Age, ASCVD-risk, and CAC were prognostically important for both genders. Notable facts: ML used to integrate clinical and quantitative imaging-based variables significantly improves prediction of MI and cardiac death. | 1912 | XGBoost |
| [14], 2017—>1a | Location: ChinaAim: Identifying the association between the clinical reference range of serum HbA1c and TSH, and the risk of CAD in non-diabetic and euthyroid patients.Variables: HbA1c and TSH levelsStrengths: Prospective; 10-year follow-upLimitations: Small sample sizeFindings: Baseline HbA1c and TSH within the reference range were positively associated with CAD risk. No correlation and interaction between the baseline HbA1c and TSH for the development of CAD. The combination of these baselines showed sensitivity of 87.2%, specificity of 92.7%, and accuracy of 92.3% for identifying the participants who will later develop CAD. | 538 | SVM |
| [107], 2018—1a | Location: LebanonAim: Comparing ANN-based prediction models to the other risk models being used in practice (the Diamond–Forrester and the Morise models).Variables: Imaging-based stress test measuresStrengths: ProspectiveLimitations: Small sample sizeFindings: Compared to other models, the ANN model had higher discriminatory power (DP) (1.61) for predicting ischemia, 98% negative predictive value, 91% sensitivity, 65% specificity, 26% positive predictive value, and a potential 59% reduction of non-invasive imaging. | 486 | ANN |
| [28], 2018—1b, 3a | Location: United Kingdom, Italy, NorwayAim: Discriminating between healthy and HFpEF subjects with impaired functional reserve and identifying new descriptors to better characterize HFpEF syndrome using basal myocardial long-axis velocity patterns at rest and exercise.Variables: Left ventricular long-axis myocardial velocity patternsStrengths: Prospective, 6–60 months survival analysisLimitations: Confounding effects (age, gender) not studied, small sample sizeFindings: ML-diagnostic zones differed for age, body mass index, six-minute walk distance, B-type natriuretic peptide, and left ventricular mass index. Correlation with diagnosis was 72.6%; ML identified 6% of healthy controls as HFpEF. Blinded reinterpretation of imaging from subjects with discordant clinical and ML diagnoses revealed abnormalities not included in diagnostic criteria. | 156 | Clustering |
| [71], 2015—1b, 3a | Location: United StatesAim: Identify phenotypically distinct HFpEF categories.Variables: Clinical, laboratory, ECG, and echocardiographic phenotyping(phenomapping)Strengths: ProspectiveFindings: Phenomapping classified study participants into three risk-stratified groups. Notable facts: A novel classification of HFpEF using phenomapping that can define therapeutically homogeneous patient subclasses. | 397 | Clustering |
| [16], 2019—1a, 3a, 4b | Location: United StatesAim: Predicting survival after echocardiography.Variables: 90 cardiovascular-relevant ICD-10 codes, age, sex, height, weight, heart rate, blood pressures, LDL, HDL, smoking, physician-reported EF, 57 echocardiographic measurementsStrengths: Large sample sizeLimitations: Retrospective, model derivation from EHR data missing important variablesFindings: Overall AUC > 0.82 over common clinical risk scores. RF outperformed LR. RF including all echocardiographic measurements yielded the highest prediction accuracy. Ten variables needed to achieve 96% maximum prediction accuracy, six from echocardiography. | 171,510 | RF |
| [17], 2019—3a, 3b | Location: United StatesAim: Using ML to develop a model of vessel features to discriminate between patients with and without subsequent death or cardiovascular events and comparing to CAD-RADS.Variables: Four CTA features for each of the sixteen coronary segmentsStrengths: Comparing four different ML methodsLimitations: Low MI incidence leading to possible misclassification biasFindings: ML all-cause mortality AUC = 0.77; ML CAD deaths AUC = 0.85. For starting statin therapy (NNT = 45), use of ML score ensures 93% of patients with events will be administered the drug; compared to 69% with CAD-RADS.Notable facts: Compared to CAD-RADS, ML better discriminated patients who subsequently experienced an adverse event from those who did not. | 6892 | Best models: bootstrap-aggregated DTE, KNN, |
| [13], 2018—1a, 1c | Location: United StatesAim: Developing a risk calculator for CAD incidence to aid initiation of statin therapy.Variables: Same as ACC/AHA risk calculatorStrengths: Model training by 13-year follow-up data from MESA cohort [22] and validation by FLEMENGHO cohort [108] Limitations: RetrospectiveFindings: ML Risk Calculator recommended only 11.4% to take statin, and only 14.4% of “Hard CVD” events occurred in those not recommended statin, resulting in sensitivity 0.86, specificity 0.95, and AUC 0.92. Notable facts: ML Risk Calculator outperformed the ACC/AHA Risk Calculator by recommending less drug therapy yet missing fewer events. | 10,291 | SVM |
| [109], 2019—1a, 1b | Location: IranAim: Compare ANN and SVM algorithms for predicting CAD.Variables: 25 variables affecting CAD including laboratory valuesStrengths: Data collected from three hospitalsLimitations: Retrospective; no detail provided regarding missingness, or lack thereofFindings: SVM model had higher AUC, higher sensitivity, higher Hosmer–Lemeshow test’s result and lower MAPE compared to ANN. Variables affecting CAD yielded better goodness of fit in SVM model and provided more accurate result than ANN. | 1324 | ANN, SVM |
| AI-enabled Diagnostic Studies |
| [76], 2016—3b | Location: Multi-nationalAim: Predicting five-year all-cause mortality in patients undergoing CCTA and comparing to existing prediction algorithms.Variables: 25 clinical and 44 CCTA parameters, SSS, SIS, DI, number of segments with non-calcified, mixed or calcified plaques, age, sex, gender, standard cardiovascular risk factors, and FRSStrengths: Data from CONFIRM registry [110]; large sample sizeLimitations: Selection bias; only LogitBoost was evaluated for efficacy.Findings: ML exhibited a higher area-under-curve compared with the FRS or CCTA severity scores alone (SSS, SIS, DI) for predicting all-cause mortality (ML: 0.79 vs. FRS: 0.61, SSS: 0.64, SIS: 0.64, DI: 0.62; p < 0.001).Notable facts: ML combining clinical and CCTA data was found to predict five-year all-cause mortality significantly better than existing clinical or CCTA metrics alone. | 10,030 | LogitBoost |
| [77], 2019—3a, 3b | Location: KoreaAim: Developing an angiography-based supervised ML algorithm with five-fold cross-validation to classify coronary lesions based on fractional flow reserve (≤0.80 vs. >0.80).Variables: 24 computed angiographic features based on the diameter plot and four clinical features (age, sex, body surface area, and involve segment)Strengths: Randomized controlled trial; external validation in 79 patientsLimitations: Data, analytic methods, and study materials not available to other researchers; model limited to left main disease, side branch, and diffuse and tandem lesions Findings: ML model predicted fractional flow reserve ≤ 0.80 with overall diagnostic accuracy of 78% (AUC = 0.84). Using 12 main angiography features, the ML predicted fractional flow reserve ≤ 0.80 in the test set with sensitivity of 84%, specificity of 80%, and overall accuracy of 82% (AUC = 0.87). The averaged diagnostic accuracy in bootstrap replicates was 81% (AUC = 0.87). External validation showed accuracy of 85% (AUC = 0.87). | 1501 | XGBoost |
| [39], 2017—2a, 2b | Location: CanadaAim: Automating the diagnosis of STEMI at the time of first contact with healthcare system and pre-hospital CCL activation.Variables: ECG reading dataLimitations: Retrospective analysis of real-time automated diagnosis; only ECG data used; small sample sizeFindings: Algorithm modification resulted in a 42% relative decrease in the rate of inappropriate activations (12% vs. 7%) without a significant effect on treatment delay. | 466 | Automated STEMI diagnosis and “physician-less” CCL activation |
| [41], 2019—2b, 2c, 2e | Location: JapanAim: Making an AI prediction model for the need for urgent revascularization from 12-lead ECG in patients presenting with chest pain in the ER.Variables: ECG reading dataLimitations: Retrospective; only ECG data used, small sample sizeFindings: Predictive value of the c-statistics 0.88 (95% CI 0.84–0.93) for detecting patients who required urgent revascularization. | 362 | LSTM |
| AI in Outcome Prediction/Prognosis |
| [89], 2017—3b | Location: United KingdomAim: Predicting patient survival in pulmonary hypertension using 3D patterns of systolic cardiac motion.Variables: Conventional imaging; hemodynamic, functional, and clinical markers; 3D motion pattern of right ventricleStrengths: ProspectiveLimitations: Limited patient selection including non-congenital cases of PH. Model trained to measure excursion rather than contractility.Findings: Survival prediction AUC 0.73; difference in median survival time between high- and low-risk groups was 13.8 years. | 256 | Supervised ML using nested multivariable risk prediction |
| [111], 2019—3a | Location: United StatesAim: Testing generalizability and precision in imaging biomarker analysis by comparing scan:rescan data.Variables: MR-measured left ventricular chamber volumes, mass, and ejection fractionStrengths: ProspectiveLimitations: Data from five institutions, but scans performed at the same institution; one-week interval between scans limited the ability to assess long-term changesFindings: Expert, trained junior, and automated scan:rescan precision were similar (coefficient of variation 6.1 vs. 8.8). Automated analysis was 186× faster than humans. | 110 | CNN |
| [82], 2017—3b | Location: SwedenAim: Predicting two-year survival vs. non-survival after first MI.Variables: 39 survival predictorsStrengths: Large sample sizeLimitations: RetrospectiveFindings: SVM had the highest performance (AUC = 0.845, PPV = 0.280, NPV = 0.966) outperforming Boosted C5.0 (AUC = 0.841), but not significantly higher than LR or RF. Models converged to the point of algorithm indifference with increased sample size and predictors. | 51,943 | SVM, RF, LR, Boosted C5.0 |
| [86], 2018—3b, 4b | Location: SwedenAim: Using mixture of supervised and unsupervised approach to predict outcome and identify distinct phenotypes of heart failure.Variables: Demographic, clinical, laboratory, and medication dataStrengths: Large sample sizeLimitations: RetrospectiveFindings: RF demonstrated excellent calibration and discrimination for survival (C-statistic = 0.83) whereas LVEF did not (C-statistic = 0.52). Cluster analysis using the eight highest predictive variables identified four clinically relevant subgroups of HF with marked differences in one-year survival. | 44,886 | RF, K-means clustering |
| [79], 2017—3b | Location: United StatesAim: Modeling all-cause in-hospital mortality in women admitted with STEMI.Variables: 11 variables for LR; 32 variables for full RF model; 17 variables for reduced RF model Strengths: Model validation using external cohort of 13,361 patientsLimitations: Retrospective; class imbalance (in-hospital mortality in 11% of patients)Findings: Internal validation C-index was 0.84, 0.81, and 0.80 for the LR, full, and reduced RF models, respectively. External validation C-index was 0.84, 0.85, and 0.81 for year 2011, and 0.82, 0.81, and 0.81 for the year 2013 for the LR, full, and reduced RF models, respectively.Notable facts: RF was comparable to LR in predicting in-hospital mortality in women with STEMI. | 12,047 | LR and RF |
| [84], 2019—3b | Location: KoreaAim: DL-based risk stratifying mortality of patients with acute MI.Variables: Initial demographic and laboratory dataStrengths: Large sample size; data from the Korean working group of myocardial infarction registry (network of 59 hospitals)Limitations: RetrospectiveFindings: AUC for STEMI = 0.905. AUC for NSTEMI = 0.870. DL predicted 30.9% of patients more accurately than conventional scores. During the six-month follow-up, the DL-defined high-risk group had a significantly higher mortality rate than the low-risk group (17.1% vs. 0.5%). | 22,875 | DL, LR, RF |
| [58], 2019—2d, 3b | Location: ChinaAim: Identify in-hospital cardiac arrest in hospitalized patients with acute coronary syndrome.Variables: Seven explanatory variables: VitalPAC Early Warning Score (ViEWS), fatal arrhythmia, Killip class, cardiac troponin I, blood urea nitrogen, age, and diabetesLimitations: Possibility of selection biasFindings: Sensitivity = 0.762; Specificity = 0.882; AUC = 0.844; a 10-fold cross-validated risk estimate = 0.198; optimism-corrected AUC = 0.823.Notable facts: The developed DT model may provide healthcare workers with a practical bedside tool and could positively impact decision-making in deteriorating patients with ACS. | 656 | DT |
| [78], 2019—3b | Location: United StatesAim: Identify patients at risk of death or CHF rehospitalization after PCI.Variables: 52 features at admission to predict in-hospital mortality; 358 features at discharge to predict CHF readmissionStrengths: Large sample sizeLimitations: Retrospective; high missingness level in certain features causing high data sparsityFindings: RF prediction of in-hospital mortality AUC = 0.925. RF outperformed LR for predicting 30-day CHF readmission (AUC: 0.90 vs. 0.85) and 180-day cardiovascular death (AUC: 0.88 vs. 0.81). | 11,709 | RF |
| [88], 2019—3b | Location: KoreaAim: Developing and validating a deep-learning-based AI algorithm for predicting mortality of acute HF.Variables: Demographics, treatment and medication, laboratory, ECG and echocardiography findings, final diagnosis, clinical outcome during hospital stay, and 12-month prognosisStrengths: Multi-center study; large sample sizeLimitations: RetrospectiveFindings: AUC of the DL was 0.880 for predicting in-hospital mortality, which outperformed other machine learning models. For predicting 12- and 36-month endpoints, DL had an AUC of 0.782 and 0.813, respectively. During the 36-month follow-up, the high-risk group, defined by the DL, had a significantly higher mortality rate than the low-risk group. | 6924 | DNN, RF, LR, SVM, BN |
| [53], 2019—2c, 2d | Location: KoreaAim: Using ML to predict ACS requiring revascularization in patients presenting with early-stage angina-like symptoms.Variables: 20 features relevant to ACSStrengths: Large sample sizeLimitations: Retrospective; inaccuracy in checking the vulnerable plaque burden of all coronary arteriesFindings: AUC = 0.860 for the prediction of ACS requiring revascularization. A reliable prediction of 2.60% of non-ACS patients was made with a specificity of 1.0 to only receive medical therapy. | 5882 | SVM, LDA |
| [87], 2019—3b | Location: United StatesAim: Using a ML algorithm to predict mortality in HF patients.Variables: Eight variables: diastolic blood pressure, creatinine, blood urea nitrogen, hemoglobin, white blood cell count, platelets, albumin, and red blood cell distribution widthStrengths: Large sample sizeLimitations: Retrospective; selection bias due to excluding significant number of patients with missingnessFindings: The risk score developed by DT accurately discriminated between low and high-risk of death with an AUC of 0.88. External validation in two separate HF populations gave AUCs of 0.84 and 0.81. | 5822 | DT |
| [83], 2019—3b | Location: United KingdomAim: Predicting long-term mortality after ACS using laboratory values. Variables: Hematological indices and inflammation markersStrengths: Large sample sizeLimitations: Imputation for the ML was performed using mean of all observations, the latter is typically not ideal since missing in EHR data tend to be not-at-randomFindings: The model achieved a c-statistic of 0.89 for in-hospital mortality. C-statistic was 0.77 for six-month mortality. Red cell distribution width (HR 1.23) and neutrophil to lymphocyte ratio (HR 1.08) showed independent association with all-cause mortality in multivariable Cox regression. | 5053 | XGBoost |
| [85], 2019—3b | Location: ChinaAim: Developing a DL model to predict major adverse cardiac events after ACS.Variables: 232 static feature types and 2194 dynamic feature types.Strengths: Large sample size; comparison to previous modelsLimitations: Retrospective; missing values (up to 30%) were imputed using median of all the observations; variables with more than 30% missing were excludedFindings: The best model presented had an AUC of 0.713 and an accuracy of 0.764.Notable facts: The proposed model adapted to leverage dynamic treatment information in EHR data boosted the performance of major adverse cardiac event prediction for ACS. | 2930 | RNN |
| [80], 2017—3b | Location: IsraelAim: Predicting mortality at 30-days in STEMI patients and to compare these to the conventional validated risk scores.Variables: 54 variables; performance of most models plateaued with 15 variablesStrengths: Large sample sizeLimitations: RetrospectiveFindings: ML models AUC range: 0.64 to 0.91. The best models had similar or better performance compared to standard scoring methods. Top predictors were creatinine, Killip class on admission, blood pressure, glucose level, and age.Notable facts: The algorithms selected showed competence in prediction across an increasing number of variables. | 2782 | NB, DT, LR, rules-based classification tree, RF, Adaptive Boosting |
| [112], 2018—3a | Location: CanadaAim: Assessing the prognostication of NN in HF patients using CPET data as opposed to using summary indicators alone.Variables: Detailed CPET dataStrengths: Using various ML modelsLimitations: RetrospectiveFindings: NN incorporating breath-by-breath data achieved the best performance (AUC = 0.842). All models outperformed summary indices (AUC ≤ 0.800). When compared with the CPET risk score (AUC = 0.759), the top-performing model obtained a net reclassification index of 4.9%.Notable facts: The current practice of considering summary indices in isolation fails to realize the full value of CPET data. Higher data resolution leads to improved prediction. | 1434 | LASSO, NN |
| [81], 2020—3b | Location: ChinaAim: Using ML to predict one-year mortality rate of anterior STEMI patients and comparing to conventional risk scores.Variables: 59 features; including all features as opposed to top 20 provided better performanceStrengths: Using six different ML algorithmsLimitations: RetrospectiveFindings: AUC of ML models ranged from 0.709 to 0.942. XGBoost achieved the highest accuracy (92%), specificity (99%) and f1 score (0.72) for predictions with the full variable model. After feature selection, XGBoost still obtained the highest accuracy (93%), specificity (99%) and f1 score (0.73). | 1244 | NB, LR, KNN, DT, RF and XGBoost |
| [105], 2019—4b | Location: United StatesAim: Using ML on EHR data to predict CRT outcome.Variables: Demographics, laboratory values, medications, clinical characteristics, and past health services utilization, bigrams (i.e., two-word sequences) in EHR dataStrengths: Comparing various ML modelsLimitations: No distinction between the type of CRT implant.Findings: The final model identified 26% of patients having a reduced benefit from the CRT device at a PPV of 79% (model performance: Fβ (β = 0.1): 77%; recall 0.26; precision 0.79; accuracy 0.65).Notable facts: A ML model that leveraged readily available EHR data and clinical notes identified a subset of CRT patients who may not benefit from CRT before the procedure. | 990 | LR, SVM, RF and GBM |
| [113], 2019—1a | Location: JapanAim: Assessing stroke risk by ML using integrated risk factors. Variables: 47 features comprised of 13 conventional risk factors and 34 carotid ultrasound image-based phenotypes (carotid intima-media thickness, carotid plaque and carotid artery stenosis)Strengths: Using integrated risk factorsLimitations: Retrospective; small sample size; data imbalance (12 high-risk patients vs. 190 low-risk patients)Findings: ML with integrated risk factors (AUC = 0.80) showed an improvement of ~18% against conventional ML (AUC = 0.68).Notable facts: ML model integrated with the event-equivalent gold standard as percentage stenosis is powerful and offers low cost and high-performance stroke risk assessment. | 202 | RF |
| AI in Treatment Strategies |
| [99], 2018—3a, 4a | Location: Multi-nationalAim: Using ML to phenotypically classify a heterogeneous HF cohort and aid in optimizing the rate of responders to specific therapies.Variables: 50 variables including clinical parameters, biomarker values, and measures of left and right ventricular structure and functionStrengths: Data from MADIT-CRT trial [114]; randomized cohortLimitations: Possibility of selection bias; results confined to a selected population of HF patients enrolled in a clinical trial with robust inclusion/exclusion criteriaFindings: Four phenogroups identified, significantly different in the primary outcome occurrence. Two phenogroups included a higher proportion of known clinical characteristics predictive of CRT response and were associated with a substantially better treatment effect of CRT-D on the primary outcome (HR = 0.35 and HR = 0.36) than observed in the other groups.Notable facts: By integrating clinical parameters and full heart cycle imaging data, unsupervised ML can provide a clinically meaningful classification of a phenotypically heterogeneous HF cohort and might aid in optimizing the rate of responders to specific therapies. | 1106 | Multiple Kernel Learning, K-means clustering |
| [104], 2019—4b | Location: United StatesAim: Develop and compare ML models to predict response to CRT.Variables: Nine variables; QRS morphology, QRS duration, New York Heart Association classification, left ventricular ejection fraction and end-diastolic diameter, sex, ischemic cardiomyopathy, AF, and epicardial left ventricular leadStrengths: Multi-center study comparing various ML modelsLimitations: RetrospectiveFindings: The best ML model was a naïve Bayes classifier. On the testing cohort, ML demonstrated better response prediction than guidelines (AUC 0.70 vs. 0.65) and greater discrimination of event-free survival (concordance index, 0.61 vs. 0.56). The fourth quartile of the ML model had the greatest risk of reaching the composite end point, whereas the first quartile had the least (hazard ratio, 0.34). | 925 | Supervised ML |
| [106], 2018—4b | Location: United StatesAim: Using ML to predict all-cause mortality or heart failure hospitalization 12 months post-CRT.Variables: 45 features: demographics, physical characteristics, heart failure, LV assessment, ECG, medical history, medication classStrengths: Used data from COMPANION trial [115]Limitations: Possibility of selection bias; only class III and IV HF patients enrolled with specific inclusion/exclusion criteriaFindings: RF model produced quartiles of patients with an eight-fold difference in survival between those with the highest and lowest predicted probability for events (hazard ratio, 7.96). The model discriminated the risk of the composite end point of all-cause mortality or heart failure hospitalization better than conventional methods. | 1076 | Multiple models with RF producing best results |
| AI-enabled Diagnostic Imaging Studies |
| [24], 2018—1b | Location: United StatesAim: Determining the diagnostic performance of cPSTA in assessing CAD in patients presenting with chest pain who had been referred by their physician for coronary angiography.Variables: cPSTA recorded signalsStrengths: ProspectiveLimitations: Small sample sizeFindings: The machine-learned algorithm had a sensitivity of 92% and specificity of 62% on blind testing in the verification cohort. The NPV was 96%.Notable facts: Resting cPSTA may have comparable diagnostic utility to functional tests currently used to assess CAD without requiring cardiac stress (exercise or pharmacological) or exposure of the patient to radioactivity. | 606 | Elastic net |
| [25], 2018—1b, 3a | Location: Multi-nationalAim: Predicting lesion-specific ischemia by invasive FFR using an integrated ML ischemia risk score from quantitative plaque measures from CCTA.Variables: Quantitative CTA data: stenosis, NCP, low-density NCP (LD-NCP), calcified and total plaque volumes, contrast density difference (maximum difference in luminal attenuation per unit area) and plaque lengthStrengths: Multi-center data from NXT trial [116]Limitations: Small sample size; plaque findings were not confirmed by invasive intravascular ultrasoundFindings: Information gain for predicting ischemia was highest for contrast density difference (0.172), followed by LD-NCP (0.125), NCP (0.097), and total plaque volumes (0.092). ML had higher AIUC (0.84) than individual CTA measures, including stenosis (0.76), LD-NCP volume (0.77), total plaque volume (0.74) and pre-test likelihood of CAD (0.63). | 254 | LogitBoost |
| [15], 2020—1a | Location: Multi-nationalAim: Evaluate the prognostic value of fully automated DL-based EAT volume and attenuation quantified from non-contrast cardiac CT. Variables: Non-contrast cardiac CT scan data, inflammatory biomarkersStrengths: Data from the EISNER trial [23]Limitations: Long-term follow-up not obtainedFindings: Increased EAT volume and decreased EAT attenuation were independently associated with MACE. CAD risk score, CAC, and EAT volume were associated with increased risk of MACE (hazard ratio: 1.03, 1.25, and 1.35). EAT attenuation was inversely associated with MACE (hazard ratio: 0.83, Harrell C statistic: 0.76). MACE risk progressively increased with EAT volume ≥ 113 cm3 and CAC ≥ 100 AU; highest in subjects with both. EAT volume correlated with inflammatory biomarkers; EAT attenuation inversely related to inflammatory biomarkers. | 2068 | DL |
| [117], 2018—1a | Location: Multi-nationalAim: Investigating whether a ML score, using only plaque stenosis and composition information from the 16 coronary segments, has better predictive accuracy compared to the traditional CCTA based risk scores.Variables: 16 segment based coronary stenosis (0%, 1–24%, 25–49%, 50–69%, 70–99% and 100%) and composition (calcified, mixed and non-calcified plaque) derived from CCTAStrengths: Data from CONFIRM registry [110]Findings: ML-based approach showed better AUC for event discrimination (0.771) vs. other scores (ranging from 0.685 to 0.701). Improved risk stratification was the result of down-classification of risk among patients that did not experience events (non-events). | 8844 | XGBoost |
| [26], 2018—1b, 1c, 2c | Location: Multi-nationalAim: Evaluating DL-based automatic prediction of obstructive disease from MPI, compared with TPD.Variables: MPI recorded dataStrengths: Multi-center studyLimitations: Retrospective; degree of stenosis from invasive angiography was interpreted visuallyFindings: AUC for DL was higher than for TPD (per patient: 0.80 vs. 0.78; per-vessel: 0.76 vs. 0.73). Sensitivity per patient improved from 79.8% (TPD) to 82.3% (DL), and per-vessel sensitivity improved from 64.4% (TPD) to 69.8% (DL). | 1018 | DCNN |
| [52], 2018—2c, 2d | Location: United StatesAim: Evaluating the effectiveness of using Computer-Aided Diagnosis in the triage of low to intermediate risk emergency chest pain patients with CCTA.Variables: Data from 64 and 320 slice CCTA scannersStrengths: Looking at 30-day outcomeLimitations: RetrospectiveFindings: Sensitivity: 85%; specificity: 50.6% and 56.5% for the 64 and 320 slice scanners. NPV: 97.8 and 97.1 for the 64 and 320 slice scanners. AUC: 0.6794 and 0.7097 for the 64 and 320 slice scanners. Software unable to read 18% of the cases. | 923 | Computer aided diagnosis software |
| [118], 2018—2c | Location: Multi-nationalAim: Improving diagnostic performance of CTA to potentially reducing the number of unnecessary referrals for invasive coronary angiography.Variables: 28 variables from CTA dataStrengths: Multi-centerLimitations: Retrospective; possibility of selection bias due to the inclusion of patients with the disease onlyFindings: ML-FFR (AUC = 0.84) and CFD-FFR (AUC = 0.84) outperformed visual CTA (AUC = 0.69). Per-vessel and per-patient diagnostic accuracy improved 78% and 85%, respectively. ML-FFR correctly reclassified 73% of false-positive CTA results.Notable facts: On-site ML-FFR improves the performance of CTA by correctly reclassifying hemodynamically nonsignificant stenosis and performs equally well as CFD-FFR. | 351 | NN |
| [27], 2017—1b, 1c | Location: United StatesAim: Evaluating the incremental benefit of ML-powered resting myocardial CTP over coronary CT stenosis for predicting ischemiaVariables: CCTA and FFR dataStrengths: Data from DeFACTO study [119]Limitations: Small sample sizeFindings: Accuracy, sensitivity, specificity, PPV, and NPV of resting CTP were 68.3%, 52.7%, 84.6%, 78.2%, and 63.0%, respectively, for predicting ischemia. Addition of resting CTP improved discrimination (AUC = 0.75) and reclassification (net reclassification improvement: 0.52) of ischemia compared to CT stenosis alone (AUC = 0.68).Notable facts: The addition of resting CTP analysis acquired from ML techniques may improve the predictive utility of significant ischemia over coronary stenosis. | 252 | Gradient boosting classifier |