| Literature DB >> 35154361 |
Ashir Javeed1, Shafqat Ullah Khan2, Liaqat Ali3, Sardar Ali4, Yakubu Imrana5,6, Atiqur Rahman7.
Abstract
One of the leading causes of deaths around the globe is heart disease. Heart is an organ that is responsible for the supply of blood to each part of the body. Coronary artery disease (CAD) and chronic heart failure (CHF) often lead to heart attack. Traditional medical procedures (angiography) for the diagnosis of heart disease have higher cost as well as serious health concerns. Therefore, researchers have developed various automated diagnostic systems based on machine learning (ML) and data mining techniques. ML-based automated diagnostic systems provide an affordable, efficient, and reliable solutions for heart disease detection. Various ML, data mining methods, and data modalities have been utilized in the past. Many previous review papers have presented systematic reviews based on one type of data modality. This study, therefore, targets systematic review of automated diagnosis for heart disease prediction based on different types of modalities, i.e., clinical feature-based data modality, images, and ECG. Moreover, this paper critically evaluates the previous methods and presents the limitations in these methods. Finally, the article provides some future research directions in the domain of automated heart disease detection based on machine learning and multiple of data modalities.Entities:
Mesh:
Year: 2022 PMID: 35154361 PMCID: PMC8831075 DOI: 10.1155/2022/9288452
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1Anatomy of the heart [6].
Figure 2Types of heart diseases.
Figure 3Different modalities used for automated heart failure diagnosis.
Figure 4Overview of ML-based diagnostic system.
Figure 5Selected research articles published from 1996 to 2021 as shown in Figure 5(a). The topic has gradually attracted the attention of researchers with the passage of time. In recent years, the topic got a peak attraction from researchers as a lot of articles have been published in the past few years, while Figure 5(b) depicts the comparison of published articles with respect to the modality.
Summary of dataset properties.
| Dataset_IDa | Dataset | Total samplesb | Featuresc |
|---|---|---|---|
| 01 | Cleveland (UCI), heart disease dataset | 303 | 76 raw features, 14 prominent features |
| 02 | StatLog heart disease dataset (UCI) | 150, (healthy: 150, patient: 120) | 13 distinct features |
| 03 | CHF database (chf2db) | 136, (healthy: 46, patient: 90) | 12 distinct features |
| 04 | MIT-BIH Normal Sinus Rhythm (NSR) database | 54, (male: 30, female: 24) | Sampling rate: 128 samples per second |
| 05 | Congestive heart failure database (BIDMC-CHF) | 15, (male: 11, female: 4) | Sampling rate: 500 samples per second |
| 06 | Fantasia database (FD) | 18, (male: 5, female: 13) | Sampling rate: 128 samples per second |
| 07 | Congestive Heart Failure RR Interval Database (CHF-RR) | 29 | Sampling rate: 500 samples per second |
| 08 | Normal Sinus Rhythm RR Interval Database (NSR-RR) | 40 | Sampling rate: 500 samples per second |
| 09 | Cleaveland(UCI), Hungarian heart disease dataset | 590 | 76 features |
| 10 | mARSupio database, Italy | 14616, (patients: 347) | 572 features |
| 11 | NHANES CVD dataset | 4434 | 23-65 features |
| 12 | MIMIC-II clinical database | 8059 | 32 features |
| 13 | Z-Alizadeh Sani dataset | 303 | 54 features |
| 14 | Heart disease dataset, Andhra Pradesh, India | N/A | 14 features |
| 15 | Physionet databases | 40, (male: 20, female:20) | 95300 segmented ECG |
| 16 | MITDB database, Physionet | 47 | 22 features |
| 17 | China Kadoorie Biobank (CKB) | 520000 | 86 features |
| 18 | MIT-BIR arrhythmia database | 47 | Sampling rate of 360 Hz |
| 19 | MIT/Beth Israel hospital (BIH), arrhythmia database | 4,000 ambulatory ECGs | 360 samples per second |
| 20 | PTB diagnostic ECG database | 52 healthy, 7 HCM, 8 DCM, and 148 MI subjects | Sampled at 1,000 Hz, 250 samples per second |
| 21 | Physikalisch-Technische Bundesanstalt diagnostic, ECG database | 200 (patients: 148, healthy: 52) | Sampling rate of 1000 Hz |
| 22 | 1st China Physio-logical Signal Challenge | 6877 | Sampled at 500 Hz |
| 23 | Mayo Clinic ECG laboratory | 180922, (patients: 116061, healthy: 64931) | Sampling 1500 Hz |
| 24 | Subrogated fragmented database (Sfrag-DB) + subrogated wide-fragmented database (SWfrag-DB) + fragmented database (FHCM-DB) + fibrosis database (HCM-DB) | 616 records | Sampling rate: 500 Hz |
| 25 | Collected at the University of Pennsylvania | 209 | 20 features |
| 26 | MICCAI 2017 challenge on Automated Cardiac Diagnosis | 100 | 567 features, 13 optimal features |
| 27 | STACOM 2015 challenge | 200 | 11 features |
| 28 | St.Francis Heart Hospital in Roslyn, New York | 200 | 3 feature |
| 29 | Nuclear Medicine Department | 288 | 10 features |
| 30 | Cedars-Sinai Medical Center, Los Angeles, CA | 713 | 13 features |
| 31 | MCG data | 800 | 2 features |
| 32 | Hospital Fernando Fonseca dataset | 496 | 80 features |
| 33 | Siemens Somatom sensation | 137 | N/ |
| 34 | ACS dataset (Mersin University Research and Training Hospital) | 228 | 6 features |
| 35 | University Hospital Arnau de Vilanova, Lleida, Spain | 56 | Image resolution: 8.5 pixels per mm |
| 36 | Sutter Palo Alto Medical Foundation | 58652000 | 2 attribute |
| 37 | LIDC-IDRI public dataset | 802 | NoGT transformation |
| 38 | SunnyBrook Cardiac Data (SCD) | 45 (male: 32 and female: 13) | Sampling: 30 frames per second |
| 39 | NSTEACS | 2302 patients | N/A |
| 40 | Hospital Universiti Kebangsaan Malaysia | 10 | N/A |
| 41 | Department of Medicine, University of Alabama at Birmingham | 109 | 9 features |
| 42 | UK Biobank | 9135867 | N/A |
| 43 | SPECT | 135 | 30 Fourier components |
| 44 | Ham-mersmith Hospitals | 1093 subjects | N/A |
| 45 | Cohn-Kanade dataset (CK+) | 400 | N/A |
| 46 | Sacred Heart Medical Center, Eugene | 215 | N/A |
| 47 | Sacred Heart Medical Center | 2619 | 50 features |
| 48 | AGES-I Dataset | 628 (male: 419, female: 209) | 11 Radiodensitometric features |
| 49 | Clinical Research Centre of Medical University of Bialystok, Poland | 67 | 63 features |
| 50 | Sugam Multispecialty Hospital, India | 507 patients (35 to 90 years of age) | 22 features |
| 51 | Germany | 15510 observations | N/A |
| 52 | Italian Local Health Authority (ASL) | 2722 | 06 features |
| 53 | ML repository | 3000 | 13 features |
| 54 | USA | 1000 | 15 echocardiographic variables |
| 55 | USA | 340 | 15 echocardiographic variables |
| 56 | Faisalabad Institute of Cardiology and at the Allied Hospital in Faisalabad (Punjab, Pakistan) | 299 | 13 features |
aDataset_ID is a reference number used for the identification of the dataset. bTotal samples represent the total number of records in a dataset. cFeature represents the total number of features a dataset consist.
Figure 6Demonstrates the number of samples and features in each dataset. X-axis of the graph represents the dataset ID while the Y-axis displays the number of samples and number of features. Blue bar in the figure depicts number of sample, and the orange line denotes number of features.
Figure 7Performance of clinical feature-based data modality based on ML models is depicts from this figure. The performance of each ML model is measured in term of accuracy along with number of samples in the dataset.
Figure 8Performance of ML models based on image modality is depicted in this figure. The performance of each ML model is measured in term of accuracy along with number of samples in the dataset.
Figure 9Performance of ECG modality-based ML models is depicted in this figure. The performance of each ML model is measured in term of accuracy along with number of samples in the dataset.
Summary of state-of-the-art research articles.
| P_ID | Author | Technique | Data | Feature selection | Data sampling | Conclusion |
|---|---|---|---|---|---|---|
| PI_106 | Ricciardi et al.,(2020) [ | Logistic regression + tree-based ML | AGES-I dataset + AGES-II dataset | Nonlinear trimodal regression analysis (NTRA) + RF |
| CVD (AUC: 91.4%) |
| PI_107 | Butun et.al. (2020) [ | Capsule networks (DNN) | Physionet database | Layer of CNN | Crossvalidation, 5-fold | Accuracy: 99.44% |
| PI_108 | Ramachandran et al., (2020) [ | Softmax discriminant classifier (SDC) and Gaussian mixture model classifier (GMM) | IEEE TMBE pulse oximeter dataset | Singular value decomposition (SVD) |
| Accuracy: 97.88% |
| PI_109 | Ghiasi et al. (2020) [ | Decision tree | Z-Alizadeh Sani CAD dataset | Classification and regression tree (CART) | Crossvalidation,10-fold | Accuracy: 100% |
| PI_110 | Joloudari et al. (2020) [ | RT + SVM + C5.0 | Z-Alizadeh Sani dataset | Random trees | Crossvalidation, 10-fold | Accuracy: 91.47% |
| PI_111 | Ali et al. (2019) [ |
| Cleveland (UCI), heart disease dataset |
| Matthews relation coefficient (MCC) | Accuracy: 92.22% |
| PI_112 | Ali et al.,(2020) [ | Mutual information based feature selection and deep neural network | Cleveland (UCI), heart disease dataset | Mutual information | Matthews relation coefficient (MCC) | Accuracy: 93.33% |
| PI_113 | Gjoreski et al. (2020) [ | Fully connected neural network (FCNN) | 947 subjects | openSMILE feature extraction tool | Crossvalidation 10-fold | Accuracy: 93.2% |
| PI_114 | Hussain et al. (2020) [ | DT + SVM + KNN | Physionet databases | Multimodal features | Crossvalidation 10-fold | Accuracy: 97% (SVM) |
| PI_115 | Aouabed et al. (2019) [ | Nested ensemble (NE) model | Cleveland (UCI), heart disease dataset | GA | Crossvalidation 10-fold | Accuracy: 98.34% |
| PI_116 | Liu et al. (2020) [ | Multiscale convolutional neural networks (CNN) | 1000 OCT images | Layers of CNN | Matthews relation coefficient (MCC) | Accuracy: 94.12% |
Figure 10Performance analysis of ML techniques based on datasets for automated diagnosis of heart failure. This figure shows the highest accuracy achieved by the clinical feature-based data modality-based methods while average accuracy of ECG modality-based methods is higher. As the number of samples in dataset is increased, the performance of the clinical feature-based data modality reduces. The image modality has shown lower performance as compared to the other two modalities.
Summary of clinical features-based data modality articles.
| P_ID | Author | Technique | Data | Feature selection | Data-sampling | Conclusion |
|---|---|---|---|---|---|---|
| PI_01 | Verma et al. (2016) [ | FURIA + MLR+ clustering + MLP | Cleveland heart disease dataset, IGMC data | CFS + PSO |
| Accuracy: 90.28% |
| PI_02 | Shah et al. (2017) [ | Radial basis function (RBF) kernel-based SVM | Cleveland heart disease dataset, 303 instances | PPCA+PA |
| Accuracy: 91.30% |
| PI_03 | Dwivedi (2018) [ | LR + KNN + ANN + NB + classification tree + vector machines (SVM) | StatLog heart disease dataset | N/A |
| Accuracy: 85% |
| PI_04 | Haq et al. (2018) [ | Logistic regression (LR) + KNN + ANN + NB + DT + SVM | Cleveland heart disease dataset, 303 instances | Relief + mRMR + LASSO |
| Accuracy: 89% |
| PI_05 | Guidi et al. (2014) [ | NN + SVM + fuzzy-genetic + regression tree + random forest | Cardiology Department at the St. Maria Nuova Hospital in Florence, Italy | N/A |
| Prediction accuracy: NN: 84.73%, SVM: 85.2%, FG: 85.9%, CART: 87.6%, RF: 85.6% |
| PI_06 | Pawlovsky (2018) [ | An ensemble based on distances for a kNN ( | Cleveland heart disease dataset, 303 instances | Distances(Mahalanobis) + voting scheme using weights |
| Accuracy: 84.83% |
| PI_07 |
| SVM+ bispectral analysis | CHF database (chf2db), Physionet database (nsr2db) | Bispectrum-related features + GA |
| Accuracy: 98.79% |
| PI_08 | Wang et al. (2019) [ | DNN + ensemble learning method | BIDMC-CHF, NSR-RR | Time, frequency domain, nonlinear features | Blindfold validation | Accuracy: 99.96% |
| PI_09 | Methaila et al. (2014) [ | NN + NB + DT + apriori (algorithm + MAFIA algorithm) | Cleveland heart disease dataset, 303 instances | Significance weightage calculation | Crossvalidation | Accuracy: 99.62% (DT) |
| PI_10 | jan et al. (2018) [ | Ensemble model + NB + ANN + weight+ random forest + SVM | Cleveland heart disease Hungarian dataset, 590 instances | N/A |
| NB: 93.22%accuracy |
| PI_11 | ali et al.(2019) [ | Optimized stacked support vector machines | Cleveland heart disease dataset, 303 instances | SVM with kernels including linear + RBF. | Matthews correlation coefficient (MCC) | Accuracy: 92.22% |
| PI_12 | Pecchia et al. (2010) [ | CART | CHF RR interval database | Short-term HRV analysis | MCC + ROC | Accuracy: 96.39%, |
| PI_13 | Kurnar (2012) [ | ANN + fuzzy logic | Cleveland heart disease dataset, 303 instances | Fuzzy resolution | Matthews correlation coefficient (MCC) | Accuracy: 91.83% |
| PI_14 | Kurnar (2012) [ | LR + RF + NB + GB + SVM | Cleveland, Hungarian, Switzerland | Data cleaning | Confusion matrix | Accuracy: 86% |
| PI_15 | Panicacci et al. (2019) [ | RF+ MACRO +SMOTE28 S | mARSupio database, Italy. 14616 subjects, 347 patient | N/A | F1-score, F2-score | Accuracy: 98.74% |
| PI_16 | Beulah et al. (2019) [ | Majority vote with NB, BN, RF, and MP | Cleveland heart disease dataset, 303 instances | Bagging, MV, stacking, boosting | F1-score, F2-score | Accuracy: 85.48% |
| PI_17 | Zikos et al. (2019) [ | Conditional probability +Bayesian | Medicare and Medicaid services CMS, 564,875 records | Clinical Classification Software (CSS) | N/A | Mortality rate: 2.61% |
| PI_18 | Daset et al. (2009) [ | Neural networks ensembles | Cleveland heart disease dataset, 303 instances | SAS base software 9.1.3 for diagnosing | MCC + ROC | Accuracy: 89.01% |
| PI_19 | Mohan et al. (2019) [ | Hybrid random forest with a linear model | Cleveland heart disease dataset, 303 instances | NB, GLM, LR, DL, DT, RF, GBT, and SVM | Confusion matrix | Accuracy: 88.4% |
| PI_20 | Kahramanli and Allahverdi (2008) [ | ANN + FNN | Cleveland heart disease dataset, 303 instances | N/A |
| Accuracy: 86.8% |
| PI_21 | Maji and Arora (2018) [ | Decision tree+C4.5 + ANN | UCI, dataset with 13 attributes and 270 instances | Pruning |
| Accuracy: 78.14% |
| PI_22 | Polat et al. (2005) [ | Fuzzy weighted + AI | Cleveland heart disease dataset, 303 instances | Fuzzy weighted preprocessing |
| Accuracy: 96.30% |
| PI_23 | Ster and Dobnikar (1996) [ | Neural networks | CAD:263 subjects, UCI: 297 | N/A |
| HD accuracy: 84.5% |
| PI_24 | Chen et al. (2017) [ | Deep learning with RR intervals | 72 healthy persons and 44 CHF patients | Autoencoder |
| Accuracy: 72.41 |
| PI_25 | Purushottam and Sharma (2015) [ | Decision trees | Cleveland heart disease dataset, 303 instances | C4.5 | Confusion matrix | Accuracy: 87% |
| PI_26 | Rajliwall et al. (2018) [ | ML-based models for cardiovascular risk prediction | NHANES dataset + Framingham heart study dataset | C4.5 | Fivefold crossvalidation | Accuracy (RF): 98.5% |
| PI_27 | Samuel et al. (2017) [ | ANN and Fuzzy_AHP | Cleveland heart disease dataset, 303 instances | Fuzzy_AHP | ROC | Accuracy: 91.10% |
| PI_28 | Venkatalakshmi and Shivsankar (2014) [ | Decision tree + naive Bayes (NB) | Cleveland heart disease dataset, 303 instances | Weka tool | Confusion matrix | NB: 85.03%accuracy |
| DT: 84.01%accuracy | ||||||
| PI_29 | Maio et al. (2017) [ | Random survival forest | MIMIC II clinical database, 8059 | N/A | OOB, C-statistics | Accuracy: 82.01% |
| PI_30 | Arabasadi et al. (2017) [ | Hybrid neural network-genetic algorithm | Z-Alizadeh Sani dataset | Genetic algorithm | 10-fold crossvalidation | Accuracy: 93.85% |
| PI_31 | Abdar et al. (2017) [ | N2Genetic optimizer + N2Genetic-nuSVM | Z-Alizadeh Sani dataset | GA + PSO | Crossvalidation 10-fold + F1-score | Accuracy: 93.08% |
| PI_32 | Mezzatesta et.al. (2019) [ | LR + KNN + CART + NB + SVM | HEMO clinical trial + IFC-CNR, Italy | Scaling techniques | Crossvalidation | LR: 80%, SVM: 80% |
| PI_33 | Lakshmi et al. (2016) [ | NB classifier + SVM | Cleveland heart dataset | Reprocessing | ROC | NB: 84.87%, accuracy |
| PI_34 | Bashir et al. (2019) [ | DT + NB + LR + SVM | Cleveland heart disease dataset, 303 instances | MRMR | 5-fold crossvalidation | Accuracy: 84.85% |
| PI_35 | javeed et al. (2019) [ | RSA + ORFA | Cleveland heart dataset | Hybrid Feature Subset | MCC | Accuracy: 93.33% |
Summary of image modality based research articles.
| P_ID | Author | Technique | Data | Feature selection | Data sampling | Conclusion |
|---|---|---|---|---|---|---|
| PI_36 | Nirschl et al. (2018) [ | CNN+ whole-slide images of H&E tissue | 209 patients | WND-CHARM |
| Accuracy: 97.4% |
| PI_37 | Cetin et al. (2017) [ | Radiomic approach + cardiac cine-MRI+ SVM | MICCAI 2017 challenge on automated cardiac diagnosis | Sequential forward feature selection (SFFS) | Crossvalidation | Accuracy: 98% |
| PI_38 | Bai et al. (2016) [ | SVM | STACOM 2015 dataset | ED + ES phases + PCA |
| Accuracy: 97.5% |
| PI_39 | Qazi et al. (2007) [ | SLFD | 200 cases | LFD | ROC + | Accuracy: 89.1% |
| PI_40 | Sajn and Kukar (2011) [ | Image processing + ML | 288 patients | PCA | ROC + | Accuracy: 81.3% |
| PI_41 | R.Arsanjani et al.,(2015) [ | Myocardial perfusion SPECT + ML | Cedars-Sinai Medical Center | LogitBoost | ROC + | Accuracy: 81% |
| PI_42 | Arsanjani et al. (2013) [ | SPECT for detection of CVD | Cedars-Sinai Medical Center | LogitBoost | ROC + | Accuracy: 87.2% |
| UPI_43 | Udovychenko et al. (2015) [ |
| MCG data | Variance, kurtosis, and skewness | MMC | Accuracy: 80-88% |
| PI_44 | Carneiro and Nascimento (2013) [ | Multiple dynamic models and deep learning architectures | Hospital Fernando Fonseca dataset, 496 images | PCA | HMD, AV, MAD, AVP | d_HMD: 83%accuracy |
| PI_45 | Zheng et al. (2008) [ | 3-D cardiac CT volumes using marginal space learning | Siemens Somatom Sensation | Steerable features |
| Mean error: 2.3% |
| PI_46 | Berikol et al. (2016) [ | SVM | Mersin University Research | N/A |
| Accuracy: 99.13% |
| PI_47 | Lekadir et al. (2016) [ | Plaque CNN architecture | Arnau de Vilanova | Deep learning CNN |
| Accuracy: 80% |
| PI_48 | Sundaresan et al. (2017) [ | Fully convolutional neural networks (FCN) | C.Ioannou | Rectified linear units (ReLUs) | ROC | Classification error rate: 23.48% |
| PI_49 | Choi et al. (2016) [ | Recurrent neural network | Sutter Palo Alto Medical Foundation | Gated recurrent unit GRU |
| Accuracy: 88.3% |
| PI_50 | Toth et al. (2018) [ | Convolutional neural networks | LIDC-IDRI public dataset | (ReLU) | Qualitatively + quantitatively | Error rate: 2.92% |
| PI_51 | Maraci et al. (2017) [ | Analysis of linear ultrasound videos to detect fetal presentation and heartbeat | Dataset of 323 predefined free-hand videos | PCA |
| Accuracy: 93.1% |
| PI_52 | Kurgan et al. (2001) [ | Automated cardic SPECT diagnosis | Database of features(DF) | CLIP algorithm | Qualitative and Quantitative test | Accuracy: 83.08% |
| PI_53 | Moreno et al. (2019) [ | Multiscale motion for cardiac disease prediction | SPECT images dataset | RF + CLIP algorithm | F1-score + | Accuracy: 51.06% |
| PI_54 | Liu et al. (2016) [ | ML prediction for cardiovascular | NSTEACS | PCA + MCE |
| Accuracy: 75% |
| PI_55 | Shin et al. (2016) [ | Deep convolutional neural networks for computer-aided detection | ImageNet dataset for CAD | CNN features of AlexNet pretrained + GoogleNet-RI | k-fold crossvalidation | Accuracy:95% |
| PI_56 | Hisham et al. (2011) [ | Grid independent technique | 10 patients | Grid the images | Linear correlation | Accuracy:80% |
| PI_57 | Allison et al. (2005) [ | ANN | LAD model | Crossvalidation | Accuracy: 92% | |
| PI_58 | Welikala et al. (2017) [ | Automated arteriole and venule classification using deep learning | UK Biobank | RGB and HSI color spaces | Crossvalidation | Accuracy: 86.97% |
| PI_59 | Curiale et al. (2017) [ | Deep learning network in cardiac MRI | Sunnybrook Cardiac Dataset (SCD) | RGB and HSI color spaces. | Dice's coefficient | Accuracy: 90% |
| PI_60 | Lindahl et al. (20197) [ | Interpretation of myocardial SPECT perfusion images using ANN | Sunnybrook Cardiac Dataset (SCD) | Two-dimensional Fourier trans form technique | ROC + | Sensitivity: 54.4% |
| PI_61 | Bai et al. (2015) [ | Statistical parametric mapping(SPM) + linear model | Hammersmith Hospitals | PCA | Dice overlap metric + mean surface distance | LV_cavity:0.950 ± 0.024 |
| PI_62 | Moreno et al. (2019) [ | Regional multiscale motion representation for cardiac disease prediction | Sunnybrook Cardiac Data (SCD) | Random Forest algorithm (RaF) | No. true positive over total of samples + F1-score | Accuracy: 77.83% |
| PI_63 | Gulsun et al. (2016) [ | Coronary centerline extraction + CNN | CTA datasets | CNN | Up-to-first-error evaluation | Sensitivity: 97% |
| PI_64 | Narula et al. (2016) [ | Automate morphological and functional assessments in 2D echocardiography | 77 ATH+ 62 HCM patients | Information gain (IG) algorithm |
| Sensitivity: 96% |
| PI_65 | Carneiro et al. (2011) [ | Deep learning architectures and derivative-based search methods | Cohn-Kanade dataset (CK+) | PCA | ROC + HMD, HDF, MAD, MSSD | d_AVP: 95% |
| PI_66 | Xu et al. (2012) [ | Transient ischemic dilation for coronary artery disease in quantitative analysis | Nuclear Medicine Department, Sacred Heart Medical Center, Eugene | Mibi-Mibi TID | Standard deviation (SD) | Sensitivity: 76% |
| PI_67 | Betancur et al. (2017) [ | ML | Sacred Heart Medical Center |
| Quantitative imaging analysis | Accuracy: 81% |
| PI_68 | Coenen et al. (2018) [ | ML + coronary computed tomographic | 351 patients | ROC | ML-based CT-FFR model | Accuracy: 73% |
| PI_69 | Wolterink et al. (2015) [ | CNN | 116 CT patients |
| ML-based CT-FFR model | Accuracy: 95% |
| PI_70 | Nakazato et al. (2010) [ | Perfusion imaging for detection of CAD | 142 patients | N/A |
| Accuracy: 95% |
Summary of ECG modality based research articles.
| P_ID | Author | Technique | Data | Feature selection | Data sampling | Conclusion |
|---|---|---|---|---|---|---|
| PI_71 | Zhao et al. (2019) [ | HRV + PTTV + SVM | 40 heart failure patients | RR + PTT |
| Accuracy: 90% |
| PI_72 | Sudarshan, et al. (2017) [ | DTCWT-based methodology | BIDMC + rhythm (NSR) + fantasia | ROC + |
| Accuracy: 99.86% |
| PI_73 | Acharya et al. (2017) [ | CNN + ECG signal | Physionet databases | Single CNN structure |
| Accuracy: 95.1% |
| PI_74 | Chen et al. (2019) [ | Two-step predictive framework for ECG | MITDB + Physionet | Daubechies wavelet + PCA |
| Accuracy: 96.26% |
| PI_75 | Shen et al. (2016) [ | Generative kernel density estimator | China Kadoorie biobank (CKB) | RR interval + P wave duration |
| One-class: 75.6% Acc |
| PI_76 | Acharya et al. (2016) [ | Automated diagnosis of serious arrhythmias | MIT-BIH A-fib + MIT-BIR arrhythmia | Approximate entropy | Confusion matrix | Accuracy: 96.3% % |
| PI_77 | Mathews et al. (2018) [ | Deep learning | MIT/Beth Israel Hospital (BIH) | Heartbeat interval features + RR intervals | MCC | Accuracy: 96.94% |
| PI_78 | Adam et al. (2018) [ | DWT + nonlinear features | PTB Diagnostic ECG Database | SFS | 10-fold crossvalidation | Accuracy: 99.27% |
| PI_79 | Tan et al. (2018) [ | Stacked convolutional + long short-term memory network | Physionet database | CNN | 10-fold crossvalidation | Accuracy: 99.85% |
| PI_80 | Sharma et al. (2018) [ | Two-band optimal biorthogonal filter bank (FB) | Physikalisch-Technische ECG database | Fuzzy entropy + signal-fractal-dimension+ Renyi entropy | 10-fold crossvalidation | Noisy data:99.62%, Acc |
| PI_81 | Puceret et.al. (2018) [ | Topological approach | MIT-BIH database | ADMT | 10-fold crossvalidation | Accuracy: 92.73% |
| PI_82 | Huang et.al. (2011) [ | Vector cardiogram-based classification | PTB database from Physionet | FFS + BFS | 10-fold crossvalidation | Accuracy: 96.96% |
| PI_83 | Zhou et.al. (2018) [ | Premature ventricular contraction + RNN | MIT-BIH arrhythmia database | Long short-term memory (LSTM) | Detection indexes | Accuracy: 96-99% |
| PI_84 | U.Satija et al.,(2018) [ | ECG signal quality assessment algorithms | MIT-BIH arrhythmia database | CEEMD + temporal features | 10-fold crossvalidation | Accuracy: 98.80% |
| PI_85 | Sudarshan et al. (2017) [ | Dual tree complex wavelet transform | PhysioBank MIT-BIH NSR + fantasia + BIDMC CHF | Statistical features extracted from 2 seconds of ECG signals | 10-fold crossvalidation | Accuracy: 99.86% |
| PI_86 | Diker et al. (2019) [ | Genetic algorithm wavelet kernel | Physikalisch-Technische Bundesanstalt diagnostic ECG database (PTBDB) | Discrete wavelet transform (DWT) | 10-fold crossvalidation | Accuracy: 95% |
| PI_87 | Acharya et al. (2017) [ | Deep CNN | PTBDB | N/A | 10-fold crossvalidation | Accuracy: 95.22% |
| PI_88 | Yao et al. (2020) [ | Attention-based time-incremental convolutional neural network (ATI-CNN) | 1 | CNN-LSTM, 1st layer | Matthews correlation coefficient(MCC) | Accuracy: 81.2% |
| PI_89 | Vafaie et al. (2014) [ | Genetic-fuzzy + dynamical model of ECG signals | Physionet database | IF, THEN rules | N/A | Accuracy: 93.34% |
| PI_90 | Sahoo et al. (2017) [ | Multiresolution wavelet transform + ECG classification | MIT-BIH arrhythmia database | Principal component analysis (PCA) | 10-fold crossvalidation | NN: 93.34% Acc |
| PI_91 | Dohare et al. (2018) [ | Myocardial infarction (MI) detection + SVM | Physikalisch-Technische Bundesanstalt (PTB) | Principal component analysis (PCA) | 10-fold crossvalidation | Accuracy: 96.66% |
| PI_92 | Attia et al. (2019) [ | (AI)-enabled electrocardiograph (ECG) using a convolutional neural network | Mayo Clinic ECG laboratory | Non-linear ReLU | ROC | Accuracy: 87% |
| PI_93 | Melgare et al. (2019) [ | ML approach + electrocardiographic fragmented | Sfrag-DB + SWfrag-DB + FHCM-DB + HCM-DB | Statistics + PCA | Matthews correlation coefficient (MCC) | Accuracy: 90% |
| PI_94 | Feng et al. (2019) [ | CNN + RNN | PTB database | CNN and LSTM | 10-fold crossvalidation | Accuracy: 95.4% |
| PI_95 | Raka et.al. (2017) [ | Time-based detection | SDDB + MIH-BIH database (NSRDB) | R-R interval duration | 5-fold crossvalidation | Accuracy: 83.9% |
| PI_96 | Kumar et al. (2017) [ | ECG beat with flexible analytic wavelet transform (FAWT) + LS-SVM | ECG database from the Physiobank | Sample entropy (SEnt) | 10-fold crossvalidation | Accuracy: 99.31% |
| PI_97 | Yin et al. (2019) [ | LS-SVM + multidomain electrocardiogram | MIT-BIH arrhythmia database | RR intevals, DWT, SampEn | 10-fold crossvalidation | Accuracy: 99.31% |
| PI_98 | Sahoo et al. (2017) [ | SVM + NN | MITBIH arrhythmia database | Multiresolution wavelet transform | 10-fold crossvalidation | Accuracy: 98.39% |
| PI_99 | Masetic et al. (2016) [ | Random forest | BIDMC CHF database (CHFDB) + NSRDB. | Autoregressive burg method | 10-fold cross validation | Accuracy: 100% |
| PI_100 | Isler and Kuntalp (2007) [ | Classical HRV indices with wavelet entropy measures | MIT/BIH database | Genetic algorithm | Crossvalidation | Accuracy: 91.33% |
| PI_101 | Bhurane et al. (2019) [ | Frequency localized filter banks | NSRDB +BIDMC | Feature extraction | 10-fold crossvalidation | Accuracy:99.66% |
| PI_102 | Orhan (2013) [ | Discretization method | NSRDB + BIDMC | EFiA-EWiT | 10-fold crossvalidation | Accuracy: 99.33% |
| PI_103 | Liao et al. (2015) [ | SVM | CHFDB + MIT-BIH NSR database NSRDB | QRS wave | Ratio (ACC/SV) | Accuracy: 97.27% |
| PI_104 | Yıldırım et.al. (2018) [ | Deep CNN | MIT-BIH arrhythmia database | PCANet algorithm | Confusion matrix of | Accuracy: 95.20% |
| PI_105 | Yang et al. (2018) [ | LS-SVM + PCA | MIT-BIH database | PCANet algorithm | 10-fold crossvalidation | Accuracy: 97.94% |
Figure 11The performance of ML models with respect to modality can be seen in this figure. SVM, RF, and DNN models have obtained higher accuracy as compared to the other ML models. Modalities of the ML models can also be seen in this figure.