| Literature DB >> 35187099 |
Javad Hassannataj Joloudari1, Faezeh Azizi1, Mohammad Ali Nematollahi2, Roohallah Alizadehsani3, Edris Hassannatajjeloudari4, Issa Nodehi5, Amir Mosavi6,7,8,9,10.
Abstract
BACKGROUND: Coronary artery disease (CAD) is one of the crucial reasons for cardiovascular mortality in middle-aged people worldwide. The most typical tool is angiography for diagnosing CAD. The challenges of CAD diagnosis using angiography are costly and have side effects. One of the alternative solutions is the use of machine learning-based patterns for CAD diagnosis.Entities:
Keywords: coronary artery disease; diagnosis; genetic algorithm; machine learning; support vector machine
Year: 2022 PMID: 35187099 PMCID: PMC8855497 DOI: 10.3389/fcvm.2021.760178
Source DB: PubMed Journal: Front Cardiovasc Med ISSN: 2297-055X
Figure 1The proposed methodology framework.
Description of the Z-Alizadeh-Sani dataset (5).
|
|
|
| ||
|---|---|---|---|---|
|
| ||||
| Demographic | Age | (30–80) | 0.6 | |
| Demographic | Weight | (48–120) | 0.69 | |
| Demographic | Length | (140–188) | 0.54 | |
| Demographic | Sex | Male, Female | — | |
| Demographic | Body mass index (BMI) (Kb/m2) | (18-41) | 0.24 | |
| Demographic | Diabetes mellitus (DM) | (0, 1) | 0.03 | |
| Demographic | Hypertension (HTN) | (0, 1) | 0.03 | |
| Demographic | Current smoker | (0, 1) | 0.02 | |
| Demographic | Ex-smoker | (0, 1) | 0.01 | |
| Demographic | Family history (FH) | (0, 1) | 0.02 | |
| Demographic | Obesity | Yes if MBI > 25, No otherwise | — | |
| Demographic | Chronic renal failure (CRF) | Yes, No | — | |
| Demographic | Cerebrovascular accident (CVA) | Yes, No | — | |
| Demographic | Airway disease | Yes, No | — | |
| Demographic | Thyroid disease | Yes, No | — | |
| Demographic | Congestive heart failure (CHF) | Yes, No | — | |
| Demographic | Dyslipidemia (DLP) | Yes, No | — | |
| Symptom and examination | Blood pressure (BP) (mmHg) | (90–190) | 1.09 | |
| Symptom and examination | Pulse rate (PR) (ppm) | (50–110) | 0.51 | |
| Symptom and examination | Edema | (0, 1) | 0.01 | |
| Symptom and examination | Weak peripheral pulse | Yes, No | — | |
| Symptom and examination | Lung rales | Yes, No | — | |
| Symptom and examination | Systolic murmur | Yes, No | — | |
| Symptom and examination | Diastolic murmur | Yes, No | — | |
| Symptom and examination | Typical chest pain | (0, 1) | 0.03 | |
| Symptom and examination | Dyspnea | Yes, No | — | |
| Symptom and examination | Function class | 1, 2, 3, 4 | 0.06 | |
| Symptom and examination | Atypical | Yes, No | — | |
| Symptom and examination | Nonanginal chest pain | Yes, No | — | |
| Symptom and examination | Exertional chest pain | Yes, No | — | |
| Symptom and examination | Low TH Ang (low-threshold angina) | Yes, No | — | |
| ECG | Rhythm | Sin, AF | — | |
| ECG | Q wave | (0, 1) | 0.01 | |
| ECG | ST elevation | (0, 1) | 0.01 | |
| ECG | ST depression | (0, 1) | 0.02 | |
| ECG | T inversion | (0, 1) | 0.03 | |
| ECG | LVH (left ventricular hypertrophy) | Yes, No | — | |
| ECG | Poor R-wave progression | Yes, No | — | |
| Laboratory and echo | FBS (fasting blood sugar mg/dl) | (62–400) | 2.99 | |
| Laboratory and echo | Cr (creatine mg/dl) | (0.5–2.2) | 0.02 | |
| Laboratory and echo | TG (triglyceride mg/dl) | (37–1050) | 5.63 | |
| Laboratory and echo | LDL (low-density lipoprotein mg/dl) | (18–232) | 2.03 | |
| Laboratory and echo | HDL (high-density lipoprotein mg/dl) | (15–111) | 0.61 | |
| Laboratory and echo | BUN (blood urea nitrogen mg/dl) | (6–52) | 0.4 | |
| Laboratory and echo | ESR (erythrocyte sedimentation rate mm/h) | (1–90) | 0.92 | |
| Laboratory and echo | HB (hemoglobin g/dl) | (8.9–17.6) | 0.09 | |
| Laboratory and echo | K (potassium mEq/lit) | (3.0–6.6) | 0.03 | |
| Laboratory and echo | Na (sodium mEq/lit) | (128–156) | 0.22 | |
| Laboratory and echo | WBC (white blood cell cells/ml) | (3,700–18.000) | 138.67 | |
| Laboratory and echo | Lymph (lymphocyte %) | (7–60) | 0.57 | |
| Laboratory and echo | Neut (neutrophil %) | (32–89) | 0.59 | |
| Laboratory and echo | PLT (platelet 1,000/ml) | (25–742) | 3.49 | |
| Laboratory and echo | EF (ejection fraction %) | (15–60) | 0.51 | |
| Laboratory and echo | Region with RWMA | (0–4) | 0.07 | |
| Laboratory and echo | VHD (valvular heart disease) | Normal, Mild, Moderate, Severe | — | |
| Categorical | Target classes | CAD, Normal | — |
Std: Standard.
Figure 2Optimized hyperplane for two-dimensional space.
The parameters setting of the LSVM method.
|
|
|
|---|---|
| Kernel cache | 200 |
| C | 0.1 |
| Convergence epsilon | 0.001 |
| Max iterations | 100,000 |
| L pos | 1.0 |
| L neg | 1.0 |
| Balance cost | ✓ |
The parameters setting of the LIBSVM method.
|
|
|
|---|---|
| SVM type | C-SVC |
| Kernel type | RBF |
| Gamma | 1.0 |
| C | 0.1 |
| Cache size | 80 |
| Epsilon | 0.001 |
| Shrinking | ✓ |
The parameters setting of the SVM with the ANOVA method.
|
|
|
|---|---|
| Kernel type | ANOVA |
| Kernel gamma | 1.0 |
| Kernel degree | 2.0 |
| Kernel cache | 200 |
| C | 0.1 |
| Convergence epsilon | 0.001 |
| Max iterations | 100,000 |
| L pos | 1.0 |
| L neg | 1.0 |
| Balance cost | ✓ |
The parameters setting of the genetic optimization algorithm.
|
|
|
|---|---|
| Population size | 50 |
| Maximum number of generations | 10 |
| Normalize weights | ✓ |
| Plot generations | 10 |
| Draw dominated points | ✓ |
| Initial probability | 0.5 |
| Size of each chromosome = total of features | 55 |
| Probability of mutation | 1.0 |
| Crossover probability | 0.75 |
| Crossover type | Shuffle |
| Maximum fitness | Infinity |
| Fitness function | Accuracy, PPV, F-measure, sensitivity, specificity, and NPV |
Confusion matrix for diagnosis of CAD.
|
|
| |
|---|---|---|
|
|
| |
| Positive | True positive | False positive |
| Negative | False negative | True negative |
The comparison of the methods based on the Z-Alizadeh Sani dataset in this study.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| SVM with ANOVA | 85.13 | 80.24 | 72.01 | 69.19 | 91.62 | 68.97 | 89 |
| Linear SVM | 86.11 | 77.21 | 75.55 | 74.75 | 90.71 | 74.71 | 92.4 |
| LIBSVM with RBF | 84.78 | 76.24 | 72.38 | 70.03 | 90.74 | 70.11 | 82.1 |
|
|
|
|
|
|
|
|
|
Bold the best values of the proposed method (GSVMA) to show our method is the best.
Figure 3ROC curve for GSVMA method.
Figure 4ACC diagram for GSVMA method for generation 10.
Figure 5F-measure diagram for GSVMA method for generation 10.
Figure 6PPV diagram for GSVNA method for generation 10.
Figure 7Sensitivity diagram for GSVMA method for generation 10.
Figure 8Specificity diagram for GSVNA method for generation 10.
Figure 9NPV diagram for GSVNA method for generation 10.
Figure 10ROC curve for LSVM method.
Figure 12ROC curve for LIBSVM with RBF method.
Figure 13A comparison between the performance of methods based on the seven criteria.
Comparison between the proposed GSVMA method and the work of other researchers based on the original Z-Alizadeh Sani dataset.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| Qin et al. ( | Ensemble algorithm-multiple feature selection, (EA-MFS) | 10-FCV | 34 | 93.70 + −0.48% | NC |
| Cüvitoglu et al. ( | Artificial neural network | 10-FCV | 25 | 87.85 | 93 |
| Kiliç et al. ( | Artificial Bee colony | - | 16 | 89.4 | NC |
| Abdar et al. ( | NE-nu-SVC | 10-FCV | 16 | 94.66 | NC |
| Abdar et al. ( | N2Genetic-nuSVM | 10-FCV | 29 | 93.08 | NC |
| Kolukisa et al. ( | Ensemble classifier with Fisher linear discriminant analysis | 5, 10, 20-FCV | 55 | 92.07 | 95.3 |
| Tama et al. ( | Two-tier ensemble particle swarm optimization(PSO)-based feature selection | 10-FCV | 27 | 98.13 | 98.7 |
| Terrada et al. ( | (ANN) | - | 17 | 94 | 94 |
| Shahid et al. ( | Hybrid PSO-EmNN | 10-FCV | 22 | 88.34 | NC |
| Ghiasi et al. ( | CART | 10-FCV | 5 | 92.41 | NC |
| Dahal et al. ( | SVM | 10-FCV | 15 | 89.47 | 88.68 |
| Velusamy et al. ( | Weighted-average Voting ensemble (WAVEn) | 10-FCV | 5 | 98.97 | NC |
| Hassannataj et al. ( | Random trees | 10-FCV | 40 | 91.47 | 96.7 |
| The Proposed Method | GSVMA | 10-FCV | 31 | 89.45 | 100 |
NC, Not considered.
The used LSVM method for CAD diagnosis.
| 1: |
| 2: Data preprocessing: Transforming nominal records to numerical records and normalizing between zero and one |
| 3: Divide the data using a 10-fold crossvalidation technique |
| 4: Choose the value of the parameters setting based on |
| 5: |
| 6: Employ LSVM train for each record |
| 7: Generate LSVM model |
| 8: Employ LSVM classify for testing records |
| 9: |
| 10: |
| 11: |
The used LIBSVM method for CAD diagnosis.
| 1: |
| 2: Data preprocessing: Transforming nominal records to numerical records and normalizing between zero and one |
| 3: Divide the data using a 10-fold crossvalidation technique |
| 4: Choose the value of the parameters setting based on |
| 5: |
| 6: Employ LIBSVM train for each record |
| 7: Generate LIBSVM model |
| 8: Employ LIBSVM classify for testing records |
| 9: |
| 10: |
| 11: |
The used SVM with ANOVA method for CAD diagnosis.
| 1: |
| 2: Data preprocessing: Transforming nominal records to numerical records and normalizing between zero and one |
| 3: Divide the data using a 10-fold cross-validation technique |
| 4: Choose the value of the parameters setting based on |
| 5: |
| 6: Employ SVM with ANOVA train for each record |
| 7: Generate SVM with ANOVA model |
| 8: Employ SVM with ANOVA classify for testing records |
| 9: |
| 10: |
| 11: |
The used GSVMA method for CAD diagnosis.
| 1: |
| 2: Data preprocessing: Transforming nominal records to numerical records and normalizing between zero and one |
| 3: Divide the data using a 10-fold crossvalidation technique |
| 4: Generating a new population of members randomly (the population size= 50 and the maximum number of generations=10) based on |
| 5: |
| 6: set a random of (0,1) value to gene(i) of the chromosome |
| 7: |
| 8: Determining the fitness function based on evaluation criteria |
| 9: select a member of a generation (the selection scheme: uniform, roulette wheel) based on formula (16) |
| 10: Performing crossover (crossover type: shuffle) |
| 11: Performing mutation action (Probability of mutation=1.0) |
| 12: |
| 13: Execute the most important features using Genetic optimization algorithm |
| 14: Feeding feathers SVM with ANOVA model based on algorithm 3 |
| 15: Employ SVMA classify for testing records |
| 16: |
| 17: |
| 18: |