| Literature DB >> 35747725 |
Shumaila Shehzadi1, Muhammad Abul Hassan2, Muhammad Rizwan3, Natalia Kryvinska4, Karovič Vincent4.
Abstract
Ischemic heart disease (IHD) causes discomfort or irritation in the chest. According to the World Health Organization, coronary heart disease is the major cause of mortality in Pakistan. Accurate model with the highest precision is necessary to avoid fatalities. Previously several models are tried with different attributes to enhance the detection accuracy but failed to do so. In this research study, an artificial approach to categorize the current stage of heart disease is carried out. Our model predicts a precise diagnosis of chronic diseases. The system is trained using a training dataset and then tested using a test dataset. Machine learning methods such as LR, NB, and RF are applied to forecast the development of a disease. Experimental outcomes of this research study have proven that our strategy has excelled other procedures with maximum accuracy of 99 percent for RF, 97 percent for NB, and 98 percent for LR. With such high accuracy, the number of deaths per year of ischemic heart disease will be slightly decreased.Entities:
Mesh:
Year: 2022 PMID: 35747725 PMCID: PMC9213158 DOI: 10.1155/2022/3823350
Source DB: PubMed Journal: Comput Intell Neurosci
Abbreviations and acronyms.
| Acronyms | Description |
|
| |
| WHO | World health organization |
| AIRS | Artificial immune recognition system |
| ML | Machine learning |
| LR | Logistic regression |
| RF | Random forest |
| NB | Naive Bayes |
| KNN |
|
| CHD | Coronary heart disease risk |
| NB | Naive Bayes |
| CANFIS | Coactive neuro-fuzzy inference method |
| TP | True positive |
| FN | False negative |
| TN | True negative |
| FP | False negative |
| IoT | Internet of things |
| ID3 | Iterative dichotomized 3 |
| CART | Classification and regression tree |
Machine learning approach to predict heart diseases.
| Reference | Method | Accuracy |
|
| ||
| [ | Fuzzy-expert system | 94% |
| [ | Svm, KNN, LG, RF, NB, & LSTM | 58%, 76%, 78%, 79%, 82% & 94% |
| [ | Hybrid model | 85.71% |
| [ | KNN with parameter weighting | 81.9% |
| [ | ANN & BPNN | 83% |
| [ | LR, RF, NB, GB & SVM | 86%, 80%, 84%, 84% & 79% |
| [ | NB, SVM & KNN | 75%, 45.11% & 50.44%, |
| [ | Fuzzy logic | 98% |
| [ | GUI and WAC | 81.51% |
| [ | KNN | 80% |
| [ | CNN-UDRP (KNN, NB) | 82%, |
| [ | GDB tree algorithm & RF | 96.75% & 97.98% |
| [ | CSHCP | 97% |
| [ | CA-SHR | 96.02% |
| [ | CervDetect | 93.6% |
| [ | Modified YOLOv5 | 96.50% |
| [ |
| 89.0% & 81.9% |
Dataset attribute, icon, detail, and range.
| Sr. no. | Attribute | Representative icon | Details | Range |
|
| ||||
| 1 | Age | Age | Patients age, in years | 29–71 |
| 2 | Sex | Sex | 0 = female; 1 = male | 0,1 |
| 3 | Chest pain | Cp | 4 types of chest pain (1—typical angina; 2—atypical angina; 3—nonanginal pain; 4—asymptomatic) | 0,1,2,3 |
| 4 | Rest blood pressure | Trestbps | Resting systolic blood pressure (in mm Hg on admission to the hospital) | 94–200 |
| 5 | Serum cholesterol | Chol | Serum cholesterol in mg/dl | 126–564 |
| 6 | Fasting blood sugar | Fbs | Fasting blood sugar >120 mg/dl (0—false; 1—true) | 0,1 |
| 7 | Rest electrocardiograph | Restecg | 0—normal; 1—having ST-T wave abnormality; 2—left ventricular hypertrophy | 0,1,2 |
| 8 | MaxHeart rate | Thalch | Maximum heart rate achieved | 71–202 |
| 9 | Exercise-induced angina | Exang | Exercise-induced angina (0—no; 1—yes) | 0,1 |
| 10 | ST depression | Oldpeak | ST depression induced by exercise relative to rest | 0–6.2 |
| 11 | Slope | Slope | Slope of the peak exercise ST segment (1—upsloping; 2—flat; 3—down sloping) | 1,2,3, |
| 12 | No. of vessels | Ca | No. of major vessels (0–3) colored by fluoroscopy | 0,1,2,3 |
| 13 | Thalassemia | Thal | Defect types; 3—normal; 6—fixed defect; 7—reversible defect | 0,1,2,3 |
| 14 | Num (class attribute) | Class | Diagnosis of heart disease status (0—nil risk; 1—low risk; 2—potential risk; 3—high risk; 4—very high risk) | 0,1 |
Figure 1Histogram representation of old peak attribute.
Figure 2Histogram representation of age attribute.
Figure 3Histogram representation of red blood pressure attribute.
Figure 4Confusion matrix representation of random forest and logistic regression.
Figure 5Confusion matrix representation of naïve Bayes.
Figure 6Normalization matrix representation of naïve Bayes.
TP, FN, FP, and TN rate of RF, LR, and NB machine learning algorithms.
| RF | LR | NB | |
|
| |||
| TP | 69 | 69 | 69 |
| FN | 0 | 0 | 0 |
| FP | 83 | 0 | 0 |
| TN | 0 | 83 | 83 |
Performance of evaluation matrix.
| Algorithms | Accuracy | Precision | Recall |
|
|
| ||||
| Naïve Bayes | 0.97 | 0.96 | 0.98 | 0.99 |
| Logistic regression | 0.98 | 0.99 | 0.99 | 0.99 |
| Random forest | 0.99 | 1.00 | 1.00 | 1.00 |